Message ID | 20230118061256.2689-4-dakr@redhat.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | DRM GPUVA Manager & Nouveau VM_BIND UAPI | expand |
On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: > This adds the infrastructure for a manager implementation to keep track > of GPU virtual address (VA) mappings. "Add infrastructure for ..." > + * Analogue to drm_gpuva_sm_map_ops_create() drm_gpuva_sm_unmap_ops_create() > + * provides drivers a the list of operations to be executed in order to unmap > + * a range of GPU VA space. The logic behind this functions is way simpler > + * though: For all existent mappings enclosed by the given range unmap > + * operations are created. For mappings which are only partically located within > + * the given range, remap operations are created such that those mappings are > + * split up and re-mapped partically. "Analogous to ..." > + * > + * The following paragraph depicts the basic constellations of existent GPU VA > + * mappings, a newly requested mapping and the resulting mappings as implemented > + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations > + * of those constellations. > + * > + * :: > + * > + * 1) Existent mapping is kept. > + * ---------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * req: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 2) Existent mapping is replaced. > + * -------------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * req: |-----------| (bo_offset=m) > + * > + * 0 a 1 > + * new: |-----------| (bo_offset=m) > + * > + * > + * 3) Existent mapping is replaced. > + * -------------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 0 b 1 > + * req: |-----------| (bo_offset=n) > + * > + * 0 b 1 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 4) Existent mapping is replaced. > + * -------------------------------- > + * > + * 0 a 1 > + * old: |-----| (bo_offset=n) > + * > + * 0 a 2 > + * req: |-----------| (bo_offset=n) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or bo_offset. > + * > + * > + * 5) Existent mapping is split. > + * ----------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 0 b 1 > + * req: |-----| (bo_offset=n) > + * > + * 0 b 1 a' 2 > + * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > + * > + * > + * 6) Existent mapping is kept. > + * ---------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * req: |-----| (bo_offset=n) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 7) Existent mapping is split. > + * ----------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 b 2 > + * req: |-----| (bo_offset=m) > + * > + * 0 a 1 b 2 > + * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) > + * > + * > + * 8) Existent mapping is kept. > + * ---------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 a 2 > + * req: |-----| (bo_offset=n+1) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 9) Existent mapping is split. > + * ----------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 b 3 > + * req: |-----------| (bo_offset=m) > + * > + * 0 a 1 b 3 > + * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) > + * > + * > + * 10) Existent mapping is merged. > + * ------------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 a 3 > + * req: |-----------| (bo_offset=n+1) > + * > + * 0 a 3 > + * new: |-----------------| (bo_offset=n) > + * > + * > + * 11) Existent mapping is split. > + * ------------------------------ > + * > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > + * > + * 1 b 2 > + * req: |-----| (bo_offset=m) > + * > + * 0 a 1 b 2 a' 3 > + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) > + * > + * > + * 12) Existent mapping is kept. > + * ----------------------------- > + * > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > + * > + * 1 a 2 > + * req: |-----| (bo_offset=n+1) > + * > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > + * > + * > + * 13) Existent mapping is replaced. > + * --------------------------------- > + * > + * 1 a 2 > + * old: |-----| (bo_offset=n) > + * > + * 0 a 2 > + * req: |-----------| (bo_offset=n) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > + * > + * > + * 14) Existent mapping is replaced. > + * --------------------------------- > + * > + * 1 a 2 > + * old: |-----| (bo_offset=n) > + * > + * 0 a 3 > + * req: |----------------| (bo_offset=n) > + * > + * 0 a 3 > + * new: |----------------| (bo_offset=n) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > + * > + * > + * 15) Existent mapping is split. > + * ------------------------------ > + * > + * 1 a 3 > + * old: |-----------| (bo_offset=n) > + * > + * 0 b 2 > + * req: |-----------| (bo_offset=m) > + * > + * 0 b 2 a' 3 > + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) > + * > + * > + * 16) Existent mappings are merged. > + * --------------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 2 a 3 > + * old': |-----------| (bo_offset=n+2) > + * > + * 1 a 2 > + * req: |-----------| (bo_offset=n+1) > + * > + * a > + * new: |----------------------------------| (bo_offset=n) > + */ Factor out lists from the big code block above: ---- >8 ---- diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c index e665f642689d03..411c0aa80bfa1f 100644 --- a/drivers/gpu/drm/drm_gpuva_mgr.c +++ b/drivers/gpu/drm/drm_gpuva_mgr.c @@ -129,15 +129,14 @@ * the given range, remap operations are created such that those mappings are * split up and re-mapped partically. * - * The following paragraph depicts the basic constellations of existent GPU VA + * The following diagram depicts the basic relationships of existent GPU VA * mappings, a newly requested mapping and the resulting mappings as implemented - * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations - * of those constellations. + * by drm_gpuva_sm_map_ops_create() - it doesn't cover any arbitrary + * combinations of these. * - * :: - * - * 1) Existent mapping is kept. - * ---------------------------- + * 1) Existent mapping is kept. + * + * :: * * 0 a 1 * old: |-----------| (bo_offset=n) @@ -149,8 +148,9 @@ * new: |-----------| (bo_offset=n) * * - * 2) Existent mapping is replaced. - * -------------------------------- + * 2) Existent mapping is replaced. + * + * :: * * 0 a 1 * old: |-----------| (bo_offset=n) @@ -162,8 +162,9 @@ * new: |-----------| (bo_offset=m) * * - * 3) Existent mapping is replaced. - * -------------------------------- + * 3) Existent mapping is replaced. + * + * :: * * 0 a 1 * old: |-----------| (bo_offset=n) @@ -175,8 +176,9 @@ * new: |-----------| (bo_offset=n) * * - * 4) Existent mapping is replaced. - * -------------------------------- + * 4) Existent mapping is replaced. + * + * :: * * 0 a 1 * old: |-----| (bo_offset=n) @@ -187,12 +189,14 @@ * 0 a 2 * new: |-----------| (bo_offset=n) * - * Note: We expect to see the same result for a request with a different bo - * and/or bo_offset. + * .. note:: + * We expect to see the same result for a request with a different bo + * and/or bo_offset. * * - * 5) Existent mapping is split. - * ----------------------------- + * 5) Existent mapping is split. + * + * :: * * 0 a 2 * old: |-----------| (bo_offset=n) @@ -203,12 +207,14 @@ * 0 b 1 a' 2 * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) * - * Note: We expect to see the same result for a request with a different bo - * and/or non-contiguous bo_offset. + * .. note:: + * We expect to see the same result for a request with a different bo + * and/or non-contiguous bo_offset. * * - * 6) Existent mapping is kept. - * ---------------------------- + * 6) Existent mapping is kept. + * + * :: * * 0 a 2 * old: |-----------| (bo_offset=n) @@ -220,8 +226,9 @@ * new: |-----------| (bo_offset=n) * * - * 7) Existent mapping is split. - * ----------------------------- + * 7) Existent mapping is split. + * + * :: * * 0 a 2 * old: |-----------| (bo_offset=n) @@ -233,8 +240,9 @@ * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) * * - * 8) Existent mapping is kept. - * ---------------------------- + * 8) Existent mapping is kept. + * + * :: * * 0 a 2 * old: |-----------| (bo_offset=n) @@ -246,8 +254,9 @@ * new: |-----------| (bo_offset=n) * * - * 9) Existent mapping is split. - * ----------------------------- + * 9) Existent mapping is split. + * + * :: * * 0 a 2 * old: |-----------| (bo_offset=n) @@ -259,104 +268,113 @@ * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) * * - * 10) Existent mapping is merged. - * ------------------------------- + * 10) Existent mapping is merged. * - * 0 a 2 - * old: |-----------| (bo_offset=n) + * :: * - * 1 a 3 - * req: |-----------| (bo_offset=n+1) + * 0 a 2 + * old: |-----------| (bo_offset=n) * - * 0 a 3 - * new: |-----------------| (bo_offset=n) + * 1 a 3 + * req: |-----------| (bo_offset=n+1) + * + * 0 a 3 + * new: |-----------------| (bo_offset=n) * * - * 11) Existent mapping is split. - * ------------------------------ + * 11) Existent mapping is split. * - * 0 a 3 - * old: |-----------------| (bo_offset=n) + * :: * - * 1 b 2 - * req: |-----| (bo_offset=m) + * 0 a 3 + * old: |-----------------| (bo_offset=n) * - * 0 a 1 b 2 a' 3 - * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) + * 1 b 2 + * req: |-----| (bo_offset=m) + * + * 0 a 1 b 2 a' 3 + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) * * - * 12) Existent mapping is kept. - * ----------------------------- + * 12) Existent mapping is kept. * - * 0 a 3 - * old: |-----------------| (bo_offset=n) + * :: * - * 1 a 2 - * req: |-----| (bo_offset=n+1) + * 0 a 3 + * old: |-----------------| (bo_offset=n) * - * 0 a 3 - * old: |-----------------| (bo_offset=n) + * 1 a 2 + * req: |-----| (bo_offset=n+1) + * + * 0 a 3 + * old: |-----------------| (bo_offset=n) * * - * 13) Existent mapping is replaced. - * --------------------------------- + * 13) Existent mapping is replaced. * - * 1 a 2 - * old: |-----| (bo_offset=n) + * :: * - * 0 a 2 - * req: |-----------| (bo_offset=n) + * 1 a 2 + * old: |-----| (bo_offset=n) * - * 0 a 2 - * new: |-----------| (bo_offset=n) + * 0 a 2 + * req: |-----------| (bo_offset=n) * - * Note: We expect to see the same result for a request with a different bo - * and/or non-contiguous bo_offset. + * 0 a 2 + * new: |-----------| (bo_offset=n) + * + * .. note:: + * We expect to see the same result for a request with a different bo + * and/or non-contiguous bo_offset. * * - * 14) Existent mapping is replaced. - * --------------------------------- + * 14) Existent mapping is replaced. * - * 1 a 2 - * old: |-----| (bo_offset=n) + * :: * - * 0 a 3 - * req: |----------------| (bo_offset=n) + * 1 a 2 + * old: |-----| (bo_offset=n) * - * 0 a 3 - * new: |----------------| (bo_offset=n) + * 0 a 3 + * req: |----------------| (bo_offset=n) * - * Note: We expect to see the same result for a request with a different bo - * and/or non-contiguous bo_offset. + * 0 a 3 + * new: |----------------| (bo_offset=n) + * + * .. note:: + * We expect to see the same result for a request with a different bo + * and/or non-contiguous bo_offset. * * - * 15) Existent mapping is split. - * ------------------------------ + * 15) Existent mapping is split. * - * 1 a 3 - * old: |-----------| (bo_offset=n) + * :: * - * 0 b 2 - * req: |-----------| (bo_offset=m) + * 1 a 3 + * old: |-----------| (bo_offset=n) * - * 0 b 2 a' 3 - * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) + * 0 b 2 + * req: |-----------| (bo_offset=m) + * + * 0 b 2 a' 3 + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) * * - * 16) Existent mappings are merged. - * --------------------------------- + * 16) Existent mappings are merged. * - * 0 a 1 - * old: |-----------| (bo_offset=n) + * :: * - * 2 a 3 - * old': |-----------| (bo_offset=n+2) + * 0 a 1 + * old: |-----------| (bo_offset=n) * - * 1 a 2 - * req: |-----------| (bo_offset=n+1) + * 2 a 3 + * old': |-----------| (bo_offset=n+2) * - * a - * new: |----------------------------------| (bo_offset=n) + * 1 a 2 + * req: |-----------| (bo_offset=n+1) + * + * a + * new: |----------------------------------| (bo_offset=n) */ /** However, the relationship scenario descriptions are too generic (different diagrams are described by the same text). Please rewrite them, taking into account bo_offset values in each scenario. Thanks.
On 1/19/23 05:14, Bagas Sanjaya wrote: > On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: >> This adds the infrastructure for a manager implementation to keep track >> of GPU virtual address (VA) mappings. > > "Add infrastructure for ..." > >> + * Analogue to drm_gpuva_sm_map_ops_create() drm_gpuva_sm_unmap_ops_create() >> + * provides drivers a the list of operations to be executed in order to unmap >> + * a range of GPU VA space. The logic behind this functions is way simpler >> + * though: For all existent mappings enclosed by the given range unmap >> + * operations are created. For mappings which are only partically located within >> + * the given range, remap operations are created such that those mappings are >> + * split up and re-mapped partically. > > "Analogous to ..." > >> + * >> + * The following paragraph depicts the basic constellations of existent GPU VA >> + * mappings, a newly requested mapping and the resulting mappings as implemented >> + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations >> + * of those constellations. >> + * >> + * :: >> + * >> + * 1) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 2) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 a 1 >> + * new: |-----------| (bo_offset=m) >> + * >> + * >> + * 3) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 4) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or bo_offset. >> + * >> + * >> + * 5) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * req: |-----| (bo_offset=n) >> + * >> + * 0 b 1 a' 2 >> + * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 6) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 7) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 b 2 >> + * req: |-----| (bo_offset=m) >> + * >> + * 0 a 1 b 2 >> + * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + * 8) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 a 2 >> + * req: |-----| (bo_offset=n+1) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 9) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 b 3 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 a 1 b 3 >> + * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + * 10) Existent mapping is merged. >> + * ------------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 a 3 >> + * req: |-----------| (bo_offset=n+1) >> + * >> + * 0 a 3 >> + * new: |-----------------| (bo_offset=n) >> + * >> + * >> + * 11) Existent mapping is split. >> + * ------------------------------ >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * 1 b 2 >> + * req: |-----| (bo_offset=m) >> + * >> + * 0 a 1 b 2 a' 3 >> + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) >> + * >> + * >> + * 12) Existent mapping is kept. >> + * ----------------------------- >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * 1 a 2 >> + * req: |-----| (bo_offset=n+1) >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * >> + * 13) Existent mapping is replaced. >> + * --------------------------------- >> + * >> + * 1 a 2 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 14) Existent mapping is replaced. >> + * --------------------------------- >> + * >> + * 1 a 2 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 3 >> + * req: |----------------| (bo_offset=n) >> + * >> + * 0 a 3 >> + * new: |----------------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 15) Existent mapping is split. >> + * ------------------------------ >> + * >> + * 1 a 3 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 2 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 b 2 a' 3 >> + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) >> + * >> + * >> + * 16) Existent mappings are merged. >> + * --------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 2 a 3 >> + * old': |-----------| (bo_offset=n+2) >> + * >> + * 1 a 2 >> + * req: |-----------| (bo_offset=n+1) >> + * >> + * a >> + * new: |----------------------------------| (bo_offset=n) >> + */ > > Factor out lists from the big code block above: > > ---- >8 ---- > Thanks for your feedback and the patch, it's highly appreciated. - Danilo > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c > index e665f642689d03..411c0aa80bfa1f 100644 > --- a/drivers/gpu/drm/drm_gpuva_mgr.c > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c > @@ -129,15 +129,14 @@ > * the given range, remap operations are created such that those mappings are > * split up and re-mapped partically. > * > - * The following paragraph depicts the basic constellations of existent GPU VA > + * The following diagram depicts the basic relationships of existent GPU VA > * mappings, a newly requested mapping and the resulting mappings as implemented > - * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations > - * of those constellations. > + * by drm_gpuva_sm_map_ops_create() - it doesn't cover any arbitrary > + * combinations of these. > * > - * :: > - * > - * 1) Existent mapping is kept. > - * ---------------------------- > + * 1) Existent mapping is kept. > + * > + * :: > * > * 0 a 1 > * old: |-----------| (bo_offset=n) > @@ -149,8 +148,9 @@ > * new: |-----------| (bo_offset=n) > * > * > - * 2) Existent mapping is replaced. > - * -------------------------------- > + * 2) Existent mapping is replaced. > + * > + * :: > * > * 0 a 1 > * old: |-----------| (bo_offset=n) > @@ -162,8 +162,9 @@ > * new: |-----------| (bo_offset=m) > * > * > - * 3) Existent mapping is replaced. > - * -------------------------------- > + * 3) Existent mapping is replaced. > + * > + * :: > * > * 0 a 1 > * old: |-----------| (bo_offset=n) > @@ -175,8 +176,9 @@ > * new: |-----------| (bo_offset=n) > * > * > - * 4) Existent mapping is replaced. > - * -------------------------------- > + * 4) Existent mapping is replaced. > + * > + * :: > * > * 0 a 1 > * old: |-----| (bo_offset=n) > @@ -187,12 +189,14 @@ > * 0 a 2 > * new: |-----------| (bo_offset=n) > * > - * Note: We expect to see the same result for a request with a different bo > - * and/or bo_offset. > + * .. note:: > + * We expect to see the same result for a request with a different bo > + * and/or bo_offset. > * > * > - * 5) Existent mapping is split. > - * ----------------------------- > + * 5) Existent mapping is split. > + * > + * :: > * > * 0 a 2 > * old: |-----------| (bo_offset=n) > @@ -203,12 +207,14 @@ > * 0 b 1 a' 2 > * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) > * > - * Note: We expect to see the same result for a request with a different bo > - * and/or non-contiguous bo_offset. > + * .. note:: > + * We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > * > * > - * 6) Existent mapping is kept. > - * ---------------------------- > + * 6) Existent mapping is kept. > + * > + * :: > * > * 0 a 2 > * old: |-----------| (bo_offset=n) > @@ -220,8 +226,9 @@ > * new: |-----------| (bo_offset=n) > * > * > - * 7) Existent mapping is split. > - * ----------------------------- > + * 7) Existent mapping is split. > + * > + * :: > * > * 0 a 2 > * old: |-----------| (bo_offset=n) > @@ -233,8 +240,9 @@ > * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) > * > * > - * 8) Existent mapping is kept. > - * ---------------------------- > + * 8) Existent mapping is kept. > + * > + * :: > * > * 0 a 2 > * old: |-----------| (bo_offset=n) > @@ -246,8 +254,9 @@ > * new: |-----------| (bo_offset=n) > * > * > - * 9) Existent mapping is split. > - * ----------------------------- > + * 9) Existent mapping is split. > + * > + * :: > * > * 0 a 2 > * old: |-----------| (bo_offset=n) > @@ -259,104 +268,113 @@ > * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) > * > * > - * 10) Existent mapping is merged. > - * ------------------------------- > + * 10) Existent mapping is merged. > * > - * 0 a 2 > - * old: |-----------| (bo_offset=n) > + * :: > * > - * 1 a 3 > - * req: |-----------| (bo_offset=n+1) > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > * > - * 0 a 3 > - * new: |-----------------| (bo_offset=n) > + * 1 a 3 > + * req: |-----------| (bo_offset=n+1) > + * > + * 0 a 3 > + * new: |-----------------| (bo_offset=n) > * > * > - * 11) Existent mapping is split. > - * ------------------------------ > + * 11) Existent mapping is split. > * > - * 0 a 3 > - * old: |-----------------| (bo_offset=n) > + * :: > * > - * 1 b 2 > - * req: |-----| (bo_offset=m) > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > * > - * 0 a 1 b 2 a' 3 > - * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) > + * 1 b 2 > + * req: |-----| (bo_offset=m) > + * > + * 0 a 1 b 2 a' 3 > + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) > * > * > - * 12) Existent mapping is kept. > - * ----------------------------- > + * 12) Existent mapping is kept. > * > - * 0 a 3 > - * old: |-----------------| (bo_offset=n) > + * :: > * > - * 1 a 2 > - * req: |-----| (bo_offset=n+1) > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > * > - * 0 a 3 > - * old: |-----------------| (bo_offset=n) > + * 1 a 2 > + * req: |-----| (bo_offset=n+1) > + * > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > * > * > - * 13) Existent mapping is replaced. > - * --------------------------------- > + * 13) Existent mapping is replaced. > * > - * 1 a 2 > - * old: |-----| (bo_offset=n) > + * :: > * > - * 0 a 2 > - * req: |-----------| (bo_offset=n) > + * 1 a 2 > + * old: |-----| (bo_offset=n) > * > - * 0 a 2 > - * new: |-----------| (bo_offset=n) > + * 0 a 2 > + * req: |-----------| (bo_offset=n) > * > - * Note: We expect to see the same result for a request with a different bo > - * and/or non-contiguous bo_offset. > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * .. note:: > + * We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > * > * > - * 14) Existent mapping is replaced. > - * --------------------------------- > + * 14) Existent mapping is replaced. > * > - * 1 a 2 > - * old: |-----| (bo_offset=n) > + * :: > * > - * 0 a 3 > - * req: |----------------| (bo_offset=n) > + * 1 a 2 > + * old: |-----| (bo_offset=n) > * > - * 0 a 3 > - * new: |----------------| (bo_offset=n) > + * 0 a 3 > + * req: |----------------| (bo_offset=n) > * > - * Note: We expect to see the same result for a request with a different bo > - * and/or non-contiguous bo_offset. > + * 0 a 3 > + * new: |----------------| (bo_offset=n) > + * > + * .. note:: > + * We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > * > * > - * 15) Existent mapping is split. > - * ------------------------------ > + * 15) Existent mapping is split. > * > - * 1 a 3 > - * old: |-----------| (bo_offset=n) > + * :: > * > - * 0 b 2 > - * req: |-----------| (bo_offset=m) > + * 1 a 3 > + * old: |-----------| (bo_offset=n) > * > - * 0 b 2 a' 3 > - * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) > + * 0 b 2 > + * req: |-----------| (bo_offset=m) > + * > + * 0 b 2 a' 3 > + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) > * > * > - * 16) Existent mappings are merged. > - * --------------------------------- > + * 16) Existent mappings are merged. > * > - * 0 a 1 > - * old: |-----------| (bo_offset=n) > + * :: > * > - * 2 a 3 > - * old': |-----------| (bo_offset=n+2) > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > * > - * 1 a 2 > - * req: |-----------| (bo_offset=n+1) > + * 2 a 3 > + * old': |-----------| (bo_offset=n+2) > * > - * a > - * new: |----------------------------------| (bo_offset=n) > + * 1 a 2 > + * req: |-----------| (bo_offset=n+1) > + * > + * a > + * new: |----------------------------------| (bo_offset=n) > */ > > /** > > However, the relationship scenario descriptions are too generic (different > diagrams are described by the same text). Please rewrite them, taking into > account bo_offset values in each scenario. > > Thanks. >
On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: >This adds the infrastructure for a manager implementation to keep track >of GPU virtual address (VA) mappings. > >New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers >start implementing, allow userspace applications to request multiple and >arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is >intended to serve the following purposes in this context. > >1) Provide a dedicated range allocator to track GPU VA allocations and > mappings, making use of the drm_mm range allocator. > >2) Generically connect GPU VA mappings to their backing buffers, in > particular DRM GEM objects. > >3) Provide a common implementation to perform more complex mapping > operations on the GPU VA space. In particular splitting and merging > of GPU VA mappings, e.g. for intersecting mapping requests or partial > unmap requests. > >Idea-suggested-by: Dave Airlie <airlied@redhat.com> >Signed-off-by: Danilo Krummrich <dakr@redhat.com> >--- > Documentation/gpu/drm-mm.rst | 31 + > drivers/gpu/drm/Makefile | 1 + > drivers/gpu/drm/drm_gem.c | 3 + > drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ > include/drm/drm_drv.h | 6 + > include/drm/drm_gem.h | 75 ++ > include/drm/drm_gpuva_mgr.h | 527 ++++++++++++ > 7 files changed, 1966 insertions(+) > create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c > create mode 100644 include/drm/drm_gpuva_mgr.h > >diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst >index a52e6f4117d6..c9f120cfe730 100644 >--- a/Documentation/gpu/drm-mm.rst >+++ b/Documentation/gpu/drm-mm.rst >@@ -466,6 +466,37 @@ DRM MM Range Allocator Function References > .. kernel-doc:: drivers/gpu/drm/drm_mm.c > :export: > >+DRM GPU VA Manager >+================== >+ >+Overview >+-------- >+ >+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >+ :doc: Overview >+ >+Split and Merge >+--------------- >+ >+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >+ :doc: Split and Merge >+ >+Locking >+------- >+ >+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >+ :doc: Locking >+ >+ >+DRM GPU VA Manager Function References >+-------------------------------------- >+ >+.. kernel-doc:: include/drm/drm_gpuva_mgr.h >+ :internal: >+ >+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >+ :export: >+ > DRM Buddy Allocator > =================== > >diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile >index 4fe190aee584..de2ffca3b6e4 100644 >--- a/drivers/gpu/drm/Makefile >+++ b/drivers/gpu/drm/Makefile >@@ -45,6 +45,7 @@ drm-y := \ > drm_vblank.o \ > drm_vblank_work.o \ > drm_vma_manager.o \ >+ drm_gpuva_mgr.o \ > drm_writeback.o > drm-$(CONFIG_DRM_LEGACY) += \ > drm_agpsupport.o \ >diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c >index 59a0bb5ebd85..65115fe88627 100644 >--- a/drivers/gpu/drm/drm_gem.c >+++ b/drivers/gpu/drm/drm_gem.c >@@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device *dev, > if (!obj->resv) > obj->resv = &obj->_resv; > >+ if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) >+ drm_gem_gpuva_init(obj); >+ > drm_vma_node_reset(&obj->vma_node); > INIT_LIST_HEAD(&obj->lru_node); > } >diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c >new file mode 100644 >index 000000000000..e665f642689d >--- /dev/null >+++ b/drivers/gpu/drm/drm_gpuva_mgr.c >@@ -0,0 +1,1323 @@ >+// SPDX-License-Identifier: GPL-2.0 >+/* >+ * Copyright (c) 2022 Red Hat. >+ * >+ * Permission is hereby granted, free of charge, to any person obtaining a >+ * copy of this software and associated documentation files (the "Software"), >+ * to deal in the Software without restriction, including without limitation >+ * the rights to use, copy, modify, merge, publish, distribute, sublicense, >+ * and/or sell copies of the Software, and to permit persons to whom the >+ * Software is furnished to do so, subject to the following conditions: >+ * >+ * The above copyright notice and this permission notice shall be included in >+ * all copies or substantial portions of the Software. >+ * >+ * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR >+ * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, >+ * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL >+ * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR >+ * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >+ * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >+ * OTHER DEALINGS IN THE SOFTWARE. >+ * >+ * Authors: >+ * Danilo Krummrich <dakr@redhat.com> >+ * >+ */ >+ >+#include <drm/drm_gem.h> >+#include <drm/drm_gpuva_mgr.h> >+ >+/** >+ * DOC: Overview >+ * >+ * The DRM GPU VA Manager, represented by struct drm_gpuva_manager keeps track >+ * of a GPU's virtual address (VA) space and manages the corresponding virtual >+ * mappings represented by &drm_gpuva objects. It also keeps track of the >+ * mapping's backing &drm_gem_object buffers. >+ * >+ * &drm_gem_object buffers maintain a list (and a corresponding list lock) of >+ * &drm_gpuva objects representing all existent GPU VA mappings using this >+ * &drm_gem_object as backing buffer. >+ * >+ * A GPU VA mapping can only be created within a previously allocated >+ * &drm_gpuva_region, which represents a reserved portion of the GPU VA space. >+ * GPU VA mappings are not allowed to span over a &drm_gpuva_region's boundary. >+ * >+ * GPU VA regions can also be flagged as sparse, which allows drivers to create >+ * sparse mappings for a whole GPU VA region in order to support Vulkan >+ * 'Sparse Resources'. >+ * So, is the sparse resources the only usecase for the VA region abstraction in uapi? Or are there any other potential usecases (other than kernel reserved space)? >+ * The GPU VA manager internally uses the &drm_mm range allocator to manage the >+ * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's virtual address >+ * space. >+ * >+ * Besides the GPU VA space regions (&drm_gpuva_region) allocated by a driver >+ * the &drm_gpuva_manager contains a special region representing the portion of >+ * VA space reserved by the kernel. This node is initialized together with the >+ * GPU VA manager instance and removed when the GPU VA manager is destroyed. >+ * >+ * In a typical application drivers would embed struct drm_gpuva_manager, >+ * struct drm_gpuva_region and struct drm_gpuva within their own driver >+ * specific structures, there won't be any memory allocations of it's own nor >+ * memory allocations of &drm_gpuva or &drm_gpuva_region entries. >+ */ >+ >+/** >+ * DOC: Split and Merge >+ * >+ * The DRM GPU VA manager also provides an algorithm implementing splitting and >+ * merging of existent GPU VA mappings with the ones that are requested to be >+ * mapped or unmapped. This feature is required by the Vulkan API to implement >+ * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to this as >+ * VM BIND. Looks like the split and merge is only based on address continuity (virtual and physical). How about potential page table attributes specified during VM_BIND for a mapping such as readonly, caching attributes, atomicity etc? Thanks, Niranjana >+ * >+ * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a list of map, unmap >+ * and remap operations for a given newly requested mapping. This list >+ * represents the set of operations to execute in order to integrate the new >+ * mapping cleanly into the current state of the GPU VA space. >+ * >+ * Depending on how the new GPU VA mapping intersects with the existent mappings >+ * of the GPU VA space the &drm_gpuva_ops contain an arbitrary amount of unmap >+ * operations, a maximum of two remap operations and a single map operation. >+ * The set of operations can also be empty if no operation is required, e.g. if >+ * the requested mapping already exists in the exact same way. >+ * >+ * The single map operation, if existent, represents the original map operation >+ * requested by the caller. Please note that this operation might be altered >+ * comparing it with the original map operation, e.g. because it was merged with >+ * an already existent mapping. Hence, drivers must execute this map operation >+ * instead of the original one they passed to drm_gpuva_sm_map_ops_create(). >+ * >+ * &drm_gpuva_op_unmap contains a 'keep' field, which indicates whether the >+ * &drm_gpuva to unmap is physically contiguous with the original mapping >+ * request. Optionally, if 'keep' is set, drivers may keep the actual page table >+ * entries for this &drm_gpuva, adding the missing page table entries only and >+ * update the &drm_gpuva_manager's view of things accordingly. >+ * >+ * Drivers may do the same optimization, namely delta page table updates, also >+ * for remap operations. This is possible since &drm_gpuva_op_remap consists of >+ * one unmap operation and one or two map operations, such that drivers can >+ * derive the page table update delta accordingly. >+ * >+ * Note that there can't be more than two existent mappings to split up, one at >+ * the beginning and one at the end of the new mapping, hence there is a >+ * maximum of two remap operations. >+ * >+ * Generally, the DRM GPU VA manager never merges mappings across the >+ * boundaries of &drm_gpuva_regions. This is the case since merging between >+ * GPU VA regions would result into unmap and map operations to be issued for >+ * both regions involved although the original mapping request was referred to >+ * one specific GPU VA region only. Since the other GPU VA region, the one not >+ * explicitly requested to be altered, might be in use by the GPU, we are not >+ * allowed to issue any map/unmap operations for this region. >+ * >+ * Note that before calling drm_gpuva_sm_map_ops_create() again with another >+ * mapping request it is necessary to update the &drm_gpuva_manager's view of >+ * the GPU VA space. The previously obtained operations must be either fully >+ * processed or completely abandoned. >+ * >+ * To update the &drm_gpuva_manager's view of the GPU VA space >+ * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >+ * drm_gpuva_destroy_unlocked() should be used. >+ * >+ * Analogue to drm_gpuva_sm_map_ops_create() drm_gpuva_sm_unmap_ops_create() >+ * provides drivers a the list of operations to be executed in order to unmap >+ * a range of GPU VA space. The logic behind this functions is way simpler >+ * though: For all existent mappings enclosed by the given range unmap >+ * operations are created. For mappings which are only partically located within >+ * the given range, remap operations are created such that those mappings are >+ * split up and re-mapped partically. >+ * >+ * The following paragraph depicts the basic constellations of existent GPU VA >+ * mappings, a newly requested mapping and the resulting mappings as implemented >+ * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations >+ * of those constellations. >+ * >+ * :: >+ * >+ * 1) Existent mapping is kept. >+ * ---------------------------- >+ * >+ * 0 a 1 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 0 a 1 >+ * req: |-----------| (bo_offset=n) >+ * >+ * 0 a 1 >+ * new: |-----------| (bo_offset=n) >+ * >+ * >+ * 2) Existent mapping is replaced. >+ * -------------------------------- >+ * >+ * 0 a 1 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 0 a 1 >+ * req: |-----------| (bo_offset=m) >+ * >+ * 0 a 1 >+ * new: |-----------| (bo_offset=m) >+ * >+ * >+ * 3) Existent mapping is replaced. >+ * -------------------------------- >+ * >+ * 0 a 1 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 0 b 1 >+ * req: |-----------| (bo_offset=n) >+ * >+ * 0 b 1 >+ * new: |-----------| (bo_offset=n) >+ * >+ * >+ * 4) Existent mapping is replaced. >+ * -------------------------------- >+ * >+ * 0 a 1 >+ * old: |-----| (bo_offset=n) >+ * >+ * 0 a 2 >+ * req: |-----------| (bo_offset=n) >+ * >+ * 0 a 2 >+ * new: |-----------| (bo_offset=n) >+ * >+ * Note: We expect to see the same result for a request with a different bo >+ * and/or bo_offset. >+ * >+ * >+ * 5) Existent mapping is split. >+ * ----------------------------- >+ * >+ * 0 a 2 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 0 b 1 >+ * req: |-----| (bo_offset=n) >+ * >+ * 0 b 1 a' 2 >+ * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) >+ * >+ * Note: We expect to see the same result for a request with a different bo >+ * and/or non-contiguous bo_offset. >+ * >+ * >+ * 6) Existent mapping is kept. >+ * ---------------------------- >+ * >+ * 0 a 2 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 0 a 1 >+ * req: |-----| (bo_offset=n) >+ * >+ * 0 a 2 >+ * new: |-----------| (bo_offset=n) >+ * >+ * >+ * 7) Existent mapping is split. >+ * ----------------------------- >+ * >+ * 0 a 2 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 1 b 2 >+ * req: |-----| (bo_offset=m) >+ * >+ * 0 a 1 b 2 >+ * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) >+ * >+ * >+ * 8) Existent mapping is kept. >+ * ---------------------------- >+ * >+ * 0 a 2 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 1 a 2 >+ * req: |-----| (bo_offset=n+1) >+ * >+ * 0 a 2 >+ * new: |-----------| (bo_offset=n) >+ * >+ * >+ * 9) Existent mapping is split. >+ * ----------------------------- >+ * >+ * 0 a 2 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 1 b 3 >+ * req: |-----------| (bo_offset=m) >+ * >+ * 0 a 1 b 3 >+ * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) >+ * >+ * >+ * 10) Existent mapping is merged. >+ * ------------------------------- >+ * >+ * 0 a 2 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 1 a 3 >+ * req: |-----------| (bo_offset=n+1) >+ * >+ * 0 a 3 >+ * new: |-----------------| (bo_offset=n) >+ * >+ * >+ * 11) Existent mapping is split. >+ * ------------------------------ >+ * >+ * 0 a 3 >+ * old: |-----------------| (bo_offset=n) >+ * >+ * 1 b 2 >+ * req: |-----| (bo_offset=m) >+ * >+ * 0 a 1 b 2 a' 3 >+ * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) >+ * >+ * >+ * 12) Existent mapping is kept. >+ * ----------------------------- >+ * >+ * 0 a 3 >+ * old: |-----------------| (bo_offset=n) >+ * >+ * 1 a 2 >+ * req: |-----| (bo_offset=n+1) >+ * >+ * 0 a 3 >+ * old: |-----------------| (bo_offset=n) >+ * >+ * >+ * 13) Existent mapping is replaced. >+ * --------------------------------- >+ * >+ * 1 a 2 >+ * old: |-----| (bo_offset=n) >+ * >+ * 0 a 2 >+ * req: |-----------| (bo_offset=n) >+ * >+ * 0 a 2 >+ * new: |-----------| (bo_offset=n) >+ * >+ * Note: We expect to see the same result for a request with a different bo >+ * and/or non-contiguous bo_offset. >+ * >+ * >+ * 14) Existent mapping is replaced. >+ * --------------------------------- >+ * >+ * 1 a 2 >+ * old: |-----| (bo_offset=n) >+ * >+ * 0 a 3 >+ * req: |----------------| (bo_offset=n) >+ * >+ * 0 a 3 >+ * new: |----------------| (bo_offset=n) >+ * >+ * Note: We expect to see the same result for a request with a different bo >+ * and/or non-contiguous bo_offset. >+ * >+ * >+ * 15) Existent mapping is split. >+ * ------------------------------ >+ * >+ * 1 a 3 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 0 b 2 >+ * req: |-----------| (bo_offset=m) >+ * >+ * 0 b 2 a' 3 >+ * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) >+ * >+ * >+ * 16) Existent mappings are merged. >+ * --------------------------------- >+ * >+ * 0 a 1 >+ * old: |-----------| (bo_offset=n) >+ * >+ * 2 a 3 >+ * old': |-----------| (bo_offset=n+2) >+ * >+ * 1 a 2 >+ * req: |-----------| (bo_offset=n+1) >+ * >+ * a >+ * new: |----------------------------------| (bo_offset=n) >+ */ >+ >+/** >+ * DOC: Locking >+ * >+ * Generally, the GPU VA manager does not take care of locking itself, it is >+ * the drivers responsibility to take care about locking. Drivers might want to >+ * protect the following operations: inserting, destroying and iterating >+ * &drm_gpuva and &drm_gpuva_region objects as well as generating split and merge >+ * operations. >+ * >+ * The GPU VA manager does take care of the locking of the backing >+ * &drm_gem_object buffers GPU VA lists though, unless the provided functions >+ * documentation claims otherwise. >+ */ >+ >+/** >+ * drm_gpuva_manager_init - initialize a &drm_gpuva_manager >+ * @mgr: pointer to the &drm_gpuva_manager to initialize >+ * @name: the name of the GPU VA space >+ * @start_offset: the start offset of the GPU VA space >+ * @range: the size of the GPU VA space >+ * @reserve_offset: the start of the kernel reserved GPU VA area >+ * @reserve_range: the size of the kernel reserved GPU VA area >+ * >+ * The &drm_gpuva_manager must be initialized with this function before use. >+ * >+ * Note that @mgr must be cleared to 0 before calling this function. The given >+ * &name is expected to be managed by the surrounding driver structures. >+ */ >+void >+drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >+ const char *name, >+ u64 start_offset, u64 range, >+ u64 reserve_offset, u64 reserve_range) >+{ >+ drm_mm_init(&mgr->va_mm, start_offset, range); >+ drm_mm_init(&mgr->region_mm, start_offset, range); >+ >+ mgr->mm_start = start_offset; >+ mgr->mm_range = range; >+ >+ mgr->name = name ? name : "unknown"; >+ >+ memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); >+ mgr->kernel_alloc_node.start = reserve_offset; >+ mgr->kernel_alloc_node.size = reserve_range; >+ drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); >+} >+EXPORT_SYMBOL(drm_gpuva_manager_init); >+ >+/** >+ * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager >+ * @mgr: pointer to the &drm_gpuva_manager to clean up >+ * >+ * Note that it is a bug to call this function on a manager that still >+ * holds GPU VA mappings. >+ */ >+void >+drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) >+{ >+ mgr->name = NULL; >+ drm_mm_remove_node(&mgr->kernel_alloc_node); >+ drm_mm_takedown(&mgr->va_mm); >+ drm_mm_takedown(&mgr->region_mm); >+} >+EXPORT_SYMBOL(drm_gpuva_manager_destroy); >+ >+static struct drm_gpuva_region * >+drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >+{ >+ struct drm_gpuva_region *reg; >+ >+ /* Find the VA region the requested range is strictly enclosed by. */ >+ drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { >+ if (reg->node.start <= addr && >+ reg->node.start + reg->node.size >= addr + range && >+ ®->node != &mgr->kernel_alloc_node) >+ return reg; >+ } >+ >+ return NULL; >+} >+ >+static bool >+drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >+{ >+ return !!drm_gpuva_in_region(mgr, addr, range); >+} >+ >+/** >+ * drm_gpuva_insert - insert a &drm_gpuva >+ * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >+ * @va: the &drm_gpuva to insert >+ * @addr: the start address of the GPU VA >+ * @range: the range of the GPU VA >+ * >+ * Insert a &drm_gpuva with a given address and range into a >+ * &drm_gpuva_manager. >+ * >+ * The function assumes the caller does not hold the &drm_gem_object's >+ * GPU VA list mutex. >+ * >+ * Returns: 0 on success, negative error code on failure. >+ */ >+int >+drm_gpuva_insert(struct drm_gpuva_manager *mgr, >+ struct drm_gpuva *va, >+ u64 addr, u64 range) >+{ >+ struct drm_gpuva_region *reg; >+ int ret; >+ >+ if (!va->gem.obj) >+ return -EINVAL; >+ >+ reg = drm_gpuva_in_region(mgr, addr, range); >+ if (!reg) >+ return -EINVAL; >+ >+ ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, >+ range, 0, >+ 0, addr, >+ addr + range, >+ DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); >+ if (ret) >+ return ret; >+ >+ va->mgr = mgr; >+ va->region = reg; >+ >+ return 0; >+} >+EXPORT_SYMBOL(drm_gpuva_insert); >+ >+/** >+ * drm_gpuva_link_locked - link a &drm_gpuva >+ * @va: the &drm_gpuva to link >+ * >+ * This adds the given &va to the GPU VA list of the &drm_gem_object it is >+ * associated with. >+ * >+ * The function assumes the caller already holds the &drm_gem_object's >+ * GPU VA list mutex. >+ */ >+void >+drm_gpuva_link_locked(struct drm_gpuva *va) >+{ >+ lockdep_assert_held(&va->gem.obj->gpuva.mutex); >+ list_add_tail(&va->head, &va->gem.obj->gpuva.list); >+} >+EXPORT_SYMBOL(drm_gpuva_link_locked); >+ >+/** >+ * drm_gpuva_link_unlocked - unlink a &drm_gpuva >+ * @va: the &drm_gpuva to unlink >+ * >+ * This adds the given &va to the GPU VA list of the &drm_gem_object it is >+ * associated with. >+ * >+ * The function assumes the caller does not hold the &drm_gem_object's >+ * GPU VA list mutex. >+ */ >+void >+drm_gpuva_link_unlocked(struct drm_gpuva *va) >+{ >+ drm_gem_gpuva_lock(va->gem.obj); >+ drm_gpuva_link_locked(va); >+ drm_gem_gpuva_unlock(va->gem.obj); >+} >+EXPORT_SYMBOL(drm_gpuva_link_unlocked); >+ >+/** >+ * drm_gpuva_unlink_locked - unlink a &drm_gpuva >+ * @va: the &drm_gpuva to unlink >+ * >+ * This removes the given &va from the GPU VA list of the &drm_gem_object it is >+ * associated with. >+ * >+ * The function assumes the caller already holds the &drm_gem_object's >+ * GPU VA list mutex. >+ */ >+void >+drm_gpuva_unlink_locked(struct drm_gpuva *va) >+{ >+ lockdep_assert_held(&va->gem.obj->gpuva.mutex); >+ list_del_init(&va->head); >+} >+EXPORT_SYMBOL(drm_gpuva_unlink_locked); >+ >+/** >+ * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva >+ * @va: the &drm_gpuva to unlink >+ * >+ * This removes the given &va from the GPU VA list of the &drm_gem_object it is >+ * associated with. >+ * >+ * The function assumes the caller does not hold the &drm_gem_object's >+ * GPU VA list mutex. >+ */ >+void >+drm_gpuva_unlink_unlocked(struct drm_gpuva *va) >+{ >+ drm_gem_gpuva_lock(va->gem.obj); >+ drm_gpuva_unlink_locked(va); >+ drm_gem_gpuva_unlock(va->gem.obj); >+} >+EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); >+ >+/** >+ * drm_gpuva_destroy_locked - destroy a &drm_gpuva >+ * @va: the &drm_gpuva to destroy >+ * >+ * This removes the given &va from GPU VA list of the &drm_gem_object it is >+ * associated with and removes it from the underlaying range allocator. >+ * >+ * The function assumes the caller already holds the &drm_gem_object's >+ * GPU VA list mutex. >+ */ >+void >+drm_gpuva_destroy_locked(struct drm_gpuva *va) >+{ >+ lockdep_assert_held(&va->gem.obj->gpuva.mutex); >+ >+ list_del(&va->head); >+ drm_mm_remove_node(&va->node); >+} >+EXPORT_SYMBOL(drm_gpuva_destroy_locked); >+ >+/** >+ * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva >+ * @va: the &drm_gpuva to destroy >+ * >+ * This removes the given &va from GPU VA list of the &drm_gem_object it is >+ * associated with and removes it from the underlaying range allocator. >+ * >+ * The function assumes the caller does not hold the &drm_gem_object's >+ * GPU VA list mutex. >+ */ >+void >+drm_gpuva_destroy_unlocked(struct drm_gpuva *va) >+{ >+ drm_gem_gpuva_lock(va->gem.obj); >+ list_del(&va->head); >+ drm_gem_gpuva_unlock(va->gem.obj); >+ >+ drm_mm_remove_node(&va->node); >+} >+EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); >+ >+/** >+ * drm_gpuva_find - find a &drm_gpuva >+ * @mgr: the &drm_gpuva_manager to search in >+ * @addr: the &drm_gpuvas address >+ * @range: the &drm_gpuvas range >+ * >+ * Returns: the &drm_gpuva at a given &addr and with a given &range >+ */ >+struct drm_gpuva * >+drm_gpuva_find(struct drm_gpuva_manager *mgr, >+ u64 addr, u64 range) >+{ >+ struct drm_gpuva *va; >+ >+ drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { >+ if (va->node.start == addr && >+ va->node.size == range) >+ return va; >+ } >+ >+ return NULL; >+} >+EXPORT_SYMBOL(drm_gpuva_find); >+ >+/** >+ * drm_gpuva_find_prev - find the &drm_gpuva before the given address >+ * @mgr: the &drm_gpuva_manager to search in >+ * @start: the given GPU VA's start address >+ * >+ * Find the adjacent &drm_gpuva before the GPU VA with given &start address. >+ * >+ * Note that if there is any free space between the GPU VA mappings no mapping >+ * is returned. >+ * >+ * Returns: a pointer to the found &drm_gpuva or NULL if none was found >+ */ >+struct drm_gpuva * >+drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) >+{ >+ struct drm_mm_node *node; >+ >+ if (start <= mgr->mm_start || >+ start > (mgr->mm_start + mgr->mm_range)) >+ return NULL; >+ >+ node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); >+ if (node == &mgr->va_mm.head_node) >+ return NULL; >+ >+ return (struct drm_gpuva *)node; >+} >+EXPORT_SYMBOL(drm_gpuva_find_prev); >+ >+/** >+ * drm_gpuva_find_next - find the &drm_gpuva after the given address >+ * @mgr: the &drm_gpuva_manager to search in >+ * @end: the given GPU VA's end address >+ * >+ * Find the adjacent &drm_gpuva after the GPU VA with given &end address. >+ * >+ * Note that if there is any free space between the GPU VA mappings no mapping >+ * is returned. >+ * >+ * Returns: a pointer to the found &drm_gpuva or NULL if none was found >+ */ >+struct drm_gpuva * >+drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) >+{ >+ struct drm_mm_node *node; >+ >+ if (end < mgr->mm_start || >+ end >= (mgr->mm_start + mgr->mm_range)) >+ return NULL; >+ >+ node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); >+ if (node == &mgr->va_mm.head_node) >+ return NULL; >+ >+ return (struct drm_gpuva *)node; >+} >+EXPORT_SYMBOL(drm_gpuva_find_next); >+ >+/** >+ * drm_gpuva_region_insert - insert a &drm_gpuva_region >+ * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >+ * @reg: the &drm_gpuva_region to insert >+ * @addr: the start address of the GPU VA >+ * @range: the range of the GPU VA >+ * >+ * Insert a &drm_gpuva_region with a given address and range into a >+ * &drm_gpuva_manager. >+ * >+ * Returns: 0 on success, negative error code on failure. >+ */ >+int >+drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >+ struct drm_gpuva_region *reg, >+ u64 addr, u64 range) >+{ >+ int ret; >+ >+ ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, >+ range, 0, >+ 0, addr, >+ addr + range, >+ DRM_MM_INSERT_LOW| >+ DRM_MM_INSERT_ONCE); >+ if (ret) >+ return ret; >+ >+ reg->mgr = mgr; >+ >+ return 0; >+} >+EXPORT_SYMBOL(drm_gpuva_region_insert); >+ >+/** >+ * drm_gpuva_region_destroy - destroy a &drm_gpuva_region >+ * @mgr: the &drm_gpuva_manager holding the region >+ * @reg: the &drm_gpuva to destroy >+ * >+ * This removes the given ® from the underlaying range allocator. >+ */ >+void >+drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >+ struct drm_gpuva_region *reg) >+{ >+ struct drm_gpuva *va; >+ >+ drm_gpuva_for_each_va_in_range(va, mgr, >+ reg->node.start, >+ reg->node.size) { >+ WARN(1, "GPU VA region must be empty on destroy.\n"); >+ return; >+ } >+ >+ if (®->node == &mgr->kernel_alloc_node) { >+ WARN(1, "Can't destroy kernel reserved region.\n"); >+ return; >+ } >+ >+ drm_mm_remove_node(®->node); >+} >+EXPORT_SYMBOL(drm_gpuva_region_destroy); >+ >+/** >+ * drm_gpuva_region_find - find a &drm_gpuva_region >+ * @mgr: the &drm_gpuva_manager to search in >+ * @addr: the &drm_gpuva_regions address >+ * @range: the &drm_gpuva_regions range >+ * >+ * Returns: the &drm_gpuva_region at a given &addr and with a given &range >+ */ >+struct drm_gpuva_region * >+drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >+ u64 addr, u64 range) >+{ >+ struct drm_gpuva_region *reg; >+ >+ drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) >+ if (reg->node.start == addr && >+ reg->node.size == range) >+ return reg; >+ >+ return NULL; >+} >+EXPORT_SYMBOL(drm_gpuva_region_find); >+ >+static int >+gpuva_op_map_new(struct drm_gpuva_op **pop, >+ u64 addr, u64 range, >+ struct drm_gem_object *obj, u64 offset) >+{ >+ struct drm_gpuva_op *op; >+ >+ op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >+ if (!op) >+ return -ENOMEM; >+ >+ op->op = DRM_GPUVA_OP_MAP; >+ op->map.va.addr = addr; >+ op->map.va.range = range; >+ op->map.gem.obj = obj; >+ op->map.gem.offset = offset; >+ >+ return 0; >+} >+ >+static int >+gpuva_op_remap_new(struct drm_gpuva_op **pop, >+ struct drm_gpuva_op_map *prev, >+ struct drm_gpuva_op_map *next, >+ struct drm_gpuva_op_unmap *unmap) >+{ >+ struct drm_gpuva_op *op; >+ struct drm_gpuva_op_remap *r; >+ >+ op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >+ if (!op) >+ return -ENOMEM; >+ >+ op->op = DRM_GPUVA_OP_REMAP; >+ r = &op->remap; >+ >+ if (prev) { >+ r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); >+ if (!r->prev) >+ goto err_free_op; >+ } >+ >+ if (next) { >+ r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); >+ if (!r->next) >+ goto err_free_prev; >+ } >+ >+ r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); >+ if (!r->unmap) >+ goto err_free_next; >+ >+ return 0; >+ >+err_free_next: >+ if (next) >+ kfree(r->next); >+err_free_prev: >+ if (prev) >+ kfree(r->prev); >+err_free_op: >+ kfree(op); >+ *pop = NULL; >+ >+ return -ENOMEM; >+} >+ >+static int >+gpuva_op_unmap_new(struct drm_gpuva_op **pop, >+ struct drm_gpuva *va, bool merge) >+{ >+ struct drm_gpuva_op *op; >+ >+ op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >+ if (!op) >+ return -ENOMEM; >+ >+ op->op = DRM_GPUVA_OP_UNMAP; >+ op->unmap.va = va; >+ op->unmap.keep = merge; >+ >+ return 0; >+} >+ >+#define op_map_new_to_list(_ops, _addr, _range, \ >+ _obj, _offset) \ >+do { \ >+ struct drm_gpuva_op *op; \ >+ \ >+ ret = gpuva_op_map_new(&op, _addr, _range, \ >+ _obj, _offset); \ >+ if (ret) \ >+ goto err_free_ops; \ >+ \ >+ list_add_tail(&op->entry, _ops); \ >+} while (0) >+ >+#define op_remap_new_to_list(_ops, _prev, _next, \ >+ _unmap) \ >+do { \ >+ struct drm_gpuva_op *op; \ >+ \ >+ ret = gpuva_op_remap_new(&op, _prev, _next, \ >+ _unmap); \ >+ if (ret) \ >+ goto err_free_ops; \ >+ \ >+ list_add_tail(&op->entry, _ops); \ >+} while (0) >+ >+#define op_unmap_new_to_list(_ops, _gpuva, _merge) \ >+do { \ >+ struct drm_gpuva_op *op; \ >+ \ >+ ret = gpuva_op_unmap_new(&op, _gpuva, _merge); \ >+ if (ret) \ >+ goto err_free_ops; \ >+ \ >+ list_add_tail(&op->entry, _ops); \ >+} while (0) >+ >+/** >+ * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split and merge >+ * @mgr: the &drm_gpuva_manager representing the GPU VA space >+ * @req_addr: the start address of the new mapping >+ * @req_range: the range of the new mapping >+ * @req_obj: the &drm_gem_object to map >+ * @req_offset: the offset within the &drm_gem_object >+ * >+ * This function creates a list of operations to perform splitting and merging >+ * of existent mapping(s) with the newly requested one. >+ * >+ * The list can be iterated with &drm_gpuva_for_each_op and must be processed >+ * in the given order. It can contain map, unmap and remap operations, but it >+ * also can be empty if no operation is required, e.g. if the requested mapping >+ * already exists is the exact same way. >+ * >+ * There can be an arbitrary amount of unmap operations, a maximum of two remap >+ * operations and a single map operation. The latter one, if existent, >+ * represents the original map operation requested by the caller. Please note >+ * that the map operation might has been modified, e.g. if it was >+ * merged with an existent mapping. >+ * >+ * Note that before calling this function again with another mapping request it >+ * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. >+ * The previously obtained operations must be either processed or abandoned. >+ * To update the &drm_gpuva_manager's view of the GPU VA space >+ * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >+ * drm_gpuva_destroy_unlocked() should be used. >+ * >+ * After the caller finished processing the returned &drm_gpuva_ops, they must >+ * be freed with &drm_gpuva_ops_free. >+ * >+ * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure >+ */ >+struct drm_gpuva_ops * >+drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >+ u64 req_addr, u64 req_range, >+ struct drm_gem_object *req_obj, u64 req_offset) >+{ >+ struct drm_gpuva_ops *ops; >+ struct drm_gpuva *va, *prev = NULL; >+ u64 req_end = req_addr + req_range; >+ bool skip_pmerge = false, skip_nmerge = false; >+ int ret; >+ >+ if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) >+ return ERR_PTR(-EINVAL); >+ >+ ops = kzalloc(sizeof(*ops), GFP_KERNEL); >+ if (!ops) >+ return ERR_PTR(-ENOMEM); >+ >+ INIT_LIST_HEAD(&ops->list); >+ >+ drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >+ struct drm_gem_object *obj = va->gem.obj; >+ u64 offset = va->gem.offset; >+ u64 addr = va->node.start; >+ u64 range = va->node.size; >+ u64 end = addr + range; >+ >+ /* Generally, we want to skip merging with potential mappings >+ * left and right of the requested one when we found a >+ * collision, since merging happens in this loop already. >+ * >+ * However, there is one exception when the requested mapping >+ * spans into a free VM area. If this is the case we might >+ * still hit the boundary of another mapping before and/or >+ * after the free VM area. >+ */ >+ skip_pmerge = true; >+ skip_nmerge = true; >+ >+ if (addr == req_addr) { >+ bool merge = obj == req_obj && >+ offset == req_offset; >+ if (end == req_end) { >+ if (merge) >+ goto done; >+ >+ op_unmap_new_to_list(&ops->list, va, false); >+ break; >+ } >+ >+ if (end < req_end) { >+ skip_nmerge = false; >+ op_unmap_new_to_list(&ops->list, va, merge); >+ goto next; >+ } >+ >+ if (end > req_end) { >+ struct drm_gpuva_op_map n = { >+ .va.addr = req_end, >+ .va.range = range - req_range, >+ .gem.obj = obj, >+ .gem.offset = offset + req_range, >+ }; >+ struct drm_gpuva_op_unmap u = { .va = va }; >+ >+ if (merge) >+ goto done; >+ >+ op_remap_new_to_list(&ops->list, NULL, &n, &u); >+ break; >+ } >+ } else if (addr < req_addr) { >+ u64 ls_range = req_addr - addr; >+ struct drm_gpuva_op_map p = { >+ .va.addr = addr, >+ .va.range = ls_range, >+ .gem.obj = obj, >+ .gem.offset = offset, >+ }; >+ struct drm_gpuva_op_unmap u = { .va = va }; >+ bool merge = obj == req_obj && >+ offset + ls_range == req_offset; >+ >+ if (end == req_end) { >+ if (merge) >+ goto done; >+ >+ op_remap_new_to_list(&ops->list, &p, NULL, &u); >+ break; >+ } >+ >+ if (end < req_end) { >+ u64 new_addr = addr; >+ u64 new_range = req_range + ls_range; >+ u64 new_offset = offset; >+ >+ /* We validated that the requested mapping is >+ * within a single VA region already. >+ * Since it overlaps the current mapping (which >+ * can't cross a VA region boundary) we can be >+ * sure that we're still within the boundaries >+ * of the same VA region after merging. >+ */ >+ if (merge) { >+ req_offset = new_offset; >+ req_addr = new_addr; >+ req_range = new_range; >+ op_unmap_new_to_list(&ops->list, va, true); >+ goto next; >+ } >+ >+ op_remap_new_to_list(&ops->list, &p, NULL, &u); >+ goto next; >+ } >+ >+ if (end > req_end) { >+ struct drm_gpuva_op_map n = { >+ .va.addr = req_end, >+ .va.range = end - req_end, >+ .gem.obj = obj, >+ .gem.offset = offset + ls_range + >+ req_range, >+ }; >+ >+ if (merge) >+ goto done; >+ >+ op_remap_new_to_list(&ops->list, &p, &n, &u); >+ break; >+ } >+ } else if (addr > req_addr) { >+ bool merge = obj == req_obj && >+ offset == req_offset + >+ (addr - req_addr); >+ if (!prev) >+ skip_pmerge = false; >+ >+ if (end == req_end) { >+ op_unmap_new_to_list(&ops->list, va, merge); >+ break; >+ } >+ >+ if (end < req_end) { >+ skip_nmerge = false; >+ op_unmap_new_to_list(&ops->list, va, merge); >+ goto next; >+ } >+ >+ if (end > req_end) { >+ struct drm_gpuva_op_map n = { >+ .va.addr = req_end, >+ .va.range = end - req_end, >+ .gem.obj = obj, >+ .gem.offset = offset + req_end - addr, >+ }; >+ struct drm_gpuva_op_unmap u = { .va = va }; >+ u64 new_end = end; >+ u64 new_range = new_end - req_addr; >+ >+ /* We validated that the requested mapping is >+ * within a single VA region already. >+ * Since it overlaps the current mapping (which >+ * can't cross a VA region boundary) we can be >+ * sure that we're still within the boundaries >+ * of the same VA region after merging. >+ */ >+ if (merge) { >+ req_end = new_end; >+ req_range = new_range; >+ op_unmap_new_to_list(&ops->list, va, true); >+ break; >+ } >+ >+ op_remap_new_to_list(&ops->list, NULL, &n, &u); >+ break; >+ } >+ } >+next: >+ prev = va; >+ } >+ >+ va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); >+ if (va) { >+ struct drm_gem_object *obj = va->gem.obj; >+ u64 offset = va->gem.offset; >+ u64 addr = va->node.start; >+ u64 range = va->node.size; >+ u64 new_offset = offset; >+ u64 new_addr = addr; >+ u64 new_range = req_range + range; >+ bool merge = obj == req_obj && >+ offset + range == req_offset; >+ >+ /* Don't merge over VA region boundaries. */ >+ merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); >+ if (merge) { >+ op_unmap_new_to_list(&ops->list, va, true); >+ >+ req_offset = new_offset; >+ req_addr = new_addr; >+ req_range = new_range; >+ } >+ } >+ >+ va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); >+ if (va) { >+ struct drm_gem_object *obj = va->gem.obj; >+ u64 offset = va->gem.offset; >+ u64 addr = va->node.start; >+ u64 range = va->node.size; >+ u64 end = addr + range; >+ u64 new_range = req_range + range; >+ u64 new_end = end; >+ bool merge = obj == req_obj && >+ offset == req_offset + req_range; >+ >+ /* Don't merge over VA region boundaries. */ >+ merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); >+ if (merge) { >+ op_unmap_new_to_list(&ops->list, va, true); >+ >+ req_range = new_range; >+ req_end = new_end; >+ } >+ } >+ >+ op_map_new_to_list(&ops->list, >+ req_addr, req_range, >+ req_obj, req_offset); >+ >+done: >+ return ops; >+ >+err_free_ops: >+ drm_gpuva_ops_free(ops); >+ return ERR_PTR(ret); >+} >+EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); >+ >+#undef op_map_new_to_list >+#undef op_remap_new_to_list >+#undef op_unmap_new_to_list >+ >+/** >+ * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to split on unmap >+ * @mgr: the &drm_gpuva_manager representing the GPU VA space >+ * @req_addr: the start address of the range to unmap >+ * @req_range: the range of the mappings to unmap >+ * >+ * This function creates a list of operations to perform unmapping and, if >+ * required, splitting of the mappings overlapping the unmap range. >+ * >+ * The list can be iterated with &drm_gpuva_for_each_op and must be processed >+ * in the given order. It can contain unmap and remap operations, depending on >+ * whether there are actual overlapping mappings to split. >+ * >+ * There can be an arbitrary amount of unmap operations and a maximum of two >+ * remap operations. >+ * >+ * Note that before calling this function again with another range to unmap it >+ * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. >+ * The previously obtained operations must be processed or abandoned. >+ * To update the &drm_gpuva_manager's view of the GPU VA space >+ * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >+ * drm_gpuva_destroy_unlocked() should be used. >+ * >+ * After the caller finished processing the returned &drm_gpuva_ops, they must >+ * be freed with &drm_gpuva_ops_free. >+ * >+ * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure >+ */ >+struct drm_gpuva_ops * >+drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >+ u64 req_addr, u64 req_range) >+{ >+ struct drm_gpuva_ops *ops; >+ struct drm_gpuva_op *op; >+ struct drm_gpuva_op_remap *r; >+ struct drm_gpuva *va; >+ u64 req_end = req_addr + req_range; >+ int ret; >+ >+ ops = kzalloc(sizeof(*ops), GFP_KERNEL); >+ if (!ops) >+ return ERR_PTR(-ENOMEM); >+ >+ INIT_LIST_HEAD(&ops->list); >+ >+ drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >+ struct drm_gem_object *obj = va->gem.obj; >+ u64 offset = va->gem.offset; >+ u64 addr = va->node.start; >+ u64 range = va->node.size; >+ u64 end = addr + range; >+ >+ op = kzalloc(sizeof(*op), GFP_KERNEL); >+ if (!op) { >+ ret = -ENOMEM; >+ goto err_free_ops; >+ } >+ >+ r = &op->remap; >+ >+ if (addr < req_addr) { >+ r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); >+ if (!r->prev) { >+ ret = -ENOMEM; >+ goto err_free_op; >+ } >+ >+ r->prev->va.addr = addr; >+ r->prev->va.range = req_addr - addr; >+ r->prev->gem.obj = obj; >+ r->prev->gem.offset = offset; >+ } >+ >+ if (end > req_end) { >+ r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); >+ if (!r->next) { >+ ret = -ENOMEM; >+ goto err_free_prev; >+ } >+ >+ r->next->va.addr = req_end; >+ r->next->va.range = end - req_end; >+ r->next->gem.obj = obj; >+ r->next->gem.offset = offset + (req_end - addr); >+ } >+ >+ if (op->remap.prev || op->remap.next) { >+ op->op = DRM_GPUVA_OP_REMAP; >+ r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); >+ if (!r->unmap) { >+ ret = -ENOMEM; >+ goto err_free_next; >+ } >+ >+ r->unmap->va = va; >+ } else { >+ op->op = DRM_GPUVA_OP_UNMAP; >+ op->unmap.va = va; >+ } >+ >+ list_add_tail(&op->entry, &ops->list); >+ } >+ >+ return ops; >+ >+err_free_next: >+ if (r->next) >+ kfree(r->next); >+err_free_prev: >+ if (r->prev) >+ kfree(r->prev); >+err_free_op: >+ kfree(op); >+err_free_ops: >+ drm_gpuva_ops_free(ops); >+ return ERR_PTR(ret); >+} >+EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); >+ >+/** >+ * drm_gpuva_ops_free - free the given &drm_gpuva_ops >+ * @ops: the &drm_gpuva_ops to free >+ * >+ * Frees the given &drm_gpuva_ops structure including all the ops associated >+ * with it. >+ */ >+void >+drm_gpuva_ops_free(struct drm_gpuva_ops *ops) >+{ >+ struct drm_gpuva_op *op, *next; >+ >+ drm_gpuva_for_each_op_safe(op, next, ops) { >+ list_del(&op->entry); >+ if (op->op == DRM_GPUVA_OP_REMAP) { >+ if (op->remap.prev) >+ kfree(op->remap.prev); >+ >+ if (op->remap.next) >+ kfree(op->remap.next); >+ >+ kfree(op->remap.unmap); >+ } >+ kfree(op); >+ } >+ >+ kfree(ops); >+} >+EXPORT_SYMBOL(drm_gpuva_ops_free); >diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h >index d7c521e8860f..6feacd93aca6 100644 >--- a/include/drm/drm_drv.h >+++ b/include/drm/drm_drv.h >@@ -104,6 +104,12 @@ enum drm_driver_feature { > * acceleration should be handled by two drivers that are connected using auxiliary bus. > */ > DRIVER_COMPUTE_ACCEL = BIT(7), >+ /** >+ * @DRIVER_GEM_GPUVA: >+ * >+ * Driver supports user defined GPU VA bindings for GEM objects. >+ */ >+ DRIVER_GEM_GPUVA = BIT(8), > > /* IMPORTANT: Below are all the legacy flags, add new ones above. */ > >diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h >index 772a4adf5287..4a3679034966 100644 >--- a/include/drm/drm_gem.h >+++ b/include/drm/drm_gem.h >@@ -36,6 +36,8 @@ > > #include <linux/kref.h> > #include <linux/dma-resv.h> >+#include <linux/list.h> >+#include <linux/mutex.h> > > #include <drm/drm_vma_manager.h> > >@@ -337,6 +339,17 @@ struct drm_gem_object { > */ > struct dma_resv _resv; > >+ /** >+ * @gpuva: >+ * >+ * Provides the list and list mutex of GPU VAs attached to this >+ * GEM object. >+ */ >+ struct { >+ struct list_head list; >+ struct mutex mutex; >+ } gpuva; >+ > /** > * @funcs: > * >@@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru *lru, struct drm_gem_object *obj); > unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned nr_to_scan, > bool (*shrink)(struct drm_gem_object *obj)); > >+/** >+ * drm_gem_gpuva_init - initialize the gpuva list of a GEM object >+ * @obj: the &drm_gem_object >+ * >+ * This initializes the &drm_gem_object's &drm_gpuva list and the mutex >+ * protecting it. >+ * >+ * Calling this function is only necessary for drivers intending to support the >+ * &drm_driver_feature DRIVER_GEM_GPUVA. >+ */ >+static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) >+{ >+ INIT_LIST_HEAD(&obj->gpuva.list); >+ mutex_init(&obj->gpuva.mutex); >+} >+ >+/** >+ * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex >+ * @obj: the &drm_gem_object >+ * >+ * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. >+ */ >+static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) >+{ >+ mutex_lock(&obj->gpuva.mutex); >+} >+ >+/** >+ * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex >+ * @obj: the &drm_gem_object >+ * >+ * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. >+ */ >+static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) >+{ >+ mutex_unlock(&obj->gpuva.mutex); >+} >+ >+/** >+ * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas >+ * @entry: &drm_gpuva structure to assign to in each iteration step >+ * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >+ * >+ * This iterator walks over all &drm_gpuva structures associated with the >+ * &drm_gpuva_manager. >+ */ >+#define drm_gem_for_each_gpuva(entry, obj) \ >+ list_for_each_entry(entry, &obj->gpuva.list, head) >+ >+/** >+ * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list of gpuvas >+ * @entry: &drm_gpuva structure to assign to in each iteration step >+ * @next: &next &drm_gpuva to store the next step >+ * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >+ * >+ * This iterator walks over all &drm_gpuva structures associated with the >+ * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence >+ * it is save against removal of elements. >+ */ >+#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ >+ list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) >+ > #endif /* __DRM_GEM_H__ */ >diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h >new file mode 100644 >index 000000000000..adeb0c916e91 >--- /dev/null >+++ b/include/drm/drm_gpuva_mgr.h >@@ -0,0 +1,527 @@ >+// SPDX-License-Identifier: GPL-2.0 >+ >+#ifndef __DRM_GPUVA_MGR_H__ >+#define __DRM_GPUVA_MGR_H__ >+ >+/* >+ * Copyright (c) 2022 Red Hat. >+ * >+ * Permission is hereby granted, free of charge, to any person obtaining a >+ * copy of this software and associated documentation files (the "Software"), >+ * to deal in the Software without restriction, including without limitation >+ * the rights to use, copy, modify, merge, publish, distribute, sublicense, >+ * and/or sell copies of the Software, and to permit persons to whom the >+ * Software is furnished to do so, subject to the following conditions: >+ * >+ * The above copyright notice and this permission notice shall be included in >+ * all copies or substantial portions of the Software. >+ * >+ * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR >+ * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, >+ * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL >+ * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR >+ * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >+ * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >+ * OTHER DEALINGS IN THE SOFTWARE. >+ */ >+ >+#include <drm/drm_mm.h> >+#include <linux/mm.h> >+#include <linux/rbtree.h> >+#include <linux/spinlock.h> >+#include <linux/types.h> >+ >+struct drm_gpuva_region; >+struct drm_gpuva; >+struct drm_gpuva_ops; >+ >+/** >+ * struct drm_gpuva_manager - DRM GPU VA Manager >+ * >+ * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using >+ * the &drm_mm range allocator. Typically, this structure is embedded in bigger >+ * driver structures. >+ * >+ * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or >+ * pages. >+ * >+ * There should be one manager instance per GPU virtual address space. >+ */ >+struct drm_gpuva_manager { >+ /** >+ * @name: the name of the DRM GPU VA space >+ */ >+ const char *name; >+ >+ /** >+ * @mm_start: start of the VA space >+ */ >+ u64 mm_start; >+ >+ /** >+ * @mm_range: length of the VA space >+ */ >+ u64 mm_range; >+ >+ /** >+ * @region_mm: the &drm_mm range allocator to track GPU VA regions >+ */ >+ struct drm_mm region_mm; >+ >+ /** >+ * @va_mm: the &drm_mm range allocator to track GPU VA mappings >+ */ >+ struct drm_mm va_mm; >+ >+ /** >+ * @kernel_alloc_node: >+ * >+ * &drm_mm_node representing the address space cutout reserved for >+ * the kernel >+ */ >+ struct drm_mm_node kernel_alloc_node; >+}; >+ >+void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >+ const char *name, >+ u64 start_offset, u64 range, >+ u64 reserve_offset, u64 reserve_range); >+void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); >+ >+/** >+ * struct drm_gpuva_region - structure to track a portion of GPU VA space >+ * >+ * This structure represents a portion of a GPUs VA space and is associated >+ * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >+ * >+ * GPU VA mappings, represented by &drm_gpuva objects, are restricted to be >+ * placed within a &drm_gpuva_region. >+ */ >+struct drm_gpuva_region { >+ /** >+ * @node: the &drm_mm_node to track the GPU VA region >+ */ >+ struct drm_mm_node node; >+ >+ /** >+ * @mgr: the &drm_gpuva_manager this object is associated with >+ */ >+ struct drm_gpuva_manager *mgr; >+ >+ /** >+ * @sparse: indicates whether this region is sparse >+ */ >+ bool sparse; >+}; >+ >+struct drm_gpuva_region * >+drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >+ u64 addr, u64 range); >+int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >+ struct drm_gpuva_region *reg, >+ u64 addr, u64 range); >+void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >+ struct drm_gpuva_region *reg); >+ >+int drm_gpuva_insert(struct drm_gpuva_manager *mgr, >+ struct drm_gpuva *va, >+ u64 addr, u64 range); >+/** >+ * drm_gpuva_for_each_region_in_range - iternator to walk over a range of nodes >+ * @node__: &drm_gpuva_region structure to assign to in each iteration step >+ * @gpuva__: &drm_gpuva_manager structure to walk >+ * @start__: starting offset, the first node will overlap this >+ * @end__: ending offset, the last node will start before this (but may overlap) >+ * >+ * This iterator walks over all nodes in the range allocator that lie >+ * between @start and @end. It is implemented similarly to list_for_each(), >+ * but is using &drm_mm's internal interval tree to accelerate the search for >+ * the starting node, and hence isn't safe against removal of elements. It >+ * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. >+ * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >+ * iterator may walk over the special _unallocated_ &drm_mm.head_node of the >+ * backing &drm_mm, and may even continue indefinitely. >+ */ >+#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, end__) \ >+ for (node__ = (struct drm_gpuva_region *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ >+ (start__), (end__)-1); \ >+ node__->node.start < (end__); \ >+ node__ = (struct drm_gpuva_region *)list_next_entry(&node__->node, node_list)) >+ >+/** >+ * drm_gpuva_for_each_region - iternator to walk over a range of nodes >+ * @entry: &drm_gpuva_region structure to assign to in each iteration step >+ * @gpuva: &drm_gpuva_manager structure to walk >+ * >+ * This iterator walks over all &drm_gpuva_region structures associated with the >+ * &drm_gpuva_manager. >+ */ >+#define drm_gpuva_for_each_region(entry, gpuva) \ >+ list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >+ >+/** >+ * drm_gpuva_for_each_region_safe - iternator to safely walk over a range of >+ * nodes >+ * @entry: &drm_gpuva_region structure to assign to in each iteration step >+ * @next: &next &drm_gpuva_region to store the next step >+ * @gpuva: &drm_gpuva_manager structure to walk >+ * >+ * This iterator walks over all &drm_gpuva_region structures associated with the >+ * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save >+ * against removal of elements. >+ */ >+#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ >+ list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >+ >+ >+/** >+ * enum drm_gpuva_flags - flags for struct drm_gpuva >+ */ >+enum drm_gpuva_flags { >+ /** >+ * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is swapped >+ */ >+ DRM_GPUVA_SWAPPED = (1 << 0), >+}; >+ >+/** >+ * struct drm_gpuva - structure to track a GPU VA mapping >+ * >+ * This structure represents a GPU VA mapping and is associated with a >+ * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >+ * >+ * Typically, this structure is embedded in bigger driver structures. >+ */ >+struct drm_gpuva { >+ /** >+ * @node: the &drm_mm_node to track the GPU VA mapping >+ */ >+ struct drm_mm_node node; >+ >+ /** >+ * @mgr: the &drm_gpuva_manager this object is associated with >+ */ >+ struct drm_gpuva_manager *mgr; >+ >+ /** >+ * @region: the &drm_gpuva_region the &drm_gpuva is mapped in >+ */ >+ struct drm_gpuva_region *region; >+ >+ /** >+ * @head: the &list_head to attach this object to a &drm_gem_object >+ */ >+ struct list_head head; >+ >+ /** >+ * @flags: the &drm_gpuva_flags for this mapping >+ */ >+ enum drm_gpuva_flags flags; >+ >+ /** >+ * @gem: structure containing the &drm_gem_object and it's offset >+ */ >+ struct { >+ /** >+ * @offset: the offset within the &drm_gem_object >+ */ >+ u64 offset; >+ >+ /** >+ * @obj: the mapped &drm_gem_object >+ */ >+ struct drm_gem_object *obj; >+ } gem; >+}; >+ >+void drm_gpuva_link_locked(struct drm_gpuva *va); >+void drm_gpuva_link_unlocked(struct drm_gpuva *va); >+void drm_gpuva_unlink_locked(struct drm_gpuva *va); >+void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); >+ >+void drm_gpuva_destroy_locked(struct drm_gpuva *va); >+void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); >+ >+struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, >+ u64 addr, u64 range); >+struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start); >+struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end); >+ >+/** >+ * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva is swapped >+ * @va: the &drm_gpuva to set the swap flag of >+ * @swap: indicates whether the &drm_gpuva is swapped >+ */ >+static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) >+{ >+ if (swap) >+ va->flags |= DRM_GPUVA_SWAPPED; >+ else >+ va->flags &= ~DRM_GPUVA_SWAPPED; >+} >+ >+/** >+ * drm_gpuva_swapped - indicates whether the backing BO of this &drm_gpuva >+ * is swapped >+ * @va: the &drm_gpuva to check >+ */ >+static inline bool drm_gpuva_swapped(struct drm_gpuva *va) >+{ >+ return va->flags & DRM_GPUVA_SWAPPED; >+} >+ >+/** >+ * drm_gpuva_for_each_va_in_range - iternator to walk over a range of nodes >+ * @node__: &drm_gpuva structure to assign to in each iteration step >+ * @gpuva__: &drm_gpuva_manager structure to walk >+ * @start__: starting offset, the first node will overlap this >+ * @end__: ending offset, the last node will start before this (but may overlap) >+ * >+ * This iterator walks over all nodes in the range allocator that lie >+ * between @start and @end. It is implemented similarly to list_for_each(), >+ * but is using &drm_mm's internal interval tree to accelerate the search for >+ * the starting node, and hence isn't safe against removal of elements. It >+ * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. >+ * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >+ * iterator may walk over the special _unallocated_ &drm_mm.head_node of the >+ * backing &drm_mm, and may even continue indefinitely. >+ */ >+#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, end__) \ >+ for (node__ = (struct drm_gpuva *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ >+ (start__), (end__)-1); \ >+ node__->node.start < (end__); \ >+ node__ = (struct drm_gpuva *)list_next_entry(&node__->node, node_list)) >+ >+/** >+ * drm_gpuva_for_each_va - iternator to walk over a range of nodes >+ * @entry: &drm_gpuva structure to assign to in each iteration step >+ * @gpuva: &drm_gpuva_manager structure to walk >+ * >+ * This iterator walks over all &drm_gpuva structures associated with the >+ * &drm_gpuva_manager. >+ */ >+#define drm_gpuva_for_each_va(entry, gpuva) \ >+ list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >+ >+/** >+ * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of >+ * nodes >+ * @entry: &drm_gpuva structure to assign to in each iteration step >+ * @next: &next &drm_gpuva to store the next step >+ * @gpuva: &drm_gpuva_manager structure to walk >+ * >+ * This iterator walks over all &drm_gpuva structures associated with the >+ * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save >+ * against removal of elements. >+ */ >+#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ >+ list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >+ >+/** >+ * enum drm_gpuva_op_type - GPU VA operation type >+ * >+ * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager >+ * can be map, remap or unmap operations. >+ */ >+enum drm_gpuva_op_type { >+ /** >+ * @DRM_GPUVA_OP_MAP: the map op type >+ */ >+ DRM_GPUVA_OP_MAP, >+ >+ /** >+ * @DRM_GPUVA_OP_REMAP: the remap op type >+ */ >+ DRM_GPUVA_OP_REMAP, >+ >+ /** >+ * @DRM_GPUVA_OP_UNMAP: the unmap op type >+ */ >+ DRM_GPUVA_OP_UNMAP, >+}; >+ >+/** >+ * struct drm_gpuva_op_map - GPU VA map operation >+ * >+ * This structure represents a single map operation generated by the >+ * DRM GPU VA manager. >+ */ >+struct drm_gpuva_op_map { >+ /** >+ * @va: structure containing address and range of a map >+ * operation >+ */ >+ struct { >+ /** >+ * @addr: the base address of the new mapping >+ */ >+ u64 addr; >+ >+ /** >+ * @range: the range of the new mapping >+ */ >+ u64 range; >+ } va; >+ >+ /** >+ * @gem: structure containing the &drm_gem_object and it's offset >+ */ >+ struct { >+ /** >+ * @offset: the offset within the &drm_gem_object >+ */ >+ u64 offset; >+ >+ /** >+ * @obj: the &drm_gem_object to map >+ */ >+ struct drm_gem_object *obj; >+ } gem; >+}; >+ >+/** >+ * struct drm_gpuva_op_unmap - GPU VA unmap operation >+ * >+ * This structure represents a single unmap operation generated by the >+ * DRM GPU VA manager. >+ */ >+struct drm_gpuva_op_unmap { >+ /** >+ * @va: the &drm_gpuva to unmap >+ */ >+ struct drm_gpuva *va; >+ >+ /** >+ * @keep: >+ * >+ * Indicates whether this &drm_gpuva is physically contiguous with the >+ * original mapping request. >+ * >+ * Optionally, if &keep is set, drivers may keep the actual page table >+ * mappings for this &drm_gpuva, adding the missing page table entries >+ * only and update the &drm_gpuva_manager accordingly. >+ */ >+ bool keep; >+}; >+ >+/** >+ * struct drm_gpuva_op_remap - GPU VA remap operation >+ * >+ * This represents a single remap operation generated by the DRM GPU VA manager. >+ * >+ * A remap operation is generated when an existing GPU VA mmapping is split up >+ * by inserting a new GPU VA mapping or by partially unmapping existent >+ * mapping(s), hence it consists of a maximum of two map and one unmap >+ * operation. >+ * >+ * The @unmap operation takes care of removing the original existing mapping. >+ * @prev is used to remap the preceding part, @next the subsequent part. >+ * >+ * If either a new mapping's start address is aligned with the start address >+ * of the old mapping or the new mapping's end address is aligned with the >+ * end address of the old mapping, either @prev or @next is NULL. >+ * >+ * Note, the reason for a dedicated remap operation, rather than arbitrary >+ * unmap and map operations, is to give drivers the chance of extracting driver >+ * specific data for creating the new mappings from the unmap operations's >+ * &drm_gpuva structure which typically is embedded in larger driver specific >+ * structures. >+ */ >+struct drm_gpuva_op_remap { >+ /** >+ * @prev: the preceding part of a split mapping >+ */ >+ struct drm_gpuva_op_map *prev; >+ >+ /** >+ * @next: the subsequent part of a split mapping >+ */ >+ struct drm_gpuva_op_map *next; >+ >+ /** >+ * @unmap: the unmap operation for the original existing mapping >+ */ >+ struct drm_gpuva_op_unmap *unmap; >+}; >+ >+/** >+ * struct drm_gpuva_op - GPU VA operation >+ * >+ * This structure represents a single generic operation, which can be either >+ * map, unmap or remap. >+ * >+ * The particular type of the operation is defined by @op. >+ */ >+struct drm_gpuva_op { >+ /** >+ * @entry: >+ * >+ * The &list_head used to distribute instances of this struct within >+ * &drm_gpuva_ops. >+ */ >+ struct list_head entry; >+ >+ /** >+ * @op: the type of the operation >+ */ >+ enum drm_gpuva_op_type op; >+ >+ union { >+ /** >+ * @map: the map operation >+ */ >+ struct drm_gpuva_op_map map; >+ >+ /** >+ * @unmap: the unmap operation >+ */ >+ struct drm_gpuva_op_unmap unmap; >+ >+ /** >+ * @remap: the remap operation >+ */ >+ struct drm_gpuva_op_remap remap; >+ }; >+}; >+ >+/** >+ * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op >+ */ >+struct drm_gpuva_ops { >+ /** >+ * @list: the &list_head >+ */ >+ struct list_head list; >+}; >+ >+/** >+ * drm_gpuva_for_each_op - iterator to walk over all ops >+ * @op: &drm_gpuva_op to assign in each iteration step >+ * @ops: &drm_gpuva_ops to walk >+ * >+ * This iterator walks over all ops within a given list of operations. >+ */ >+#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry) >+ >+/** >+ * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops >+ * @op: &drm_gpuva_op to assign in each iteration step >+ * @next: &next &drm_gpuva_op to store the next step >+ * @ops: &drm_gpuva_ops to walk >+ * >+ * This iterator walks over all ops within a given list of operations. It is >+ * implemented with list_for_each_safe(), so save against removal of elements. >+ */ >+#define drm_gpuva_for_each_op_safe(op, next, ops) \ >+ list_for_each_entry_safe(op, next, &(ops)->list, entry) >+ >+struct drm_gpuva_ops * >+drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >+ u64 addr, u64 range, >+ struct drm_gem_object *obj, u64 offset); >+struct drm_gpuva_ops * >+drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >+ u64 addr, u64 range); >+void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); >+ >+#endif /* __DRM_GPUVA_MGR_H__ */ >-- >2.39.0 >
On 1/24/23 00:23, Niranjana Vishwanathapura wrote: > On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: >> This adds the infrastructure for a manager implementation to keep track >> of GPU virtual address (VA) mappings. >> >> New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers >> start implementing, allow userspace applications to request multiple and >> arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is >> intended to serve the following purposes in this context. >> >> 1) Provide a dedicated range allocator to track GPU VA allocations and >>  mappings, making use of the drm_mm range allocator. >> >> 2) Generically connect GPU VA mappings to their backing buffers, in >>  particular DRM GEM objects. >> >> 3) Provide a common implementation to perform more complex mapping >>  operations on the GPU VA space. In particular splitting and merging >>  of GPU VA mappings, e.g. for intersecting mapping requests or partial >>  unmap requests. >> >> Idea-suggested-by: Dave Airlie <airlied@redhat.com> >> Signed-off-by: Danilo Krummrich <dakr@redhat.com> >> --- >> Documentation/gpu/drm-mm.rst   |  31 + >> drivers/gpu/drm/Makefile       |   1 + >> drivers/gpu/drm/drm_gem.c      |   3 + >> drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ >> include/drm/drm_drv.h          |   6 + >> include/drm/drm_gem.h          |  75 ++ >> include/drm/drm_gpuva_mgr.h    | 527 ++++++++++++ >> 7 files changed, 1966 insertions(+) >> create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c >> create mode 100644 include/drm/drm_gpuva_mgr.h >> >> diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst >> index a52e6f4117d6..c9f120cfe730 100644 >> --- a/Documentation/gpu/drm-mm.rst >> +++ b/Documentation/gpu/drm-mm.rst >> @@ -466,6 +466,37 @@ DRM MM Range Allocator Function References >> .. kernel-doc:: drivers/gpu/drm/drm_mm.c >>   :export: >> >> +DRM GPU VA Manager >> +================== >> + >> +Overview >> +-------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> +  :doc: Overview >> + >> +Split and Merge >> +--------------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> +  :doc: Split and Merge >> + >> +Locking >> +------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> +  :doc: Locking >> + >> + >> +DRM GPU VA Manager Function References >> +-------------------------------------- >> + >> +.. kernel-doc:: include/drm/drm_gpuva_mgr.h >> +  :internal: >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> +  :export: >> + >> DRM Buddy Allocator >> =================== >> >> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile >> index 4fe190aee584..de2ffca3b6e4 100644 >> --- a/drivers/gpu/drm/Makefile >> +++ b/drivers/gpu/drm/Makefile >> @@ -45,6 +45,7 @@ drm-y := \ >>     drm_vblank.o \ >>     drm_vblank_work.o \ >>     drm_vma_manager.o \ >> +   drm_gpuva_mgr.o \ >>     drm_writeback.o >> drm-$(CONFIG_DRM_LEGACY) += \ >>     drm_agpsupport.o \ >> diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c >> index 59a0bb5ebd85..65115fe88627 100644 >> --- a/drivers/gpu/drm/drm_gem.c >> +++ b/drivers/gpu/drm/drm_gem.c >> @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device >> *dev, >>     if (!obj->resv) >>        obj->resv = &obj->_resv; >> >> +   if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) >> +       drm_gem_gpuva_init(obj); >> + >>     drm_vma_node_reset(&obj->vma_node); >>     INIT_LIST_HEAD(&obj->lru_node); >> } >> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c >> b/drivers/gpu/drm/drm_gpuva_mgr.c >> new file mode 100644 >> index 000000000000..e665f642689d >> --- /dev/null >> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c >> @@ -0,0 +1,1323 @@ >> +// SPDX-License-Identifier: GPL-2.0 >> +/* >> + * Copyright (c) 2022 Red Hat. >> + * >> + * Permission is hereby granted, free of charge, to any person >> obtaining a >> + * copy of this software and associated documentation files (the >> "Software"), >> + * to deal in the Software without restriction, including without >> limitation >> + * the rights to use, copy, modify, merge, publish, distribute, >> sublicense, >> + * and/or sell copies of the Software, and to permit persons to whom the >> + * Software is furnished to do so, subject to the following conditions: >> + * >> + * The above copyright notice and this permission notice shall be >> included in >> + * all copies or substantial portions of the Software. >> + * >> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, >> EXPRESS OR >> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF >> MERCHANTABILITY, >> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT >> SHALL >> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, >> DAMAGES OR >> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >> + * OTHER DEALINGS IN THE SOFTWARE. >> + * >> + * Authors: >> + *    Danilo Krummrich <dakr@redhat.com> >> + * >> + */ >> + >> +#include <drm/drm_gem.h> >> +#include <drm/drm_gpuva_mgr.h> >> + >> +/** >> + * DOC: Overview >> + * >> + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager >> keeps track >> + * of a GPU's virtual address (VA) space and manages the >> corresponding virtual >> + * mappings represented by &drm_gpuva objects. It also keeps track of >> the >> + * mapping's backing &drm_gem_object buffers. >> + * >> + * &drm_gem_object buffers maintain a list (and a corresponding list >> lock) of >> + * &drm_gpuva objects representing all existent GPU VA mappings using >> this >> + * &drm_gem_object as backing buffer. >> + * >> + * A GPU VA mapping can only be created within a previously allocated >> + * &drm_gpuva_region, which represents a reserved portion of the GPU >> VA space. >> + * GPU VA mappings are not allowed to span over a &drm_gpuva_region's >> boundary. >> + * >> + * GPU VA regions can also be flagged as sparse, which allows drivers >> to create >> + * sparse mappings for a whole GPU VA region in order to support Vulkan >> + * 'Sparse Resources'. >> + * > > So, is the sparse resources the only usecase for the VA region > abstraction in uapi? > Or are there any other potential usecases (other than kernel reserved > space)? > It's mainly for sparse resources, though it also enables the kernel to know the bounds within it is allowed to merge mappings. >> + * The GPU VA manager internally uses the &drm_mm range allocator to >> manage the >> + * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's >> virtual address >> + * space. >> + * >> + * Besides the GPU VA space regions (&drm_gpuva_region) allocated by >> a driver >> + * the &drm_gpuva_manager contains a special region representing the >> portion of >> + * VA space reserved by the kernel. This node is initialized together >> with the >> + * GPU VA manager instance and removed when the GPU VA manager is >> destroyed. >> + * >> + * In a typical application drivers would embed struct >> drm_gpuva_manager, >> + * struct drm_gpuva_region and struct drm_gpuva within their own driver >> + * specific structures, there won't be any memory allocations of it's >> own nor >> + * memory allocations of &drm_gpuva or &drm_gpuva_region entries. >> + */ >> + >> +/** >> + * DOC: Split and Merge >> + * >> + * The DRM GPU VA manager also provides an algorithm implementing >> splitting and >> + * merging of existent GPU VA mappings with the ones that are >> requested to be >> + * mapped or unmapped. This feature is required by the Vulkan API to >> implement >> + * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to >> this as >> + * VM BIND. > > Looks like the split and merge is only based on address continuity > (virtual and > physical). > How about potential page table attributes specified during VM_BIND for a > mapping > such as readonly, caching attributes, atomicity etc? > In this (first) patch version the GPU VA manager generates a list of (sub-)operations (or split/merge steps) for each map / unmap request, which I will replace with callbacks for each split/merge step in v2. For generating those steps the GPU VA manager indeed does take in consideration address continuity (virtual and physical) only. However, driver specific attributes can be taken in consideration by drivers for each of those split/merge steps. E.g. for a map request the GPU VA manager generates an arbitrary amount of unmap ops, a maximum of two remap ops (for splitting) and exactly one map operation. Unmap operations carry their corresponding struct drm_gpuva being embedded in it's driver specific structure. They're generated for all mappings residing (and are enclosed by) the range of the new mapping to map and for mappings that are direct and physically contiguous neighbors and are in the same region. Unmap operations also carry a flag to indicate drivers whether the mapping to unmap is physically contiguous, such that drivers could only do a partial page table update in case this is feasible . Remap operations consist out of one unmap operation and at least one, but a maximum of two map operations, such that drivers can apply the original driver specific attributes of the mapping to the new (split up) mapping(s). For the single map operation we know the driver specific attributes as well, since they're just the ones that were passed to the driver through the original VM_BIND ioctl requesting the new mapping in the first place. I hope I got your question right and the answer helps. - Danilo > Thanks, > Niranjana > >> + * >> + * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a list of >> map, unmap >> + * and remap operations for a given newly requested mapping. This list >> + * represents the set of operations to execute in order to integrate >> the new >> + * mapping cleanly into the current state of the GPU VA space. >> + * >> + * Depending on how the new GPU VA mapping intersects with the >> existent mappings >> + * of the GPU VA space the &drm_gpuva_ops contain an arbitrary amount >> of unmap >> + * operations, a maximum of two remap operations and a single map >> operation. >> + * The set of operations can also be empty if no operation is >> required, e.g. if >> + * the requested mapping already exists in the exact same way. >> + * >> + * The single map operation, if existent, represents the original map >> operation >> + * requested by the caller. Please note that this operation might be >> altered >> + * comparing it with the original map operation, e.g. because it was >> merged with >> + * an already existent mapping. Hence, drivers must execute this map >> operation >> + * instead of the original one they passed to >> drm_gpuva_sm_map_ops_create(). >> + * >> + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates >> whether the >> + * &drm_gpuva to unmap is physically contiguous with the original >> mapping >> + * request. Optionally, if 'keep' is set, drivers may keep the actual >> page table >> + * entries for this &drm_gpuva, adding the missing page table entries >> only and >> + * update the &drm_gpuva_manager's view of things accordingly. >> + * >> + * Drivers may do the same optimization, namely delta page table >> updates, also >> + * for remap operations. This is possible since &drm_gpuva_op_remap >> consists of >> + * one unmap operation and one or two map operations, such that >> drivers can >> + * derive the page table update delta accordingly. >> + * >> + * Note that there can't be more than two existent mappings to split >> up, one at >> + * the beginning and one at the end of the new mapping, hence there is a >> + * maximum of two remap operations. >> + * >> + * Generally, the DRM GPU VA manager never merges mappings across the >> + * boundaries of &drm_gpuva_regions. This is the case since merging >> between >> + * GPU VA regions would result into unmap and map operations to be >> issued for >> + * both regions involved although the original mapping request was >> referred to >> + * one specific GPU VA region only. Since the other GPU VA region, >> the one not >> + * explicitly requested to be altered, might be in use by the GPU, we >> are not >> + * allowed to issue any map/unmap operations for this region. >> + * >> + * Note that before calling drm_gpuva_sm_map_ops_create() again with >> another >> + * mapping request it is necessary to update the &drm_gpuva_manager's >> view of >> + * the GPU VA space. The previously obtained operations must be >> either fully >> + * processed or completely abandoned. >> + * >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * Analogue to drm_gpuva_sm_map_ops_create() >> drm_gpuva_sm_unmap_ops_create() >> + * provides drivers a the list of operations to be executed in order >> to unmap >> + * a range of GPU VA space. The logic behind this functions is way >> simpler >> + * though: For all existent mappings enclosed by the given range unmap >> + * operations are created. For mappings which are only partically >> located within >> + * the given range, remap operations are created such that those >> mappings are >> + * split up and re-mapped partically. >> + * >> + * The following paragraph depicts the basic constellations of >> existent GPU VA >> + * mappings, a newly requested mapping and the resulting mappings as >> implemented >> + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary >> combinations >> + * of those constellations. >> + * >> + * :: >> + * >> + *   1) Existent mapping is kept. >> + *   ---------------------------- >> + * >> + *        0    a    1 >> + *   old: |-----------| (bo_offset=n) >> + * >> + *        0    a    1 >> + *   req: |-----------| (bo_offset=n) >> + * >> + *        0    a    1 >> + *   new: |-----------| (bo_offset=n) >> + * >> + * >> + *   2) Existent mapping is replaced. >> + *   -------------------------------- >> + * >> + *        0    a    1 >> + *   old: |-----------| (bo_offset=n) >> + * >> + *        0    a    1 >> + *   req: |-----------| (bo_offset=m) >> + * >> + *        0    a    1 >> + *   new: |-----------| (bo_offset=m) >> + * >> + * >> + *   3) Existent mapping is replaced. >> + *   -------------------------------- >> + * >> + *        0    a    1 >> + *   old: |-----------| (bo_offset=n) >> + * >> + *        0    b    1 >> + *   req: |-----------| (bo_offset=n) >> + * >> + *        0    b    1 >> + *   new: |-----------| (bo_offset=n) >> + * >> + * >> + *   4) Existent mapping is replaced. >> + *   -------------------------------- >> + * >> + *        0 a 1 >> + *   old: |-----|      (bo_offset=n) >> + * >> + *        0    a    2 >> + *   req: |-----------| (bo_offset=n) >> + * >> + *        0    a    2 >> + *   new: |-----------| (bo_offset=n) >> + * >> + *   Note: We expect to see the same result for a request with a >> different bo >> + *         and/or bo_offset. >> + * >> + * >> + *   5) Existent mapping is split. >> + *   ----------------------------- >> + * >> + *        0    a    2 >> + *   old: |-----------| (bo_offset=n) >> + * >> + *        0 b 1 >> + *   req: |-----|      (bo_offset=n) >> + * >> + *        0 b 1 a' 2 >> + *   new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) >> + * >> + *   Note: We expect to see the same result for a request with a >> different bo >> + *         and/or non-contiguous bo_offset. >> + * >> + * >> + *   6) Existent mapping is kept. >> + *   ---------------------------- >> + * >> + *        0    a    2 >> + *   old: |-----------| (bo_offset=n) >> + * >> + *        0 a 1 >> + *   req: |-----|      (bo_offset=n) >> + * >> + *        0    a    2 >> + *   new: |-----------| (bo_offset=n) >> + * >> + * >> + *   7) Existent mapping is split. >> + *   ----------------------------- >> + * >> + *        0    a    2 >> + *   old: |-----------| (bo_offset=n) >> + * >> + *              1 b 2 >> + *   req:      |-----| (bo_offset=m) >> + * >> + *        0 a 1 b 2 >> + *   new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + *   8) Existent mapping is kept. >> + *   ---------------------------- >> + * >> + *         0    a    2 >> + *   old: |-----------| (bo_offset=n) >> + * >> + *              1 a 2 >> + *   req:      |-----| (bo_offset=n+1) >> + * >> + *        0    a    2 >> + *   new: |-----------| (bo_offset=n) >> + * >> + * >> + *   9) Existent mapping is split. >> + *   ----------------------------- >> + * >> + *        0    a    2 >> + *   old: |-----------|      (bo_offset=n) >> + * >> + *              1    b    3 >> + *   req:      |-----------| (bo_offset=m) >> + * >> + *        0 a 1    b    3 >> + *   new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + *   10) Existent mapping is merged. >> + *   ------------------------------- >> + * >> + *        0    a    2 >> + *   old: |-----------|      (bo_offset=n) >> + * >> + *              1    a    3 >> + *   req:      |-----------| (bo_offset=n+1) >> + * >> + *        0       a       3 >> + *   new: |-----------------| (bo_offset=n) >> + * >> + * >> + *   11) Existent mapping is split. >> + *   ------------------------------ >> + * >> + *        0       a       3 >> + *   old: |-----------------| (bo_offset=n) >> + * >> + *              1 b 2 >> + *   req:      |-----|      (bo_offset=m) >> + * >> + *        0 a 1 b 2 a' 3 >> + *   new: |-----|-----|-----| >> (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) >> + * >> + * >> + *   12) Existent mapping is kept. >> + *   ----------------------------- >> + * >> + *        0       a       3 >> + *   old: |-----------------| (bo_offset=n) >> + * >> + *              1 a 2 >> + *   req:      |-----|      (bo_offset=n+1) >> + * >> + *        0       a       3 >> + *   old: |-----------------| (bo_offset=n) >> + * >> + * >> + *   13) Existent mapping is replaced. >> + *   --------------------------------- >> + * >> + *              1 a 2 >> + *   old:      |-----| (bo_offset=n) >> + * >> + *        0    a    2 >> + *   req: |-----------| (bo_offset=n) >> + * >> + *        0    a    2 >> + *   new: |-----------| (bo_offset=n) >> + * >> + *   Note: We expect to see the same result for a request with a >> different bo >> + *         and/or non-contiguous bo_offset. >> + * >> + * >> + *   14) Existent mapping is replaced. >> + *   --------------------------------- >> + * >> + *              1 a 2 >> + *   old:      |-----| (bo_offset=n) >> + * >> + *        0       a      3 >> + *   req: |----------------| (bo_offset=n) >> + * >> + *        0       a      3 >> + *   new: |----------------| (bo_offset=n) >> + * >> + *   Note: We expect to see the same result for a request with a >> different bo >> + *         and/or non-contiguous bo_offset. >> + * >> + * >> + *   15) Existent mapping is split. >> + *   ------------------------------ >> + * >> + *              1    a    3 >> + *   old:      |-----------| (bo_offset=n) >> + * >> + *        0    b    2 >> + *   req: |-----------|      (bo_offset=m) >> + * >> + *        0    b    2 a' 3 >> + *   new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) >> + * >> + * >> + *   16) Existent mappings are merged. >> + *   --------------------------------- >> + * >> + *        0    a    1 >> + *   old: |-----------|                       (bo_offset=n) >> + * >> + *                               2    a    3 >> + *   old':                      |-----------| (bo_offset=n+2) >> + * >> + *                   1    a    2 >> + *   req:           |-----------|            (bo_offset=n+1) >> + * >> + *                         a >> + *   new: |----------------------------------| (bo_offset=n) >> + */ >> + >> +/** >> + * DOC: Locking >> + * >> + * Generally, the GPU VA manager does not take care of locking >> itself, it is >> + * the drivers responsibility to take care about locking. Drivers >> might want to >> + * protect the following operations: inserting, destroying and iterating >> + * &drm_gpuva and &drm_gpuva_region objects as well as generating >> split and merge >> + * operations. >> + * >> + * The GPU VA manager does take care of the locking of the backing >> + * &drm_gem_object buffers GPU VA lists though, unless the provided >> functions >> + * documentation claims otherwise. >> + */ >> + >> +/** >> + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager >> + * @mgr: pointer to the &drm_gpuva_manager to initialize >> + * @name: the name of the GPU VA space >> + * @start_offset: the start offset of the GPU VA space >> + * @range: the size of the GPU VA space >> + * @reserve_offset: the start of the kernel reserved GPU VA area >> + * @reserve_range: the size of the kernel reserved GPU VA area >> + * >> + * The &drm_gpuva_manager must be initialized with this function >> before use. >> + * >> + * Note that @mgr must be cleared to 0 before calling this function. >> The given >> + * &name is expected to be managed by the surrounding driver structures. >> + */ >> +void >> +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >> +              const char *name, >> +              u64 start_offset, u64 range, >> +              u64 reserve_offset, u64 reserve_range) >> +{ >> +   drm_mm_init(&mgr->va_mm, start_offset, range); >> +   drm_mm_init(&mgr->region_mm, start_offset, range); >> + >> +   mgr->mm_start = start_offset; >> +   mgr->mm_range = range; >> + >> +   mgr->name = name ? name : "unknown"; >> + >> +   memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); >> +   mgr->kernel_alloc_node.start = reserve_offset; >> +   mgr->kernel_alloc_node.size = reserve_range; >> +   drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_manager_init); >> + >> +/** >> + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager >> + * @mgr: pointer to the &drm_gpuva_manager to clean up >> + * >> + * Note that it is a bug to call this function on a manager that still >> + * holds GPU VA mappings. >> + */ >> +void >> +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) >> +{ >> +   mgr->name = NULL; >> +   drm_mm_remove_node(&mgr->kernel_alloc_node); >> +   drm_mm_takedown(&mgr->va_mm); >> +   drm_mm_takedown(&mgr->region_mm); >> +} >> +EXPORT_SYMBOL(drm_gpuva_manager_destroy); >> + >> +static struct drm_gpuva_region * >> +drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >> +{ >> +   struct drm_gpuva_region *reg; >> + >> +   /* Find the VA region the requested range is strictly enclosed >> by. */ >> +   drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { >> +       if (reg->node.start <= addr && >> +           reg->node.start + reg->node.size >= addr + range && >> +           ®->node != &mgr->kernel_alloc_node) >> +           return reg; >> +   } >> + >> +   return NULL; >> +} >> + >> +static bool >> +drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, u64 >> range) >> +{ >> +   return !!drm_gpuva_in_region(mgr, addr, range); >> +} >> + >> +/** >> + * drm_gpuva_insert - insert a &drm_gpuva >> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >> + * @va: the &drm_gpuva to insert >> + * @addr: the start address of the GPU VA >> + * @range: the range of the GPU VA >> + * >> + * Insert a &drm_gpuva with a given address and range into a >> + * &drm_gpuva_manager. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + * >> + * Returns: 0 on success, negative error code on failure. >> + */ >> +int >> +drm_gpuva_insert(struct drm_gpuva_manager *mgr, >> +        struct drm_gpuva *va, >> +        u64 addr, u64 range) >> +{ >> +   struct drm_gpuva_region *reg; >> +   int ret; >> + >> +   if (!va->gem.obj) >> +       return -EINVAL; >> + >> +   reg = drm_gpuva_in_region(mgr, addr, range); >> +   if (!reg) >> +       return -EINVAL; >> + >> +   ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, >> +                     range, 0, >> +                     0, addr, >> +                     addr + range, >> +                     DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); >> +   if (ret) >> +       return ret; >> + >> +   va->mgr = mgr; >> +   va->region = reg; >> + >> +   return 0; >> +} >> +EXPORT_SYMBOL(drm_gpuva_insert); >> + >> +/** >> + * drm_gpuva_link_locked - link a &drm_gpuva >> + * @va: the &drm_gpuva to link >> + * >> + * This adds the given &va to the GPU VA list of the &drm_gem_object >> it is >> + * associated with. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_link_locked(struct drm_gpuva *va) >> +{ >> +   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> +   list_add_tail(&va->head, &va->gem.obj->gpuva.list); >> +} >> +EXPORT_SYMBOL(drm_gpuva_link_locked); >> + >> +/** >> + * drm_gpuva_link_unlocked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This adds the given &va to the GPU VA list of the &drm_gem_object >> it is >> + * associated with. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_link_unlocked(struct drm_gpuva *va) >> +{ >> +   drm_gem_gpuva_lock(va->gem.obj); >> +   drm_gpuva_link_locked(va); >> +   drm_gem_gpuva_unlock(va->gem.obj); >> +} >> +EXPORT_SYMBOL(drm_gpuva_link_unlocked); >> + >> +/** >> + * drm_gpuva_unlink_locked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This removes the given &va from the GPU VA list of the >> &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_unlink_locked(struct drm_gpuva *va) >> +{ >> +   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> +   list_del_init(&va->head); >> +} >> +EXPORT_SYMBOL(drm_gpuva_unlink_locked); >> + >> +/** >> + * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This removes the given &va from the GPU VA list of the >> &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_unlink_unlocked(struct drm_gpuva *va) >> +{ >> +   drm_gem_gpuva_lock(va->gem.obj); >> +   drm_gpuva_unlink_locked(va); >> +   drm_gem_gpuva_unlock(va->gem.obj); >> +} >> +EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); >> + >> +/** >> + * drm_gpuva_destroy_locked - destroy a &drm_gpuva >> + * @va: the &drm_gpuva to destroy >> + * >> + * This removes the given &va from GPU VA list of the &drm_gem_object >> it is >> + * associated with and removes it from the underlaying range allocator. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_destroy_locked(struct drm_gpuva *va) >> +{ >> +   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> + >> +   list_del(&va->head); >> +   drm_mm_remove_node(&va->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_destroy_locked); >> + >> +/** >> + * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva >> + * @va: the &drm_gpuva to destroy >> + * >> + * This removes the given &va from GPU VA list of the &drm_gem_object >> it is >> + * associated with and removes it from the underlaying range allocator. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_destroy_unlocked(struct drm_gpuva *va) >> +{ >> +   drm_gem_gpuva_lock(va->gem.obj); >> +   list_del(&va->head); >> +   drm_gem_gpuva_unlock(va->gem.obj); >> + >> +   drm_mm_remove_node(&va->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); >> + >> +/** >> + * drm_gpuva_find - find a &drm_gpuva >> + * @mgr: the &drm_gpuva_manager to search in >> + * @addr: the &drm_gpuvas address >> + * @range: the &drm_gpuvas range >> + * >> + * Returns: the &drm_gpuva at a given &addr and with a given &range >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find(struct drm_gpuva_manager *mgr, >> +          u64 addr, u64 range) >> +{ >> +   struct drm_gpuva *va; >> + >> +   drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { >> +       if (va->node.start == addr && >> +           va->node.size == range) >> +           return va; >> +   } >> + >> +   return NULL; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find); >> + >> +/** >> + * drm_gpuva_find_prev - find the &drm_gpuva before the given address >> + * @mgr: the &drm_gpuva_manager to search in >> + * @start: the given GPU VA's start address >> + * >> + * Find the adjacent &drm_gpuva before the GPU VA with given &start >> address. >> + * >> + * Note that if there is any free space between the GPU VA mappings >> no mapping >> + * is returned. >> + * >> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) >> +{ >> +   struct drm_mm_node *node; >> + >> +   if (start <= mgr->mm_start || >> +       start > (mgr->mm_start + mgr->mm_range)) >> +       return NULL; >> + >> +   node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); >> +   if (node == &mgr->va_mm.head_node) >> +       return NULL; >> + >> +   return (struct drm_gpuva *)node; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find_prev); >> + >> +/** >> + * drm_gpuva_find_next - find the &drm_gpuva after the given address >> + * @mgr: the &drm_gpuva_manager to search in >> + * @end: the given GPU VA's end address >> + * >> + * Find the adjacent &drm_gpuva after the GPU VA with given &end >> address. >> + * >> + * Note that if there is any free space between the GPU VA mappings >> no mapping >> + * is returned. >> + * >> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) >> +{ >> +   struct drm_mm_node *node; >> + >> +   if (end < mgr->mm_start || >> +       end >= (mgr->mm_start + mgr->mm_range)) >> +       return NULL; >> + >> +   node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); >> +   if (node == &mgr->va_mm.head_node) >> +       return NULL; >> + >> +   return (struct drm_gpuva *)node; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find_next); >> + >> +/** >> + * drm_gpuva_region_insert - insert a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >> + * @reg: the &drm_gpuva_region to insert >> + * @addr: the start address of the GPU VA >> + * @range: the range of the GPU VA >> + * >> + * Insert a &drm_gpuva_region with a given address and range into a >> + * &drm_gpuva_manager. >> + * >> + * Returns: 0 on success, negative error code on failure. >> + */ >> +int >> +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >> +           struct drm_gpuva_region *reg, >> +           u64 addr, u64 range) >> +{ >> +   int ret; >> + >> +   ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, >> +                     range, 0, >> +                     0, addr, >> +                     addr + range, >> +                     DRM_MM_INSERT_LOW| >> +                     DRM_MM_INSERT_ONCE); >> +   if (ret) >> +       return ret; >> + >> +   reg->mgr = mgr; >> + >> +   return 0; >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_insert); >> + >> +/** >> + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager holding the region >> + * @reg: the &drm_gpuva to destroy >> + * >> + * This removes the given ® from the underlaying range allocator. >> + */ >> +void >> +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >> +            struct drm_gpuva_region *reg) >> +{ >> +   struct drm_gpuva *va; >> + >> +   drm_gpuva_for_each_va_in_range(va, mgr, >> +                      reg->node.start, >> +                      reg->node.size) { >> +       WARN(1, "GPU VA region must be empty on destroy.\n"); >> +       return; >> +   } >> + >> +   if (®->node == &mgr->kernel_alloc_node) { >> +       WARN(1, "Can't destroy kernel reserved region.\n"); >> +       return; >> +   } >> + >> +   drm_mm_remove_node(®->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_destroy); >> + >> +/** >> + * drm_gpuva_region_find - find a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager to search in >> + * @addr: the &drm_gpuva_regions address >> + * @range: the &drm_gpuva_regions range >> + * >> + * Returns: the &drm_gpuva_region at a given &addr and with a given >> &range >> + */ >> +struct drm_gpuva_region * >> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >> +             u64 addr, u64 range) >> +{ >> +   struct drm_gpuva_region *reg; >> + >> +   drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) >> +       if (reg->node.start == addr && >> +           reg->node.size == range) >> +           return reg; >> + >> +   return NULL; >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_find); >> + >> +static int >> +gpuva_op_map_new(struct drm_gpuva_op **pop, >> +        u64 addr, u64 range, >> +        struct drm_gem_object *obj, u64 offset) >> +{ >> +   struct drm_gpuva_op *op; >> + >> +   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> +   if (!op) >> +       return -ENOMEM; >> + >> +   op->op = DRM_GPUVA_OP_MAP; >> +   op->map.va.addr = addr; >> +   op->map.va.range = range; >> +   op->map.gem.obj = obj; >> +   op->map.gem.offset = offset; >> + >> +   return 0; >> +} >> + >> +static int >> +gpuva_op_remap_new(struct drm_gpuva_op **pop, >> +          struct drm_gpuva_op_map *prev, >> +          struct drm_gpuva_op_map *next, >> +          struct drm_gpuva_op_unmap *unmap) >> +{ >> +   struct drm_gpuva_op *op; >> +   struct drm_gpuva_op_remap *r; >> + >> +   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> +   if (!op) >> +       return -ENOMEM; >> + >> +   op->op = DRM_GPUVA_OP_REMAP; >> +   r = &op->remap; >> + >> +   if (prev) { >> +       r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); >> +       if (!r->prev) >> +           goto err_free_op; >> +   } >> + >> +   if (next) { >> +       r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); >> +       if (!r->next) >> +           goto err_free_prev; >> +   } >> + >> +   r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); >> +   if (!r->unmap) >> +       goto err_free_next; >> + >> +   return 0; >> + >> +err_free_next: >> +   if (next) >> +       kfree(r->next); >> +err_free_prev: >> +   if (prev) >> +       kfree(r->prev); >> +err_free_op: >> +   kfree(op); >> +   *pop = NULL; >> + >> +   return -ENOMEM; >> +} >> + >> +static int >> +gpuva_op_unmap_new(struct drm_gpuva_op **pop, >> +          struct drm_gpuva *va, bool merge) >> +{ >> +   struct drm_gpuva_op *op; >> + >> +   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> +   if (!op) >> +       return -ENOMEM; >> + >> +   op->op = DRM_GPUVA_OP_UNMAP; >> +   op->unmap.va = va; >> +   op->unmap.keep = merge; >> + >> +   return 0; >> +} >> + >> +#define op_map_new_to_list(_ops, _addr, _range,       \ >> +              _obj, _offset)       \ >> +do {                           \ >> +   struct drm_gpuva_op *op;           \ >> +                           \ >> +   ret = gpuva_op_map_new(&op, _addr, _range,   \ >> +                  _obj, _offset);       \ >> +   if (ret)                   \ >> +       goto err_free_ops;           \ >> +                           \ >> +   list_add_tail(&op->entry, _ops);       \ >> +} while (0) >> + >> +#define op_remap_new_to_list(_ops, _prev, _next,   \ >> +                _unmap)           \ >> +do {                           \ >> +   struct drm_gpuva_op *op;           \ >> +                           \ >> +   ret = gpuva_op_remap_new(&op, _prev, _next,   \ >> +                _unmap);       \ >> +   if (ret)                   \ >> +       goto err_free_ops;           \ >> +                           \ >> +   list_add_tail(&op->entry, _ops);       \ >> +} while (0) >> + >> +#define op_unmap_new_to_list(_ops, _gpuva, _merge)   \ >> +do {                           \ >> +   struct drm_gpuva_op *op;           \ >> +                           \ >> +   ret = gpuva_op_unmap_new(&op, _gpuva, _merge);   \ >> +   if (ret)                   \ >> +       goto err_free_ops;           \ >> +                           \ >> +   list_add_tail(&op->entry, _ops);       \ >> +} while (0) >> + >> +/** >> + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split >> and merge >> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >> + * @req_addr: the start address of the new mapping >> + * @req_range: the range of the new mapping >> + * @req_obj: the &drm_gem_object to map >> + * @req_offset: the offset within the &drm_gem_object >> + * >> + * This function creates a list of operations to perform splitting >> and merging >> + * of existent mapping(s) with the newly requested one. >> + * >> + * The list can be iterated with &drm_gpuva_for_each_op and must be >> processed >> + * in the given order. It can contain map, unmap and remap >> operations, but it >> + * also can be empty if no operation is required, e.g. if the >> requested mapping >> + * already exists is the exact same way. >> + * >> + * There can be an arbitrary amount of unmap operations, a maximum of >> two remap >> + * operations and a single map operation. The latter one, if existent, >> + * represents the original map operation requested by the caller. >> Please note >> + * that the map operation might has been modified, e.g. if it was >> + * merged with an existent mapping. >> + * >> + * Note that before calling this function again with another mapping >> request it >> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA >> space. >> + * The previously obtained operations must be either processed or >> abandoned. >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * After the caller finished processing the returned &drm_gpuva_ops, >> they must >> + * be freed with &drm_gpuva_ops_free. >> + * >> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on >> failure >> + */ >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >> +               u64 req_addr, u64 req_range, >> +               struct drm_gem_object *req_obj, u64 req_offset) >> +{ >> +   struct drm_gpuva_ops *ops; >> +   struct drm_gpuva *va, *prev = NULL; >> +   u64 req_end = req_addr + req_range; >> +   bool skip_pmerge = false, skip_nmerge = false; >> +   int ret; >> + >> +   if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) >> +       return ERR_PTR(-EINVAL); >> + >> +   ops = kzalloc(sizeof(*ops), GFP_KERNEL); >> +   if (!ops) >> +       return ERR_PTR(-ENOMEM); >> + >> +   INIT_LIST_HEAD(&ops->list); >> + >> +   drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >> +       struct drm_gem_object *obj = va->gem.obj; >> +       u64 offset = va->gem.offset; >> +       u64 addr = va->node.start; >> +       u64 range = va->node.size; >> +       u64 end = addr + range; >> + >> +       /* Generally, we want to skip merging with potential mappings >> +        * left and right of the requested one when we found a >> +        * collision, since merging happens in this loop already. >> +        * >> +        * However, there is one exception when the requested mapping >> +        * spans into a free VM area. If this is the case we might >> +        * still hit the boundary of another mapping before and/or >> +        * after the free VM area. >> +        */ >> +       skip_pmerge = true; >> +       skip_nmerge = true; >> + >> +       if (addr == req_addr) { >> +           bool merge = obj == req_obj && >> +                    offset == req_offset; >> +           if (end == req_end) { >> +               if (merge) >> +                   goto done; >> + >> +               op_unmap_new_to_list(&ops->list, va, false); >> +               break; >> +           } >> + >> +           if (end < req_end) { >> +               skip_nmerge = false; >> +               op_unmap_new_to_list(&ops->list, va, merge); >> +               goto next; >> +           } >> + >> +           if (end > req_end) { >> +               struct drm_gpuva_op_map n = { >> +                   .va.addr = req_end, >> +                   .va.range = range - req_range, >> +                   .gem.obj = obj, >> +                   .gem.offset = offset + req_range, >> +               }; >> +               struct drm_gpuva_op_unmap u = { .va = va }; >> + >> +               if (merge) >> +                   goto done; >> + >> +               op_remap_new_to_list(&ops->list, NULL, &n, &u); >> +               break; >> +           } >> +       } else if (addr < req_addr) { >> +           u64 ls_range = req_addr - addr; >> +           struct drm_gpuva_op_map p = { >> +               .va.addr = addr, >> +               .va.range = ls_range, >> +               .gem.obj = obj, >> +               .gem.offset = offset, >> +           }; >> +           struct drm_gpuva_op_unmap u = { .va = va }; >> +           bool merge = obj == req_obj && >> +                    offset + ls_range == req_offset; >> + >> +           if (end == req_end) { >> +               if (merge) >> +                   goto done; >> + >> +               op_remap_new_to_list(&ops->list, &p, NULL, &u); >> +               break; >> +           } >> + >> +           if (end < req_end) { >> +               u64 new_addr = addr; >> +               u64 new_range = req_range + ls_range; >> +               u64 new_offset = offset; >> + >> +               /* We validated that the requested mapping is >> +                * within a single VA region already. >> +                * Since it overlaps the current mapping (which >> +                * can't cross a VA region boundary) we can be >> +                * sure that we're still within the boundaries >> +                * of the same VA region after merging. >> +                */ >> +               if (merge) { >> +                   req_offset = new_offset; >> +                   req_addr = new_addr; >> +                   req_range = new_range; >> +                   op_unmap_new_to_list(&ops->list, va, true); >> +                   goto next; >> +               } >> + >> +               op_remap_new_to_list(&ops->list, &p, NULL, &u); >> +               goto next; >> +           } >> + >> +           if (end > req_end) { >> +               struct drm_gpuva_op_map n = { >> +                   .va.addr = req_end, >> +                   .va.range = end - req_end, >> +                   .gem.obj = obj, >> +                   .gem.offset = offset + ls_range + >> +                             req_range, >> +               }; >> + >> +               if (merge) >> +                   goto done; >> + >> +               op_remap_new_to_list(&ops->list, &p, &n, &u); >> +               break; >> +           } >> +       } else if (addr > req_addr) { >> +           bool merge = obj == req_obj && >> +                    offset == req_offset + >> +                          (addr - req_addr); >> +           if (!prev) >> +               skip_pmerge = false; >> + >> +           if (end == req_end) { >> +               op_unmap_new_to_list(&ops->list, va, merge); >> +               break; >> +           } >> + >> +           if (end < req_end) { >> +               skip_nmerge = false; >> +               op_unmap_new_to_list(&ops->list, va, merge); >> +               goto next; >> +           } >> + >> +           if (end > req_end) { >> +               struct drm_gpuva_op_map n = { >> +                   .va.addr = req_end, >> +                   .va.range = end - req_end, >> +                   .gem.obj = obj, >> +                   .gem.offset = offset + req_end - addr, >> +               }; >> +               struct drm_gpuva_op_unmap u = { .va = va }; >> +               u64 new_end = end; >> +               u64 new_range = new_end - req_addr; >> + >> +               /* We validated that the requested mapping is >> +                * within a single VA region already. >> +                * Since it overlaps the current mapping (which >> +                * can't cross a VA region boundary) we can be >> +                * sure that we're still within the boundaries >> +                * of the same VA region after merging. >> +                */ >> +               if (merge) { >> +                   req_end = new_end; >> +                   req_range = new_range; >> +                   op_unmap_new_to_list(&ops->list, va, true); >> +                   break; >> +               } >> + >> +               op_remap_new_to_list(&ops->list, NULL, &n, &u); >> +               break; >> +           } >> +       } >> +next: >> +       prev = va; >> +   } >> + >> +   va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); >> +   if (va) { >> +       struct drm_gem_object *obj = va->gem.obj; >> +       u64 offset = va->gem.offset; >> +       u64 addr = va->node.start; >> +       u64 range = va->node.size; >> +       u64 new_offset = offset; >> +       u64 new_addr = addr; >> +       u64 new_range = req_range + range; >> +       bool merge = obj == req_obj && >> +                offset + range == req_offset; >> + >> +       /* Don't merge over VA region boundaries. */ >> +       merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); >> +       if (merge) { >> +           op_unmap_new_to_list(&ops->list, va, true); >> + >> +           req_offset = new_offset; >> +           req_addr = new_addr; >> +           req_range = new_range; >> +       } >> +   } >> + >> +   va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); >> +   if (va) { >> +       struct drm_gem_object *obj = va->gem.obj; >> +       u64 offset = va->gem.offset; >> +       u64 addr = va->node.start; >> +       u64 range = va->node.size; >> +       u64 end = addr + range; >> +       u64 new_range = req_range + range; >> +       u64 new_end = end; >> +       bool merge = obj == req_obj && >> +                offset == req_offset + req_range; >> + >> +       /* Don't merge over VA region boundaries. */ >> +       merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); >> +       if (merge) { >> +           op_unmap_new_to_list(&ops->list, va, true); >> + >> +           req_range = new_range; >> +           req_end = new_end; >> +       } >> +   } >> + >> +   op_map_new_to_list(&ops->list, >> +              req_addr, req_range, >> +              req_obj, req_offset); >> + >> +done: >> +   return ops; >> + >> +err_free_ops: >> +   drm_gpuva_ops_free(ops); >> +   return ERR_PTR(ret); >> +} >> +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); >> + >> +#undef op_map_new_to_list >> +#undef op_remap_new_to_list >> +#undef op_unmap_new_to_list >> + >> +/** >> + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to >> split on unmap >> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >> + * @req_addr: the start address of the range to unmap >> + * @req_range: the range of the mappings to unmap >> + * >> + * This function creates a list of operations to perform unmapping >> and, if >> + * required, splitting of the mappings overlapping the unmap range. >> + * >> + * The list can be iterated with &drm_gpuva_for_each_op and must be >> processed >> + * in the given order. It can contain unmap and remap operations, >> depending on >> + * whether there are actual overlapping mappings to split. >> + * >> + * There can be an arbitrary amount of unmap operations and a maximum >> of two >> + * remap operations. >> + * >> + * Note that before calling this function again with another range to >> unmap it >> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA >> space. >> + * The previously obtained operations must be processed or abandoned. >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * After the caller finished processing the returned &drm_gpuva_ops, >> they must >> + * be freed with &drm_gpuva_ops_free. >> + * >> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on >> failure >> + */ >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >> +                 u64 req_addr, u64 req_range) >> +{ >> +   struct drm_gpuva_ops *ops; >> +   struct drm_gpuva_op *op; >> +   struct drm_gpuva_op_remap *r; >> +   struct drm_gpuva *va; >> +   u64 req_end = req_addr + req_range; >> +   int ret; >> + >> +   ops = kzalloc(sizeof(*ops), GFP_KERNEL); >> +   if (!ops) >> +       return ERR_PTR(-ENOMEM); >> + >> +   INIT_LIST_HEAD(&ops->list); >> + >> +   drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >> +       struct drm_gem_object *obj = va->gem.obj; >> +       u64 offset = va->gem.offset; >> +       u64 addr = va->node.start; >> +       u64 range = va->node.size; >> +       u64 end = addr + range; >> + >> +       op = kzalloc(sizeof(*op), GFP_KERNEL); >> +       if (!op) { >> +           ret = -ENOMEM; >> +           goto err_free_ops; >> +       } >> + >> +       r = &op->remap; >> + >> +       if (addr < req_addr) { >> +           r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); >> +           if (!r->prev) { >> +               ret = -ENOMEM; >> +               goto err_free_op; >> +           } >> + >> +           r->prev->va.addr = addr; >> +           r->prev->va.range = req_addr - addr; >> +           r->prev->gem.obj = obj; >> +           r->prev->gem.offset = offset; >> +       } >> + >> +       if (end > req_end) { >> +           r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); >> +           if (!r->next) { >> +               ret = -ENOMEM; >> +               goto err_free_prev; >> +           } >> + >> +           r->next->va.addr = req_end; >> +           r->next->va.range = end - req_end; >> +           r->next->gem.obj = obj; >> +           r->next->gem.offset = offset + (req_end - addr); >> +       } >> + >> +       if (op->remap.prev || op->remap.next) { >> +           op->op = DRM_GPUVA_OP_REMAP; >> +           r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); >> +           if (!r->unmap) { >> +               ret = -ENOMEM; >> +               goto err_free_next; >> +           } >> + >> +           r->unmap->va = va; >> +       } else { >> +           op->op = DRM_GPUVA_OP_UNMAP; >> +           op->unmap.va = va; >> +       } >> + >> +       list_add_tail(&op->entry, &ops->list); >> +   } >> + >> +   return ops; >> + >> +err_free_next: >> +   if (r->next) >> +       kfree(r->next); >> +err_free_prev: >> +   if (r->prev) >> +       kfree(r->prev); >> +err_free_op: >> +   kfree(op); >> +err_free_ops: >> +   drm_gpuva_ops_free(ops); >> +   return ERR_PTR(ret); >> +} >> +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); >> + >> +/** >> + * drm_gpuva_ops_free - free the given &drm_gpuva_ops >> + * @ops: the &drm_gpuva_ops to free >> + * >> + * Frees the given &drm_gpuva_ops structure including all the ops >> associated >> + * with it. >> + */ >> +void >> +drm_gpuva_ops_free(struct drm_gpuva_ops *ops) >> +{ >> +   struct drm_gpuva_op *op, *next; >> + >> +   drm_gpuva_for_each_op_safe(op, next, ops) { >> +       list_del(&op->entry); >> +       if (op->op == DRM_GPUVA_OP_REMAP) { >> +           if (op->remap.prev) >> +               kfree(op->remap.prev); >> + >> +           if (op->remap.next) >> +               kfree(op->remap.next); >> + >> +           kfree(op->remap.unmap); >> +       } >> +       kfree(op); >> +   } >> + >> +   kfree(ops); >> +} >> +EXPORT_SYMBOL(drm_gpuva_ops_free); >> diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h >> index d7c521e8860f..6feacd93aca6 100644 >> --- a/include/drm/drm_drv.h >> +++ b/include/drm/drm_drv.h >> @@ -104,6 +104,12 @@ enum drm_driver_feature { >>     * acceleration should be handled by two drivers that are >> connected using auxiliary bus. >>     */ >>     DRIVER_COMPUTE_ACCEL           = BIT(7), >> +   /** >> +    * @DRIVER_GEM_GPUVA: >> +    * >> +    * Driver supports user defined GPU VA bindings for GEM objects. >> +    */ >> +   DRIVER_GEM_GPUVA       = BIT(8), >> >>     /* IMPORTANT: Below are all the legacy flags, add new ones above. */ >> >> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h >> index 772a4adf5287..4a3679034966 100644 >> --- a/include/drm/drm_gem.h >> +++ b/include/drm/drm_gem.h >> @@ -36,6 +36,8 @@ >> >> #include <linux/kref.h> >> #include <linux/dma-resv.h> >> +#include <linux/list.h> >> +#include <linux/mutex.h> >> >> #include <drm/drm_vma_manager.h> >> >> @@ -337,6 +339,17 @@ struct drm_gem_object { >>     */ >>     struct dma_resv _resv; >> >> +   /** >> +    * @gpuva: >> +    * >> +    * Provides the list and list mutex of GPU VAs attached to this >> +    * GEM object. >> +    */ >> +   struct { >> +       struct list_head list; >> +       struct mutex mutex; >> +   } gpuva; >> + >>     /** >>     * @funcs: >>     * >> @@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru >> *lru, struct drm_gem_object *obj); >> unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned >> nr_to_scan, >>                   bool (*shrink)(struct drm_gem_object *obj)); >> >> +/** >> + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object >> + * @obj: the &drm_gem_object >> + * >> + * This initializes the &drm_gem_object's &drm_gpuva list and the mutex >> + * protecting it. >> + * >> + * Calling this function is only necessary for drivers intending to >> support the >> + * &drm_driver_feature DRIVER_GEM_GPUVA. >> + */ >> +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) >> +{ >> +   INIT_LIST_HEAD(&obj->gpuva.list); >> +   mutex_init(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex >> + * @obj: the &drm_gem_object >> + * >> + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva >> list. >> + */ >> +static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) >> +{ >> +   mutex_lock(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex >> + * @obj: the &drm_gem_object >> + * >> + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva >> list. >> + */ >> +static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) >> +{ >> +   mutex_unlock(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >> + * >> + * This iterator walks over all &drm_gpuva structures associated with >> the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gem_for_each_gpuva(entry, obj) \ >> +   list_for_each_entry(entry, &obj->gpuva.list, head) >> + >> +/** >> + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list >> of gpuvas >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @next: &next &drm_gpuva to store the next step >> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >> + * >> + * This iterator walks over all &drm_gpuva structures associated with >> the >> + * &drm_gem_object. It is implemented with >> list_for_each_entry_safe(), hence >> + * it is save against removal of elements. >> + */ >> +#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ >> +   list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) >> + >> #endif /* __DRM_GEM_H__ */ >> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h >> new file mode 100644 >> index 000000000000..adeb0c916e91 >> --- /dev/null >> +++ b/include/drm/drm_gpuva_mgr.h >> @@ -0,0 +1,527 @@ >> +// SPDX-License-Identifier: GPL-2.0 >> + >> +#ifndef __DRM_GPUVA_MGR_H__ >> +#define __DRM_GPUVA_MGR_H__ >> + >> +/* >> + * Copyright (c) 2022 Red Hat. >> + * >> + * Permission is hereby granted, free of charge, to any person >> obtaining a >> + * copy of this software and associated documentation files (the >> "Software"), >> + * to deal in the Software without restriction, including without >> limitation >> + * the rights to use, copy, modify, merge, publish, distribute, >> sublicense, >> + * and/or sell copies of the Software, and to permit persons to whom the >> + * Software is furnished to do so, subject to the following conditions: >> + * >> + * The above copyright notice and this permission notice shall be >> included in >> + * all copies or substantial portions of the Software. >> + * >> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, >> EXPRESS OR >> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF >> MERCHANTABILITY, >> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT >> SHALL >> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, >> DAMAGES OR >> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >> + * OTHER DEALINGS IN THE SOFTWARE. >> + */ >> + >> +#include <drm/drm_mm.h> >> +#include <linux/mm.h> >> +#include <linux/rbtree.h> >> +#include <linux/spinlock.h> >> +#include <linux/types.h> >> + >> +struct drm_gpuva_region; >> +struct drm_gpuva; >> +struct drm_gpuva_ops; >> + >> +/** >> + * struct drm_gpuva_manager - DRM GPU VA Manager >> + * >> + * The DRM GPU VA Manager keeps track of a GPU's virtual address >> space by using >> + * the &drm_mm range allocator. Typically, this structure is embedded >> in bigger >> + * driver structures. >> + * >> + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. >> bytes or >> + * pages. >> + * >> + * There should be one manager instance per GPU virtual address space. >> + */ >> +struct drm_gpuva_manager { >> +   /** >> +    * @name: the name of the DRM GPU VA space >> +    */ >> +   const char *name; >> + >> +   /** >> +    * @mm_start: start of the VA space >> +    */ >> +   u64 mm_start; >> + >> +   /** >> +    * @mm_range: length of the VA space >> +    */ >> +   u64 mm_range; >> + >> +   /** >> +    * @region_mm: the &drm_mm range allocator to track GPU VA regions >> +    */ >> +   struct drm_mm region_mm; >> + >> +   /** >> +    * @va_mm: the &drm_mm range allocator to track GPU VA mappings >> +    */ >> +   struct drm_mm va_mm; >> + >> +   /** >> +    * @kernel_alloc_node: >> +    * >> +    * &drm_mm_node representing the address space cutout reserved for >> +    * the kernel >> +    */ >> +   struct drm_mm_node kernel_alloc_node; >> +}; >> + >> +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >> +               const char *name, >> +               u64 start_offset, u64 range, >> +               u64 reserve_offset, u64 reserve_range); >> +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); >> + >> +/** >> + * struct drm_gpuva_region - structure to track a portion of GPU VA >> space >> + * >> + * This structure represents a portion of a GPUs VA space and is >> associated >> + * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >> + * >> + * GPU VA mappings, represented by &drm_gpuva objects, are restricted >> to be >> + * placed within a &drm_gpuva_region. >> + */ >> +struct drm_gpuva_region { >> +   /** >> +    * @node: the &drm_mm_node to track the GPU VA region >> +    */ >> +   struct drm_mm_node node; >> + >> +   /** >> +    * @mgr: the &drm_gpuva_manager this object is associated with >> +    */ >> +   struct drm_gpuva_manager *mgr; >> + >> +   /** >> +    * @sparse: indicates whether this region is sparse >> +    */ >> +   bool sparse; >> +}; >> + >> +struct drm_gpuva_region * >> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >> +             u64 addr, u64 range); >> +int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >> +               struct drm_gpuva_region *reg, >> +               u64 addr, u64 range); >> +void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >> +                 struct drm_gpuva_region *reg); >> + >> +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, >> +            struct drm_gpuva *va, >> +            u64 addr, u64 range); >> +/** >> + * drm_gpuva_for_each_region_in_range - iternator to walk over a >> range of nodes >> + * @node__: &drm_gpuva_region structure to assign to in each >> iteration step >> + * @gpuva__: &drm_gpuva_manager structure to walk >> + * @start__: starting offset, the first node will overlap this >> + * @end__: ending offset, the last node will start before this (but >> may overlap) >> + * >> + * This iterator walks over all nodes in the range allocator that lie >> + * between @start and @end. It is implemented similarly to >> list_for_each(), >> + * but is using &drm_mm's internal interval tree to accelerate the >> search for >> + * the starting node, and hence isn't safe against removal of >> elements. It >> + * assumes that @end is within (or is the upper limit of) the >> &drm_gpuva_manager. >> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >> + * iterator may walk over the special _unallocated_ &drm_mm.head_node >> of the >> + * backing &drm_mm, and may even continue indefinitely. >> + */ >> +#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, >> end__) \ >> +   for (node__ = (struct drm_gpuva_region >> *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ >> +                                    (start__), (end__)-1); \ >> +        node__->node.start < (end__); \ >> +        node__ = (struct drm_gpuva_region >> *)list_next_entry(&node__->node, node_list)) >> + >> +/** >> + * drm_gpuva_for_each_region - iternator to walk over a range of nodes >> + * @entry: &drm_gpuva_region structure to assign to in each iteration >> step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva_region structures >> associated with the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gpuva_for_each_region(entry, gpuva) \ >> +   list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), >> node.node_list) >> + >> +/** >> + * drm_gpuva_for_each_region_safe - iternator to safely walk over a >> range of >> + * nodes >> + * @entry: &drm_gpuva_region structure to assign to in each iteration >> step >> + * @next: &next &drm_gpuva_region to store the next step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva_region structures >> associated with the >> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), >> so save >> + * against removal of elements. >> + */ >> +#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ >> +   list_for_each_entry_safe(entry, next, >> drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >> + >> + >> +/** >> + * enum drm_gpuva_flags - flags for struct drm_gpuva >> + */ >> +enum drm_gpuva_flags { >> +   /** >> +    * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is >> swapped >> +    */ >> +   DRM_GPUVA_SWAPPED = (1 << 0), >> +}; >> + >> +/** >> + * struct drm_gpuva - structure to track a GPU VA mapping >> + * >> + * This structure represents a GPU VA mapping and is associated with a >> + * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >> + * >> + * Typically, this structure is embedded in bigger driver structures. >> + */ >> +struct drm_gpuva { >> +   /** >> +    * @node: the &drm_mm_node to track the GPU VA mapping >> +    */ >> +   struct drm_mm_node node; >> + >> +   /** >> +    * @mgr: the &drm_gpuva_manager this object is associated with >> +    */ >> +   struct drm_gpuva_manager *mgr; >> + >> +   /** >> +    * @region: the &drm_gpuva_region the &drm_gpuva is mapped in >> +    */ >> +   struct drm_gpuva_region *region; >> + >> +   /** >> +    * @head: the &list_head to attach this object to a &drm_gem_object >> +    */ >> +   struct list_head head; >> + >> +   /** >> +    * @flags: the &drm_gpuva_flags for this mapping >> +    */ >> +   enum drm_gpuva_flags flags; >> + >> +   /** >> +    * @gem: structure containing the &drm_gem_object and it's offset >> +    */ >> +   struct { >> +       /** >> +        * @offset: the offset within the &drm_gem_object >> +        */ >> +       u64 offset; >> + >> +       /** >> +        * @obj: the mapped &drm_gem_object >> +        */ >> +       struct drm_gem_object *obj; >> +   } gem; >> +}; >> + >> +void drm_gpuva_link_locked(struct drm_gpuva *va); >> +void drm_gpuva_link_unlocked(struct drm_gpuva *va); >> +void drm_gpuva_unlink_locked(struct drm_gpuva *va); >> +void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); >> + >> +void drm_gpuva_destroy_locked(struct drm_gpuva *va); >> +void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); >> + >> +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, >> +                u64 addr, u64 range); >> +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, >> u64 start); >> +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, >> u64 end); >> + >> +/** >> + * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva is >> swapped >> + * @va: the &drm_gpuva to set the swap flag of >> + * @swap: indicates whether the &drm_gpuva is swapped >> + */ >> +static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) >> +{ >> +   if (swap) >> +       va->flags |= DRM_GPUVA_SWAPPED; >> +   else >> +       va->flags &= ~DRM_GPUVA_SWAPPED; >> +} >> + >> +/** >> + * drm_gpuva_swapped - indicates whether the backing BO of this >> &drm_gpuva >> + * is swapped >> + * @va: the &drm_gpuva to check >> + */ >> +static inline bool drm_gpuva_swapped(struct drm_gpuva *va) >> +{ >> +   return va->flags & DRM_GPUVA_SWAPPED; >> +} >> + >> +/** >> + * drm_gpuva_for_each_va_in_range - iternator to walk over a range of >> nodes >> + * @node__: &drm_gpuva structure to assign to in each iteration step >> + * @gpuva__: &drm_gpuva_manager structure to walk >> + * @start__: starting offset, the first node will overlap this >> + * @end__: ending offset, the last node will start before this (but >> may overlap) >> + * >> + * This iterator walks over all nodes in the range allocator that lie >> + * between @start and @end. It is implemented similarly to >> list_for_each(), >> + * but is using &drm_mm's internal interval tree to accelerate the >> search for >> + * the starting node, and hence isn't safe against removal of >> elements. It >> + * assumes that @end is within (or is the upper limit of) the >> &drm_gpuva_manager. >> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >> + * iterator may walk over the special _unallocated_ &drm_mm.head_node >> of the >> + * backing &drm_mm, and may even continue indefinitely. >> + */ >> +#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, >> end__) \ >> +   for (node__ = (struct drm_gpuva >> *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ >> +                                 (start__), (end__)-1); \ >> +        node__->node.start < (end__); \ >> +        node__ = (struct drm_gpuva *)list_next_entry(&node__->node, >> node_list)) >> + >> +/** >> + * drm_gpuva_for_each_va - iternator to walk over a range of nodes >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva structures associated with >> the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gpuva_for_each_va(entry, gpuva) \ >> +   list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), >> node.node_list) >> + >> +/** >> + * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of >> + * nodes >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @next: &next &drm_gpuva to store the next step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva structures associated with >> the >> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), >> so save >> + * against removal of elements. >> + */ >> +#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ >> +   list_for_each_entry_safe(entry, next, >> drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >> + >> +/** >> + * enum drm_gpuva_op_type - GPU VA operation type >> + * >> + * Operations to alter the GPU VA mappings tracked by the >> &drm_gpuva_manager >> + * can be map, remap or unmap operations. >> + */ >> +enum drm_gpuva_op_type { >> +   /** >> +    * @DRM_GPUVA_OP_MAP: the map op type >> +    */ >> +   DRM_GPUVA_OP_MAP, >> + >> +   /** >> +    * @DRM_GPUVA_OP_REMAP: the remap op type >> +    */ >> +   DRM_GPUVA_OP_REMAP, >> + >> +   /** >> +    * @DRM_GPUVA_OP_UNMAP: the unmap op type >> +    */ >> +   DRM_GPUVA_OP_UNMAP, >> +}; >> + >> +/** >> + * struct drm_gpuva_op_map - GPU VA map operation >> + * >> + * This structure represents a single map operation generated by the >> + * DRM GPU VA manager. >> + */ >> +struct drm_gpuva_op_map { >> +   /** >> +    * @va: structure containing address and range of a map >> +    * operation >> +    */ >> +   struct { >> +       /** >> +        * @addr: the base address of the new mapping >> +        */ >> +       u64 addr; >> + >> +       /** >> +        * @range: the range of the new mapping >> +        */ >> +       u64 range; >> +   } va; >> + >> +   /** >> +    * @gem: structure containing the &drm_gem_object and it's offset >> +    */ >> +   struct { >> +       /** >> +        * @offset: the offset within the &drm_gem_object >> +        */ >> +       u64 offset; >> + >> +       /** >> +        * @obj: the &drm_gem_object to map >> +        */ >> +       struct drm_gem_object *obj; >> +   } gem; >> +}; >> + >> +/** >> + * struct drm_gpuva_op_unmap - GPU VA unmap operation >> + * >> + * This structure represents a single unmap operation generated by the >> + * DRM GPU VA manager. >> + */ >> +struct drm_gpuva_op_unmap { >> +   /** >> +    * @va: the &drm_gpuva to unmap >> +    */ >> +   struct drm_gpuva *va; >> + >> +   /** >> +    * @keep: >> +    * >> +    * Indicates whether this &drm_gpuva is physically contiguous >> with the >> +    * original mapping request. >> +    * >> +    * Optionally, if &keep is set, drivers may keep the actual page >> table >> +    * mappings for this &drm_gpuva, adding the missing page table >> entries >> +    * only and update the &drm_gpuva_manager accordingly. >> +    */ >> +   bool keep; >> +}; >> + >> +/** >> + * struct drm_gpuva_op_remap - GPU VA remap operation >> + * >> + * This represents a single remap operation generated by the DRM GPU >> VA manager. >> + * >> + * A remap operation is generated when an existing GPU VA mmapping is >> split up >> + * by inserting a new GPU VA mapping or by partially unmapping existent >> + * mapping(s), hence it consists of a maximum of two map and one unmap >> + * operation. >> + * >> + * The @unmap operation takes care of removing the original existing >> mapping. >> + * @prev is used to remap the preceding part, @next the subsequent part. >> + * >> + * If either a new mapping's start address is aligned with the start >> address >> + * of the old mapping or the new mapping's end address is aligned >> with the >> + * end address of the old mapping, either @prev or @next is NULL. >> + * >> + * Note, the reason for a dedicated remap operation, rather than >> arbitrary >> + * unmap and map operations, is to give drivers the chance of >> extracting driver >> + * specific data for creating the new mappings from the unmap >> operations's >> + * &drm_gpuva structure which typically is embedded in larger driver >> specific >> + * structures. >> + */ >> +struct drm_gpuva_op_remap { >> +   /** >> +    * @prev: the preceding part of a split mapping >> +    */ >> +   struct drm_gpuva_op_map *prev; >> + >> +   /** >> +    * @next: the subsequent part of a split mapping >> +    */ >> +   struct drm_gpuva_op_map *next; >> + >> +   /** >> +    * @unmap: the unmap operation for the original existing mapping >> +    */ >> +   struct drm_gpuva_op_unmap *unmap; >> +}; >> + >> +/** >> + * struct drm_gpuva_op - GPU VA operation >> + * >> + * This structure represents a single generic operation, which can be >> either >> + * map, unmap or remap. >> + * >> + * The particular type of the operation is defined by @op. >> + */ >> +struct drm_gpuva_op { >> +   /** >> +    * @entry: >> +    * >> +    * The &list_head used to distribute instances of this struct within >> +    * &drm_gpuva_ops. >> +    */ >> +   struct list_head entry; >> + >> +   /** >> +    * @op: the type of the operation >> +    */ >> +   enum drm_gpuva_op_type op; >> + >> +   union { >> +       /** >> +        * @map: the map operation >> +        */ >> +       struct drm_gpuva_op_map map; >> + >> +       /** >> +        * @unmap: the unmap operation >> +        */ >> +       struct drm_gpuva_op_unmap unmap; >> + >> +       /** >> +        * @remap: the remap operation >> +        */ >> +       struct drm_gpuva_op_remap remap; >> +   }; >> +}; >> + >> +/** >> + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op >> + */ >> +struct drm_gpuva_ops { >> +   /** >> +    * @list: the &list_head >> +    */ >> +   struct list_head list; >> +}; >> + >> +/** >> + * drm_gpuva_for_each_op - iterator to walk over all ops >> + * @op: &drm_gpuva_op to assign in each iteration step >> + * @ops: &drm_gpuva_ops to walk >> + * >> + * This iterator walks over all ops within a given list of operations. >> + */ >> +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, >> &(ops)->list, entry) >> + >> +/** >> + * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops >> + * @op: &drm_gpuva_op to assign in each iteration step >> + * @next: &next &drm_gpuva_op to store the next step >> + * @ops: &drm_gpuva_ops to walk >> + * >> + * This iterator walks over all ops within a given list of >> operations. It is >> + * implemented with list_for_each_safe(), so save against removal of >> elements. >> + */ >> +#define drm_gpuva_for_each_op_safe(op, next, ops) \ >> +   list_for_each_entry_safe(op, next, &(ops)->list, entry) >> + >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >> +               u64 addr, u64 range, >> +               struct drm_gem_object *obj, u64 offset); >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >> +                 u64 addr, u64 range); >> +void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); >> + >> +#endif /* __DRM_GPUVA_MGR_H__ */ >> -- >> 2.39.0 >> >
On Tue, Jan 24, 2023 at 01:11:55AM +0100, Danilo Krummrich wrote: >On 1/24/23 00:23, Niranjana Vishwanathapura wrote: >>On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: >>>This adds the infrastructure for a manager implementation to keep track >>>of GPU virtual address (VA) mappings. >>> >>>New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers >>>start implementing, allow userspace applications to request multiple and >>>arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is >>>intended to serve the following purposes in this context. >>> >>>1) Provide a dedicated range allocator to track GPU VA allocations and >>> mappings, making use of the drm_mm range allocator. >>> >>>2) Generically connect GPU VA mappings to their backing buffers, in >>> particular DRM GEM objects. >>> >>>3) Provide a common implementation to perform more complex mapping >>> operations on the GPU VA space. In particular splitting and merging >>> of GPU VA mappings, e.g. for intersecting mapping requests or partial >>> unmap requests. >>> >>>Idea-suggested-by: Dave Airlie <airlied@redhat.com> >>>Signed-off-by: Danilo Krummrich <dakr@redhat.com> >>>--- >>>Documentation/gpu/drm-mm.rst   |  31 + >>>drivers/gpu/drm/Makefile       |   1 + >>>drivers/gpu/drm/drm_gem.c      |   3 + >>>drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ >>>include/drm/drm_drv.h          |   6 + >>>include/drm/drm_gem.h          |  75 ++ >>>include/drm/drm_gpuva_mgr.h    | 527 ++++++++++++ >>>7 files changed, 1966 insertions(+) >>>create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c >>>create mode 100644 include/drm/drm_gpuva_mgr.h >>> >>>diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst >>>index a52e6f4117d6..c9f120cfe730 100644 >>>--- a/Documentation/gpu/drm-mm.rst >>>+++ b/Documentation/gpu/drm-mm.rst >>>@@ -466,6 +466,37 @@ DRM MM Range Allocator Function References >>>.. kernel-doc:: drivers/gpu/drm/drm_mm.c >>>  :export: >>> >>>+DRM GPU VA Manager >>>+================== >>>+ >>>+Overview >>>+-------- >>>+ >>>+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>>+  :doc: Overview >>>+ >>>+Split and Merge >>>+--------------- >>>+ >>>+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>>+  :doc: Split and Merge >>>+ >>>+Locking >>>+------- >>>+ >>>+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>>+  :doc: Locking >>>+ >>>+ >>>+DRM GPU VA Manager Function References >>>+-------------------------------------- >>>+ >>>+.. kernel-doc:: include/drm/drm_gpuva_mgr.h >>>+  :internal: >>>+ >>>+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>>+  :export: >>>+ >>>DRM Buddy Allocator >>>=================== >>> >>>diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile >>>index 4fe190aee584..de2ffca3b6e4 100644 >>>--- a/drivers/gpu/drm/Makefile >>>+++ b/drivers/gpu/drm/Makefile >>>@@ -45,6 +45,7 @@ drm-y := \ >>>    drm_vblank.o \ >>>    drm_vblank_work.o \ >>>    drm_vma_manager.o \ >>>+   drm_gpuva_mgr.o \ >>>    drm_writeback.o >>>drm-$(CONFIG_DRM_LEGACY) += \ >>>    drm_agpsupport.o \ >>>diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c >>>index 59a0bb5ebd85..65115fe88627 100644 >>>--- a/drivers/gpu/drm/drm_gem.c >>>+++ b/drivers/gpu/drm/drm_gem.c >>>@@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct >>>drm_device *dev, >>>    if (!obj->resv) >>>       obj->resv = &obj->_resv; >>> >>>+   if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) >>>+       drm_gem_gpuva_init(obj); >>>+ >>>    drm_vma_node_reset(&obj->vma_node); >>>    INIT_LIST_HEAD(&obj->lru_node); >>>} >>>diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c >>>b/drivers/gpu/drm/drm_gpuva_mgr.c >>>new file mode 100644 >>>index 000000000000..e665f642689d >>>--- /dev/null >>>+++ b/drivers/gpu/drm/drm_gpuva_mgr.c >>>@@ -0,0 +1,1323 @@ >>>+// SPDX-License-Identifier: GPL-2.0 >>>+/* >>>+ * Copyright (c) 2022 Red Hat. >>>+ * >>>+ * Permission is hereby granted, free of charge, to any person >>>obtaining a >>>+ * copy of this software and associated documentation files (the >>>"Software"), >>>+ * to deal in the Software without restriction, including without >>>limitation >>>+ * the rights to use, copy, modify, merge, publish, distribute, >>>sublicense, >>>+ * and/or sell copies of the Software, and to permit persons to whom the >>>+ * Software is furnished to do so, subject to the following conditions: >>>+ * >>>+ * The above copyright notice and this permission notice shall be >>>included in >>>+ * all copies or substantial portions of the Software. >>>+ * >>>+ * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY >>>KIND, EXPRESS OR >>>+ * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF >>>MERCHANTABILITY, >>>+ * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO >>>EVENT SHALL >>>+ * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, >>>DAMAGES OR >>>+ * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >>>+ * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >>>+ * OTHER DEALINGS IN THE SOFTWARE. >>>+ * >>>+ * Authors: >>>+ *    Danilo Krummrich <dakr@redhat.com> >>>+ * >>>+ */ >>>+ >>>+#include <drm/drm_gem.h> >>>+#include <drm/drm_gpuva_mgr.h> >>>+ >>>+/** >>>+ * DOC: Overview >>>+ * >>>+ * The DRM GPU VA Manager, represented by struct >>>drm_gpuva_manager keeps track >>>+ * of a GPU's virtual address (VA) space and manages the >>>corresponding virtual >>>+ * mappings represented by &drm_gpuva objects. It also keeps >>>track of the >>>+ * mapping's backing &drm_gem_object buffers. >>>+ * >>>+ * &drm_gem_object buffers maintain a list (and a corresponding >>>list lock) of >>>+ * &drm_gpuva objects representing all existent GPU VA mappings >>>using this >>>+ * &drm_gem_object as backing buffer. >>>+ * >>>+ * A GPU VA mapping can only be created within a previously allocated >>>+ * &drm_gpuva_region, which represents a reserved portion of the >>>GPU VA space. >>>+ * GPU VA mappings are not allowed to span over a >>>&drm_gpuva_region's boundary. >>>+ * >>>+ * GPU VA regions can also be flagged as sparse, which allows >>>drivers to create >>>+ * sparse mappings for a whole GPU VA region in order to support Vulkan >>>+ * 'Sparse Resources'. >>>+ * >> >>So, is the sparse resources the only usecase for the VA region >>abstraction in uapi? >>Or are there any other potential usecases (other than kernel >>reserved space)? >> Ok thanks. > >It's mainly for sparse resources, though it also enables the kernel to >know the bounds within it is allowed to merge mappings. > >>>+ * The GPU VA manager internally uses the &drm_mm range allocator >>>to manage the >>>+ * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's >>>virtual address >>>+ * space. >>>+ * >>>+ * Besides the GPU VA space regions (&drm_gpuva_region) allocated >>>by a driver >>>+ * the &drm_gpuva_manager contains a special region representing >>>the portion of >>>+ * VA space reserved by the kernel. This node is initialized >>>together with the >>>+ * GPU VA manager instance and removed when the GPU VA manager is >>>destroyed. >>>+ * >>>+ * In a typical application drivers would embed struct >>>drm_gpuva_manager, >>>+ * struct drm_gpuva_region and struct drm_gpuva within their own driver >>>+ * specific structures, there won't be any memory allocations of >>>it's own nor >>>+ * memory allocations of &drm_gpuva or &drm_gpuva_region entries. >>>+ */ >>>+ >>>+/** >>>+ * DOC: Split and Merge >>>+ * >>>+ * The DRM GPU VA manager also provides an algorithm implementing >>>splitting and >>>+ * merging of existent GPU VA mappings with the ones that are >>>requested to be >>>+ * mapped or unmapped. This feature is required by the Vulkan API >>>to implement >>>+ * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to >>>this as >>>+ * VM BIND. >> >>Looks like the split and merge is only based on address continuity >>(virtual and >>physical). >>How about potential page table attributes specified during VM_BIND >>for a mapping >>such as readonly, caching attributes, atomicity etc? >> > >In this (first) patch version the GPU VA manager generates a list of >(sub-)operations (or split/merge steps) for each map / unmap request, >which I will replace with callbacks for each split/merge step in v2. > >For generating those steps the GPU VA manager indeed does take in >consideration address continuity (virtual and physical) only. > >However, driver specific attributes can be taken in consideration by >drivers for each of those split/merge steps. > >E.g. for a map request the GPU VA manager generates an arbitrary >amount of unmap ops, a maximum of two remap ops (for splitting) and >exactly one map operation. >Unmap operations carry their corresponding struct drm_gpuva being >embedded in it's driver specific structure. They're generated for all >mappings residing (and are enclosed by) the range of the new mapping >to map and for mappings that are direct and physically contiguous >neighbors and are in the same region. Unmap operations also carry a >flag to indicate drivers whether the mapping to unmap is physically >contiguous, such that drivers could only do a partial page table >update in case this is feasible . >Remap operations consist out of one unmap operation and at least one, >but a maximum of two map operations, such that drivers can apply the >original driver specific attributes of the mapping to the new (split >up) mapping(s). >For the single map operation we know the driver specific attributes as >well, since they're just the ones that were passed to the driver >through the original VM_BIND ioctl requesting the new mapping in the >first place. > >I hope I got your question right and the answer helps. > Yes, thanks, callbacks for considering driver specific attributes sounds good. Niranjana >- Danilo > >>Thanks, >>Niranjana >> >>>+ * >>>+ * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a >>>list of map, unmap >>>+ * and remap operations for a given newly requested mapping. This list >>>+ * represents the set of operations to execute in order to >>>integrate the new >>>+ * mapping cleanly into the current state of the GPU VA space. >>>+ * >>>+ * Depending on how the new GPU VA mapping intersects with the >>>existent mappings >>>+ * of the GPU VA space the &drm_gpuva_ops contain an arbitrary >>>amount of unmap >>>+ * operations, a maximum of two remap operations and a single map >>>operation. >>>+ * The set of operations can also be empty if no operation is >>>required, e.g. if >>>+ * the requested mapping already exists in the exact same way. >>>+ * >>>+ * The single map operation, if existent, represents the original >>>map operation >>>+ * requested by the caller. Please note that this operation might >>>be altered >>>+ * comparing it with the original map operation, e.g. because it >>>was merged with >>>+ * an already existent mapping. Hence, drivers must execute this >>>map operation >>>+ * instead of the original one they passed to >>>drm_gpuva_sm_map_ops_create(). >>>+ * >>>+ * &drm_gpuva_op_unmap contains a 'keep' field, which indicates >>>whether the >>>+ * &drm_gpuva to unmap is physically contiguous with the original >>>mapping >>>+ * request. Optionally, if 'keep' is set, drivers may keep the >>>actual page table >>>+ * entries for this &drm_gpuva, adding the missing page table >>>entries only and >>>+ * update the &drm_gpuva_manager's view of things accordingly. >>>+ * >>>+ * Drivers may do the same optimization, namely delta page table >>>updates, also >>>+ * for remap operations. This is possible since >>>&drm_gpuva_op_remap consists of >>>+ * one unmap operation and one or two map operations, such that >>>drivers can >>>+ * derive the page table update delta accordingly. >>>+ * >>>+ * Note that there can't be more than two existent mappings to >>>split up, one at >>>+ * the beginning and one at the end of the new mapping, hence there is a >>>+ * maximum of two remap operations. >>>+ * >>>+ * Generally, the DRM GPU VA manager never merges mappings across the >>>+ * boundaries of &drm_gpuva_regions. This is the case since >>>merging between >>>+ * GPU VA regions would result into unmap and map operations to >>>be issued for >>>+ * both regions involved although the original mapping request >>>was referred to >>>+ * one specific GPU VA region only. Since the other GPU VA >>>region, the one not >>>+ * explicitly requested to be altered, might be in use by the >>>GPU, we are not >>>+ * allowed to issue any map/unmap operations for this region. >>>+ * >>>+ * Note that before calling drm_gpuva_sm_map_ops_create() again >>>with another >>>+ * mapping request it is necessary to update the >>>&drm_gpuva_manager's view of >>>+ * the GPU VA space. The previously obtained operations must be >>>either fully >>>+ * processed or completely abandoned. >>>+ * >>>+ * To update the &drm_gpuva_manager's view of the GPU VA space >>>+ * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >>>+ * drm_gpuva_destroy_unlocked() should be used. >>>+ * >>>+ * Analogue to drm_gpuva_sm_map_ops_create() >>>drm_gpuva_sm_unmap_ops_create() >>>+ * provides drivers a the list of operations to be executed in >>>order to unmap >>>+ * a range of GPU VA space. The logic behind this functions is >>>way simpler >>>+ * though: For all existent mappings enclosed by the given range unmap >>>+ * operations are created. For mappings which are only partically >>>located within >>>+ * the given range, remap operations are created such that those >>>mappings are >>>+ * split up and re-mapped partically. >>>+ * >>>+ * The following paragraph depicts the basic constellations of >>>existent GPU VA >>>+ * mappings, a newly requested mapping and the resulting mappings >>>as implemented >>>+ * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary >>>combinations >>>+ * of those constellations. >>>+ * >>>+ * :: >>>+ * >>>+ *   1) Existent mapping is kept. >>>+ *   ---------------------------- >>>+ * >>>+ *        0    a    1 >>>+ *   old: |-----------| (bo_offset=n) >>>+ * >>>+ *        0    a    1 >>>+ *   req: |-----------| (bo_offset=n) >>>+ * >>>+ *        0    a    1 >>>+ *   new: |-----------| (bo_offset=n) >>>+ * >>>+ * >>>+ *   2) Existent mapping is replaced. >>>+ *   -------------------------------- >>>+ * >>>+ *        0    a    1 >>>+ *   old: |-----------| (bo_offset=n) >>>+ * >>>+ *        0    a    1 >>>+ *   req: |-----------| (bo_offset=m) >>>+ * >>>+ *        0    a    1 >>>+ *   new: |-----------| (bo_offset=m) >>>+ * >>>+ * >>>+ *   3) Existent mapping is replaced. >>>+ *   -------------------------------- >>>+ * >>>+ *        0    a    1 >>>+ *   old: |-----------| (bo_offset=n) >>>+ * >>>+ *        0    b    1 >>>+ *   req: |-----------| (bo_offset=n) >>>+ * >>>+ *        0    b    1 >>>+ *   new: |-----------| (bo_offset=n) >>>+ * >>>+ * >>>+ *   4) Existent mapping is replaced. >>>+ *   -------------------------------- >>>+ * >>>+ *        0 a 1 >>>+ *   old: |-----|      (bo_offset=n) >>>+ * >>>+ *        0    a    2 >>>+ *   req: |-----------| (bo_offset=n) >>>+ * >>>+ *        0    a    2 >>>+ *   new: |-----------| (bo_offset=n) >>>+ * >>>+ *   Note: We expect to see the same result for a request with a >>>different bo >>>+ *         and/or bo_offset. >>>+ * >>>+ * >>>+ *   5) Existent mapping is split. >>>+ *   ----------------------------- >>>+ * >>>+ *        0    a    2 >>>+ *   old: |-----------| (bo_offset=n) >>>+ * >>>+ *        0 b 1 >>>+ *   req: |-----|      (bo_offset=n) >>>+ * >>>+ *        0 b 1 a' 2 >>>+ *   new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) >>>+ * >>>+ *   Note: We expect to see the same result for a request with a >>>different bo >>>+ *         and/or non-contiguous bo_offset. >>>+ * >>>+ * >>>+ *   6) Existent mapping is kept. >>>+ *   ---------------------------- >>>+ * >>>+ *        0    a    2 >>>+ *   old: |-----------| (bo_offset=n) >>>+ * >>>+ *        0 a 1 >>>+ *   req: |-----|      (bo_offset=n) >>>+ * >>>+ *        0    a    2 >>>+ *   new: |-----------| (bo_offset=n) >>>+ * >>>+ * >>>+ *   7) Existent mapping is split. >>>+ *   ----------------------------- >>>+ * >>>+ *        0    a    2 >>>+ *   old: |-----------| (bo_offset=n) >>>+ * >>>+ *              1 b 2 >>>+ *   req:      |-----| (bo_offset=m) >>>+ * >>>+ *        0 a 1 b 2 >>>+ *   new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) >>>+ * >>>+ * >>>+ *   8) Existent mapping is kept. >>>+ *   ---------------------------- >>>+ * >>>+ *         0    a    2 >>>+ *   old: |-----------| (bo_offset=n) >>>+ * >>>+ *              1 a 2 >>>+ *   req:      |-----| (bo_offset=n+1) >>>+ * >>>+ *        0    a    2 >>>+ *   new: |-----------| (bo_offset=n) >>>+ * >>>+ * >>>+ *   9) Existent mapping is split. >>>+ *   ----------------------------- >>>+ * >>>+ *        0    a    2 >>>+ *   old: |-----------|      (bo_offset=n) >>>+ * >>>+ *              1    b    3 >>>+ *   req:      |-----------| (bo_offset=m) >>>+ * >>>+ *        0 a 1    b    3 >>>+ *   new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) >>>+ * >>>+ * >>>+ *   10) Existent mapping is merged. >>>+ *   ------------------------------- >>>+ * >>>+ *        0    a    2 >>>+ *   old: |-----------|      (bo_offset=n) >>>+ * >>>+ *              1    a    3 >>>+ *   req:      |-----------| (bo_offset=n+1) >>>+ * >>>+ *        0       a       3 >>>+ *   new: |-----------------| (bo_offset=n) >>>+ * >>>+ * >>>+ *   11) Existent mapping is split. >>>+ *   ------------------------------ >>>+ * >>>+ *        0       a       3 >>>+ *   old: |-----------------| (bo_offset=n) >>>+ * >>>+ *              1 b 2 >>>+ *   req:      |-----|      (bo_offset=m) >>>+ * >>>+ *        0 a 1 b 2 a' 3 >>>+ *   new: |-----|-----|-----| >>>(a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) >>>+ * >>>+ * >>>+ *   12) Existent mapping is kept. >>>+ *   ----------------------------- >>>+ * >>>+ *        0       a       3 >>>+ *   old: |-----------------| (bo_offset=n) >>>+ * >>>+ *              1 a 2 >>>+ *   req:      |-----|      (bo_offset=n+1) >>>+ * >>>+ *        0       a       3 >>>+ *   old: |-----------------| (bo_offset=n) >>>+ * >>>+ * >>>+ *   13) Existent mapping is replaced. >>>+ *   --------------------------------- >>>+ * >>>+ *              1 a 2 >>>+ *   old:      |-----| (bo_offset=n) >>>+ * >>>+ *        0    a    2 >>>+ *   req: |-----------| (bo_offset=n) >>>+ * >>>+ *        0    a    2 >>>+ *   new: |-----------| (bo_offset=n) >>>+ * >>>+ *   Note: We expect to see the same result for a request with a >>>different bo >>>+ *         and/or non-contiguous bo_offset. >>>+ * >>>+ * >>>+ *   14) Existent mapping is replaced. >>>+ *   --------------------------------- >>>+ * >>>+ *              1 a 2 >>>+ *   old:      |-----| (bo_offset=n) >>>+ * >>>+ *        0       a      3 >>>+ *   req: |----------------| (bo_offset=n) >>>+ * >>>+ *        0       a      3 >>>+ *   new: |----------------| (bo_offset=n) >>>+ * >>>+ *   Note: We expect to see the same result for a request with a >>>different bo >>>+ *         and/or non-contiguous bo_offset. >>>+ * >>>+ * >>>+ *   15) Existent mapping is split. >>>+ *   ------------------------------ >>>+ * >>>+ *              1    a    3 >>>+ *   old:      |-----------| (bo_offset=n) >>>+ * >>>+ *        0    b    2 >>>+ *   req: |-----------|      (bo_offset=m) >>>+ * >>>+ *        0    b    2 a' 3 >>>+ *   new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) >>>+ * >>>+ * >>>+ *   16) Existent mappings are merged. >>>+ *   --------------------------------- >>>+ * >>>+ *        0    a    1 >>>+ *   old: |-----------|                       (bo_offset=n) >>>+ * >>>+ *                               2    a    3 >>>+ *   old':                      |-----------| (bo_offset=n+2) >>>+ * >>>+ *                   1    a    2 >>>+ *   req:           |-----------|            (bo_offset=n+1) >>>+ * >>>+ *                         a >>>+ *   new: |----------------------------------| (bo_offset=n) >>>+ */ >>>+ >>>+/** >>>+ * DOC: Locking >>>+ * >>>+ * Generally, the GPU VA manager does not take care of locking >>>itself, it is >>>+ * the drivers responsibility to take care about locking. Drivers >>>might want to >>>+ * protect the following operations: inserting, destroying and iterating >>>+ * &drm_gpuva and &drm_gpuva_region objects as well as generating >>>split and merge >>>+ * operations. >>>+ * >>>+ * The GPU VA manager does take care of the locking of the backing >>>+ * &drm_gem_object buffers GPU VA lists though, unless the >>>provided functions >>>+ * documentation claims otherwise. >>>+ */ >>>+ >>>+/** >>>+ * drm_gpuva_manager_init - initialize a &drm_gpuva_manager >>>+ * @mgr: pointer to the &drm_gpuva_manager to initialize >>>+ * @name: the name of the GPU VA space >>>+ * @start_offset: the start offset of the GPU VA space >>>+ * @range: the size of the GPU VA space >>>+ * @reserve_offset: the start of the kernel reserved GPU VA area >>>+ * @reserve_range: the size of the kernel reserved GPU VA area >>>+ * >>>+ * The &drm_gpuva_manager must be initialized with this function >>>before use. >>>+ * >>>+ * Note that @mgr must be cleared to 0 before calling this >>>function. The given >>>+ * &name is expected to be managed by the surrounding driver structures. >>>+ */ >>>+void >>>+drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >>>+              const char *name, >>>+              u64 start_offset, u64 range, >>>+              u64 reserve_offset, u64 reserve_range) >>>+{ >>>+   drm_mm_init(&mgr->va_mm, start_offset, range); >>>+   drm_mm_init(&mgr->region_mm, start_offset, range); >>>+ >>>+   mgr->mm_start = start_offset; >>>+   mgr->mm_range = range; >>>+ >>>+   mgr->name = name ? name : "unknown"; >>>+ >>>+   memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); >>>+   mgr->kernel_alloc_node.start = reserve_offset; >>>+   mgr->kernel_alloc_node.size = reserve_range; >>>+   drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_manager_init); >>>+ >>>+/** >>>+ * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager >>>+ * @mgr: pointer to the &drm_gpuva_manager to clean up >>>+ * >>>+ * Note that it is a bug to call this function on a manager that still >>>+ * holds GPU VA mappings. >>>+ */ >>>+void >>>+drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) >>>+{ >>>+   mgr->name = NULL; >>>+   drm_mm_remove_node(&mgr->kernel_alloc_node); >>>+   drm_mm_takedown(&mgr->va_mm); >>>+   drm_mm_takedown(&mgr->region_mm); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_manager_destroy); >>>+ >>>+static struct drm_gpuva_region * >>>+drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >>>+{ >>>+   struct drm_gpuva_region *reg; >>>+ >>>+   /* Find the VA region the requested range is strictly >>>enclosed by. */ >>>+   drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { >>>+       if (reg->node.start <= addr && >>>+           reg->node.start + reg->node.size >= addr + range && >>>+           ®->node != &mgr->kernel_alloc_node) >>>+           return reg; >>>+   } >>>+ >>>+   return NULL; >>>+} >>>+ >>>+static bool >>>+drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, >>>u64 range) >>>+{ >>>+   return !!drm_gpuva_in_region(mgr, addr, range); >>>+} >>>+ >>>+/** >>>+ * drm_gpuva_insert - insert a &drm_gpuva >>>+ * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >>>+ * @va: the &drm_gpuva to insert >>>+ * @addr: the start address of the GPU VA >>>+ * @range: the range of the GPU VA >>>+ * >>>+ * Insert a &drm_gpuva with a given address and range into a >>>+ * &drm_gpuva_manager. >>>+ * >>>+ * The function assumes the caller does not hold the &drm_gem_object's >>>+ * GPU VA list mutex. >>>+ * >>>+ * Returns: 0 on success, negative error code on failure. >>>+ */ >>>+int >>>+drm_gpuva_insert(struct drm_gpuva_manager *mgr, >>>+        struct drm_gpuva *va, >>>+        u64 addr, u64 range) >>>+{ >>>+   struct drm_gpuva_region *reg; >>>+   int ret; >>>+ >>>+   if (!va->gem.obj) >>>+       return -EINVAL; >>>+ >>>+   reg = drm_gpuva_in_region(mgr, addr, range); >>>+   if (!reg) >>>+       return -EINVAL; >>>+ >>>+   ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, >>>+                     range, 0, >>>+                     0, addr, >>>+                     addr + range, >>>+                     DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); >>>+   if (ret) >>>+       return ret; >>>+ >>>+   va->mgr = mgr; >>>+   va->region = reg; >>>+ >>>+   return 0; >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_insert); >>>+ >>>+/** >>>+ * drm_gpuva_link_locked - link a &drm_gpuva >>>+ * @va: the &drm_gpuva to link >>>+ * >>>+ * This adds the given &va to the GPU VA list of the >>>&drm_gem_object it is >>>+ * associated with. >>>+ * >>>+ * The function assumes the caller already holds the &drm_gem_object's >>>+ * GPU VA list mutex. >>>+ */ >>>+void >>>+drm_gpuva_link_locked(struct drm_gpuva *va) >>>+{ >>>+   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >>>+   list_add_tail(&va->head, &va->gem.obj->gpuva.list); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_link_locked); >>>+ >>>+/** >>>+ * drm_gpuva_link_unlocked - unlink a &drm_gpuva >>>+ * @va: the &drm_gpuva to unlink >>>+ * >>>+ * This adds the given &va to the GPU VA list of the >>>&drm_gem_object it is >>>+ * associated with. >>>+ * >>>+ * The function assumes the caller does not hold the &drm_gem_object's >>>+ * GPU VA list mutex. >>>+ */ >>>+void >>>+drm_gpuva_link_unlocked(struct drm_gpuva *va) >>>+{ >>>+   drm_gem_gpuva_lock(va->gem.obj); >>>+   drm_gpuva_link_locked(va); >>>+   drm_gem_gpuva_unlock(va->gem.obj); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_link_unlocked); >>>+ >>>+/** >>>+ * drm_gpuva_unlink_locked - unlink a &drm_gpuva >>>+ * @va: the &drm_gpuva to unlink >>>+ * >>>+ * This removes the given &va from the GPU VA list of the >>>&drm_gem_object it is >>>+ * associated with. >>>+ * >>>+ * The function assumes the caller already holds the &drm_gem_object's >>>+ * GPU VA list mutex. >>>+ */ >>>+void >>>+drm_gpuva_unlink_locked(struct drm_gpuva *va) >>>+{ >>>+   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >>>+   list_del_init(&va->head); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_unlink_locked); >>>+ >>>+/** >>>+ * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva >>>+ * @va: the &drm_gpuva to unlink >>>+ * >>>+ * This removes the given &va from the GPU VA list of the >>>&drm_gem_object it is >>>+ * associated with. >>>+ * >>>+ * The function assumes the caller does not hold the &drm_gem_object's >>>+ * GPU VA list mutex. >>>+ */ >>>+void >>>+drm_gpuva_unlink_unlocked(struct drm_gpuva *va) >>>+{ >>>+   drm_gem_gpuva_lock(va->gem.obj); >>>+   drm_gpuva_unlink_locked(va); >>>+   drm_gem_gpuva_unlock(va->gem.obj); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); >>>+ >>>+/** >>>+ * drm_gpuva_destroy_locked - destroy a &drm_gpuva >>>+ * @va: the &drm_gpuva to destroy >>>+ * >>>+ * This removes the given &va from GPU VA list of the >>>&drm_gem_object it is >>>+ * associated with and removes it from the underlaying range allocator. >>>+ * >>>+ * The function assumes the caller already holds the &drm_gem_object's >>>+ * GPU VA list mutex. >>>+ */ >>>+void >>>+drm_gpuva_destroy_locked(struct drm_gpuva *va) >>>+{ >>>+   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >>>+ >>>+   list_del(&va->head); >>>+   drm_mm_remove_node(&va->node); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_destroy_locked); >>>+ >>>+/** >>>+ * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva >>>+ * @va: the &drm_gpuva to destroy >>>+ * >>>+ * This removes the given &va from GPU VA list of the >>>&drm_gem_object it is >>>+ * associated with and removes it from the underlaying range allocator. >>>+ * >>>+ * The function assumes the caller does not hold the &drm_gem_object's >>>+ * GPU VA list mutex. >>>+ */ >>>+void >>>+drm_gpuva_destroy_unlocked(struct drm_gpuva *va) >>>+{ >>>+   drm_gem_gpuva_lock(va->gem.obj); >>>+   list_del(&va->head); >>>+   drm_gem_gpuva_unlock(va->gem.obj); >>>+ >>>+   drm_mm_remove_node(&va->node); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); >>>+ >>>+/** >>>+ * drm_gpuva_find - find a &drm_gpuva >>>+ * @mgr: the &drm_gpuva_manager to search in >>>+ * @addr: the &drm_gpuvas address >>>+ * @range: the &drm_gpuvas range >>>+ * >>>+ * Returns: the &drm_gpuva at a given &addr and with a given &range >>>+ */ >>>+struct drm_gpuva * >>>+drm_gpuva_find(struct drm_gpuva_manager *mgr, >>>+          u64 addr, u64 range) >>>+{ >>>+   struct drm_gpuva *va; >>>+ >>>+   drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { >>>+       if (va->node.start == addr && >>>+           va->node.size == range) >>>+           return va; >>>+   } >>>+ >>>+   return NULL; >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_find); >>>+ >>>+/** >>>+ * drm_gpuva_find_prev - find the &drm_gpuva before the given address >>>+ * @mgr: the &drm_gpuva_manager to search in >>>+ * @start: the given GPU VA's start address >>>+ * >>>+ * Find the adjacent &drm_gpuva before the GPU VA with given >>>&start address. >>>+ * >>>+ * Note that if there is any free space between the GPU VA >>>mappings no mapping >>>+ * is returned. >>>+ * >>>+ * Returns: a pointer to the found &drm_gpuva or NULL if none was found >>>+ */ >>>+struct drm_gpuva * >>>+drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) >>>+{ >>>+   struct drm_mm_node *node; >>>+ >>>+   if (start <= mgr->mm_start || >>>+       start > (mgr->mm_start + mgr->mm_range)) >>>+       return NULL; >>>+ >>>+   node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); >>>+   if (node == &mgr->va_mm.head_node) >>>+       return NULL; >>>+ >>>+   return (struct drm_gpuva *)node; >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_find_prev); >>>+ >>>+/** >>>+ * drm_gpuva_find_next - find the &drm_gpuva after the given address >>>+ * @mgr: the &drm_gpuva_manager to search in >>>+ * @end: the given GPU VA's end address >>>+ * >>>+ * Find the adjacent &drm_gpuva after the GPU VA with given &end >>>address. >>>+ * >>>+ * Note that if there is any free space between the GPU VA >>>mappings no mapping >>>+ * is returned. >>>+ * >>>+ * Returns: a pointer to the found &drm_gpuva or NULL if none was found >>>+ */ >>>+struct drm_gpuva * >>>+drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) >>>+{ >>>+   struct drm_mm_node *node; >>>+ >>>+   if (end < mgr->mm_start || >>>+       end >= (mgr->mm_start + mgr->mm_range)) >>>+       return NULL; >>>+ >>>+   node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); >>>+   if (node == &mgr->va_mm.head_node) >>>+       return NULL; >>>+ >>>+   return (struct drm_gpuva *)node; >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_find_next); >>>+ >>>+/** >>>+ * drm_gpuva_region_insert - insert a &drm_gpuva_region >>>+ * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >>>+ * @reg: the &drm_gpuva_region to insert >>>+ * @addr: the start address of the GPU VA >>>+ * @range: the range of the GPU VA >>>+ * >>>+ * Insert a &drm_gpuva_region with a given address and range into a >>>+ * &drm_gpuva_manager. >>>+ * >>>+ * Returns: 0 on success, negative error code on failure. >>>+ */ >>>+int >>>+drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >>>+           struct drm_gpuva_region *reg, >>>+           u64 addr, u64 range) >>>+{ >>>+   int ret; >>>+ >>>+   ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, >>>+                     range, 0, >>>+                     0, addr, >>>+                     addr + range, >>>+                     DRM_MM_INSERT_LOW| >>>+                     DRM_MM_INSERT_ONCE); >>>+   if (ret) >>>+       return ret; >>>+ >>>+   reg->mgr = mgr; >>>+ >>>+   return 0; >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_region_insert); >>>+ >>>+/** >>>+ * drm_gpuva_region_destroy - destroy a &drm_gpuva_region >>>+ * @mgr: the &drm_gpuva_manager holding the region >>>+ * @reg: the &drm_gpuva to destroy >>>+ * >>>+ * This removes the given ® from the underlaying range allocator. >>>+ */ >>>+void >>>+drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >>>+            struct drm_gpuva_region *reg) >>>+{ >>>+   struct drm_gpuva *va; >>>+ >>>+   drm_gpuva_for_each_va_in_range(va, mgr, >>>+                      reg->node.start, >>>+                      reg->node.size) { >>>+       WARN(1, "GPU VA region must be empty on destroy.\n"); >>>+       return; >>>+   } >>>+ >>>+   if (®->node == &mgr->kernel_alloc_node) { >>>+       WARN(1, "Can't destroy kernel reserved region.\n"); >>>+       return; >>>+   } >>>+ >>>+   drm_mm_remove_node(®->node); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_region_destroy); >>>+ >>>+/** >>>+ * drm_gpuva_region_find - find a &drm_gpuva_region >>>+ * @mgr: the &drm_gpuva_manager to search in >>>+ * @addr: the &drm_gpuva_regions address >>>+ * @range: the &drm_gpuva_regions range >>>+ * >>>+ * Returns: the &drm_gpuva_region at a given &addr and with a >>>given &range >>>+ */ >>>+struct drm_gpuva_region * >>>+drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >>>+             u64 addr, u64 range) >>>+{ >>>+   struct drm_gpuva_region *reg; >>>+ >>>+   drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) >>>+       if (reg->node.start == addr && >>>+           reg->node.size == range) >>>+           return reg; >>>+ >>>+   return NULL; >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_region_find); >>>+ >>>+static int >>>+gpuva_op_map_new(struct drm_gpuva_op **pop, >>>+        u64 addr, u64 range, >>>+        struct drm_gem_object *obj, u64 offset) >>>+{ >>>+   struct drm_gpuva_op *op; >>>+ >>>+   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >>>+   if (!op) >>>+       return -ENOMEM; >>>+ >>>+   op->op = DRM_GPUVA_OP_MAP; >>>+   op->map.va.addr = addr; >>>+   op->map.va.range = range; >>>+   op->map.gem.obj = obj; >>>+   op->map.gem.offset = offset; >>>+ >>>+   return 0; >>>+} >>>+ >>>+static int >>>+gpuva_op_remap_new(struct drm_gpuva_op **pop, >>>+          struct drm_gpuva_op_map *prev, >>>+          struct drm_gpuva_op_map *next, >>>+          struct drm_gpuva_op_unmap *unmap) >>>+{ >>>+   struct drm_gpuva_op *op; >>>+   struct drm_gpuva_op_remap *r; >>>+ >>>+   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >>>+   if (!op) >>>+       return -ENOMEM; >>>+ >>>+   op->op = DRM_GPUVA_OP_REMAP; >>>+   r = &op->remap; >>>+ >>>+   if (prev) { >>>+       r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); >>>+       if (!r->prev) >>>+           goto err_free_op; >>>+   } >>>+ >>>+   if (next) { >>>+       r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); >>>+       if (!r->next) >>>+           goto err_free_prev; >>>+   } >>>+ >>>+   r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); >>>+   if (!r->unmap) >>>+       goto err_free_next; >>>+ >>>+   return 0; >>>+ >>>+err_free_next: >>>+   if (next) >>>+       kfree(r->next); >>>+err_free_prev: >>>+   if (prev) >>>+       kfree(r->prev); >>>+err_free_op: >>>+   kfree(op); >>>+   *pop = NULL; >>>+ >>>+   return -ENOMEM; >>>+} >>>+ >>>+static int >>>+gpuva_op_unmap_new(struct drm_gpuva_op **pop, >>>+          struct drm_gpuva *va, bool merge) >>>+{ >>>+   struct drm_gpuva_op *op; >>>+ >>>+   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >>>+   if (!op) >>>+       return -ENOMEM; >>>+ >>>+   op->op = DRM_GPUVA_OP_UNMAP; >>>+   op->unmap.va = va; >>>+   op->unmap.keep = merge; >>>+ >>>+   return 0; >>>+} >>>+ >>>+#define op_map_new_to_list(_ops, _addr, _range,       \ >>>+              _obj, _offset)       \ >>>+do {                           \ >>>+   struct drm_gpuva_op *op;           \ >>>+                           \ >>>+   ret = gpuva_op_map_new(&op, _addr, _range,   \ >>>+                  _obj, _offset);       \ >>>+   if (ret)                   \ >>>+       goto err_free_ops;           \ >>>+                           \ >>>+   list_add_tail(&op->entry, _ops);       \ >>>+} while (0) >>>+ >>>+#define op_remap_new_to_list(_ops, _prev, _next,   \ >>>+                _unmap)           \ >>>+do {                           \ >>>+   struct drm_gpuva_op *op;           \ >>>+                           \ >>>+   ret = gpuva_op_remap_new(&op, _prev, _next,   \ >>>+                _unmap);       \ >>>+   if (ret)                   \ >>>+       goto err_free_ops;           \ >>>+                           \ >>>+   list_add_tail(&op->entry, _ops);       \ >>>+} while (0) >>>+ >>>+#define op_unmap_new_to_list(_ops, _gpuva, _merge)   \ >>>+do {                           \ >>>+   struct drm_gpuva_op *op;           \ >>>+                           \ >>>+   ret = gpuva_op_unmap_new(&op, _gpuva, _merge);   \ >>>+   if (ret)                   \ >>>+       goto err_free_ops;           \ >>>+                           \ >>>+   list_add_tail(&op->entry, _ops);       \ >>>+} while (0) >>>+ >>>+/** >>>+ * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to >>>split and merge >>>+ * @mgr: the &drm_gpuva_manager representing the GPU VA space >>>+ * @req_addr: the start address of the new mapping >>>+ * @req_range: the range of the new mapping >>>+ * @req_obj: the &drm_gem_object to map >>>+ * @req_offset: the offset within the &drm_gem_object >>>+ * >>>+ * This function creates a list of operations to perform >>>splitting and merging >>>+ * of existent mapping(s) with the newly requested one. >>>+ * >>>+ * The list can be iterated with &drm_gpuva_for_each_op and must >>>be processed >>>+ * in the given order. It can contain map, unmap and remap >>>operations, but it >>>+ * also can be empty if no operation is required, e.g. if the >>>requested mapping >>>+ * already exists is the exact same way. >>>+ * >>>+ * There can be an arbitrary amount of unmap operations, a >>>maximum of two remap >>>+ * operations and a single map operation. The latter one, if existent, >>>+ * represents the original map operation requested by the caller. >>>Please note >>>+ * that the map operation might has been modified, e.g. if it was >>>+ * merged with an existent mapping. >>>+ * >>>+ * Note that before calling this function again with another >>>mapping request it >>>+ * is necessary to update the &drm_gpuva_manager's view of the >>>GPU VA space. >>>+ * The previously obtained operations must be either processed or >>>abandoned. >>>+ * To update the &drm_gpuva_manager's view of the GPU VA space >>>+ * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >>>+ * drm_gpuva_destroy_unlocked() should be used. >>>+ * >>>+ * After the caller finished processing the returned >>>&drm_gpuva_ops, they must >>>+ * be freed with &drm_gpuva_ops_free. >>>+ * >>>+ * Returns: a pointer to the &drm_gpuva_ops on success, an >>>ERR_PTR on failure >>>+ */ >>>+struct drm_gpuva_ops * >>>+drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >>>+               u64 req_addr, u64 req_range, >>>+               struct drm_gem_object *req_obj, u64 req_offset) >>>+{ >>>+   struct drm_gpuva_ops *ops; >>>+   struct drm_gpuva *va, *prev = NULL; >>>+   u64 req_end = req_addr + req_range; >>>+   bool skip_pmerge = false, skip_nmerge = false; >>>+   int ret; >>>+ >>>+   if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) >>>+       return ERR_PTR(-EINVAL); >>>+ >>>+   ops = kzalloc(sizeof(*ops), GFP_KERNEL); >>>+   if (!ops) >>>+       return ERR_PTR(-ENOMEM); >>>+ >>>+   INIT_LIST_HEAD(&ops->list); >>>+ >>>+   drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >>>+       struct drm_gem_object *obj = va->gem.obj; >>>+       u64 offset = va->gem.offset; >>>+       u64 addr = va->node.start; >>>+       u64 range = va->node.size; >>>+       u64 end = addr + range; >>>+ >>>+       /* Generally, we want to skip merging with potential mappings >>>+        * left and right of the requested one when we found a >>>+        * collision, since merging happens in this loop already. >>>+        * >>>+        * However, there is one exception when the requested mapping >>>+        * spans into a free VM area. If this is the case we might >>>+        * still hit the boundary of another mapping before and/or >>>+        * after the free VM area. >>>+        */ >>>+       skip_pmerge = true; >>>+       skip_nmerge = true; >>>+ >>>+       if (addr == req_addr) { >>>+           bool merge = obj == req_obj && >>>+                    offset == req_offset; >>>+           if (end == req_end) { >>>+               if (merge) >>>+                   goto done; >>>+ >>>+               op_unmap_new_to_list(&ops->list, va, false); >>>+               break; >>>+           } >>>+ >>>+           if (end < req_end) { >>>+               skip_nmerge = false; >>>+               op_unmap_new_to_list(&ops->list, va, merge); >>>+               goto next; >>>+           } >>>+ >>>+           if (end > req_end) { >>>+               struct drm_gpuva_op_map n = { >>>+                   .va.addr = req_end, >>>+                   .va.range = range - req_range, >>>+                   .gem.obj = obj, >>>+                   .gem.offset = offset + req_range, >>>+               }; >>>+               struct drm_gpuva_op_unmap u = { .va = va }; >>>+ >>>+               if (merge) >>>+                   goto done; >>>+ >>>+               op_remap_new_to_list(&ops->list, NULL, &n, &u); >>>+               break; >>>+           } >>>+       } else if (addr < req_addr) { >>>+           u64 ls_range = req_addr - addr; >>>+           struct drm_gpuva_op_map p = { >>>+               .va.addr = addr, >>>+               .va.range = ls_range, >>>+               .gem.obj = obj, >>>+               .gem.offset = offset, >>>+           }; >>>+           struct drm_gpuva_op_unmap u = { .va = va }; >>>+           bool merge = obj == req_obj && >>>+                    offset + ls_range == req_offset; >>>+ >>>+           if (end == req_end) { >>>+               if (merge) >>>+                   goto done; >>>+ >>>+               op_remap_new_to_list(&ops->list, &p, NULL, &u); >>>+               break; >>>+           } >>>+ >>>+           if (end < req_end) { >>>+               u64 new_addr = addr; >>>+               u64 new_range = req_range + ls_range; >>>+               u64 new_offset = offset; >>>+ >>>+               /* We validated that the requested mapping is >>>+                * within a single VA region already. >>>+                * Since it overlaps the current mapping (which >>>+                * can't cross a VA region boundary) we can be >>>+                * sure that we're still within the boundaries >>>+                * of the same VA region after merging. >>>+                */ >>>+               if (merge) { >>>+                   req_offset = new_offset; >>>+                   req_addr = new_addr; >>>+                   req_range = new_range; >>>+                   op_unmap_new_to_list(&ops->list, va, true); >>>+                   goto next; >>>+               } >>>+ >>>+               op_remap_new_to_list(&ops->list, &p, NULL, &u); >>>+               goto next; >>>+           } >>>+ >>>+           if (end > req_end) { >>>+               struct drm_gpuva_op_map n = { >>>+                   .va.addr = req_end, >>>+                   .va.range = end - req_end, >>>+                   .gem.obj = obj, >>>+                   .gem.offset = offset + ls_range + >>>+                             req_range, >>>+               }; >>>+ >>>+               if (merge) >>>+                   goto done; >>>+ >>>+               op_remap_new_to_list(&ops->list, &p, &n, &u); >>>+               break; >>>+           } >>>+       } else if (addr > req_addr) { >>>+           bool merge = obj == req_obj && >>>+                    offset == req_offset + >>>+                          (addr - req_addr); >>>+           if (!prev) >>>+               skip_pmerge = false; >>>+ >>>+           if (end == req_end) { >>>+               op_unmap_new_to_list(&ops->list, va, merge); >>>+               break; >>>+           } >>>+ >>>+           if (end < req_end) { >>>+               skip_nmerge = false; >>>+               op_unmap_new_to_list(&ops->list, va, merge); >>>+               goto next; >>>+           } >>>+ >>>+           if (end > req_end) { >>>+               struct drm_gpuva_op_map n = { >>>+                   .va.addr = req_end, >>>+                   .va.range = end - req_end, >>>+                   .gem.obj = obj, >>>+                   .gem.offset = offset + req_end - addr, >>>+               }; >>>+               struct drm_gpuva_op_unmap u = { .va = va }; >>>+               u64 new_end = end; >>>+               u64 new_range = new_end - req_addr; >>>+ >>>+               /* We validated that the requested mapping is >>>+                * within a single VA region already. >>>+                * Since it overlaps the current mapping (which >>>+                * can't cross a VA region boundary) we can be >>>+                * sure that we're still within the boundaries >>>+                * of the same VA region after merging. >>>+                */ >>>+               if (merge) { >>>+                   req_end = new_end; >>>+                   req_range = new_range; >>>+                   op_unmap_new_to_list(&ops->list, va, true); >>>+                   break; >>>+               } >>>+ >>>+               op_remap_new_to_list(&ops->list, NULL, &n, &u); >>>+               break; >>>+           } >>>+       } >>>+next: >>>+       prev = va; >>>+   } >>>+ >>>+   va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); >>>+   if (va) { >>>+       struct drm_gem_object *obj = va->gem.obj; >>>+       u64 offset = va->gem.offset; >>>+       u64 addr = va->node.start; >>>+       u64 range = va->node.size; >>>+       u64 new_offset = offset; >>>+       u64 new_addr = addr; >>>+       u64 new_range = req_range + range; >>>+       bool merge = obj == req_obj && >>>+                offset + range == req_offset; >>>+ >>>+       /* Don't merge over VA region boundaries. */ >>>+       merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); >>>+       if (merge) { >>>+           op_unmap_new_to_list(&ops->list, va, true); >>>+ >>>+           req_offset = new_offset; >>>+           req_addr = new_addr; >>>+           req_range = new_range; >>>+       } >>>+   } >>>+ >>>+   va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); >>>+   if (va) { >>>+       struct drm_gem_object *obj = va->gem.obj; >>>+       u64 offset = va->gem.offset; >>>+       u64 addr = va->node.start; >>>+       u64 range = va->node.size; >>>+       u64 end = addr + range; >>>+       u64 new_range = req_range + range; >>>+       u64 new_end = end; >>>+       bool merge = obj == req_obj && >>>+                offset == req_offset + req_range; >>>+ >>>+       /* Don't merge over VA region boundaries. */ >>>+       merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); >>>+       if (merge) { >>>+           op_unmap_new_to_list(&ops->list, va, true); >>>+ >>>+           req_range = new_range; >>>+           req_end = new_end; >>>+       } >>>+   } >>>+ >>>+   op_map_new_to_list(&ops->list, >>>+              req_addr, req_range, >>>+              req_obj, req_offset); >>>+ >>>+done: >>>+   return ops; >>>+ >>>+err_free_ops: >>>+   drm_gpuva_ops_free(ops); >>>+   return ERR_PTR(ret); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); >>>+ >>>+#undef op_map_new_to_list >>>+#undef op_remap_new_to_list >>>+#undef op_unmap_new_to_list >>>+ >>>+/** >>>+ * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to >>>split on unmap >>>+ * @mgr: the &drm_gpuva_manager representing the GPU VA space >>>+ * @req_addr: the start address of the range to unmap >>>+ * @req_range: the range of the mappings to unmap >>>+ * >>>+ * This function creates a list of operations to perform >>>unmapping and, if >>>+ * required, splitting of the mappings overlapping the unmap range. >>>+ * >>>+ * The list can be iterated with &drm_gpuva_for_each_op and must >>>be processed >>>+ * in the given order. It can contain unmap and remap operations, >>>depending on >>>+ * whether there are actual overlapping mappings to split. >>>+ * >>>+ * There can be an arbitrary amount of unmap operations and a >>>maximum of two >>>+ * remap operations. >>>+ * >>>+ * Note that before calling this function again with another >>>range to unmap it >>>+ * is necessary to update the &drm_gpuva_manager's view of the >>>GPU VA space. >>>+ * The previously obtained operations must be processed or abandoned. >>>+ * To update the &drm_gpuva_manager's view of the GPU VA space >>>+ * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >>>+ * drm_gpuva_destroy_unlocked() should be used. >>>+ * >>>+ * After the caller finished processing the returned >>>&drm_gpuva_ops, they must >>>+ * be freed with &drm_gpuva_ops_free. >>>+ * >>>+ * Returns: a pointer to the &drm_gpuva_ops on success, an >>>ERR_PTR on failure >>>+ */ >>>+struct drm_gpuva_ops * >>>+drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >>>+                 u64 req_addr, u64 req_range) >>>+{ >>>+   struct drm_gpuva_ops *ops; >>>+   struct drm_gpuva_op *op; >>>+   struct drm_gpuva_op_remap *r; >>>+   struct drm_gpuva *va; >>>+   u64 req_end = req_addr + req_range; >>>+   int ret; >>>+ >>>+   ops = kzalloc(sizeof(*ops), GFP_KERNEL); >>>+   if (!ops) >>>+       return ERR_PTR(-ENOMEM); >>>+ >>>+   INIT_LIST_HEAD(&ops->list); >>>+ >>>+   drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >>>+       struct drm_gem_object *obj = va->gem.obj; >>>+       u64 offset = va->gem.offset; >>>+       u64 addr = va->node.start; >>>+       u64 range = va->node.size; >>>+       u64 end = addr + range; >>>+ >>>+       op = kzalloc(sizeof(*op), GFP_KERNEL); >>>+       if (!op) { >>>+           ret = -ENOMEM; >>>+           goto err_free_ops; >>>+       } >>>+ >>>+       r = &op->remap; >>>+ >>>+       if (addr < req_addr) { >>>+           r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); >>>+           if (!r->prev) { >>>+               ret = -ENOMEM; >>>+               goto err_free_op; >>>+           } >>>+ >>>+           r->prev->va.addr = addr; >>>+           r->prev->va.range = req_addr - addr; >>>+           r->prev->gem.obj = obj; >>>+           r->prev->gem.offset = offset; >>>+       } >>>+ >>>+       if (end > req_end) { >>>+           r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); >>>+           if (!r->next) { >>>+               ret = -ENOMEM; >>>+               goto err_free_prev; >>>+           } >>>+ >>>+           r->next->va.addr = req_end; >>>+           r->next->va.range = end - req_end; >>>+           r->next->gem.obj = obj; >>>+           r->next->gem.offset = offset + (req_end - addr); >>>+       } >>>+ >>>+       if (op->remap.prev || op->remap.next) { >>>+           op->op = DRM_GPUVA_OP_REMAP; >>>+           r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); >>>+           if (!r->unmap) { >>>+               ret = -ENOMEM; >>>+               goto err_free_next; >>>+           } >>>+ >>>+           r->unmap->va = va; >>>+       } else { >>>+           op->op = DRM_GPUVA_OP_UNMAP; >>>+           op->unmap.va = va; >>>+       } >>>+ >>>+       list_add_tail(&op->entry, &ops->list); >>>+   } >>>+ >>>+   return ops; >>>+ >>>+err_free_next: >>>+   if (r->next) >>>+       kfree(r->next); >>>+err_free_prev: >>>+   if (r->prev) >>>+       kfree(r->prev); >>>+err_free_op: >>>+   kfree(op); >>>+err_free_ops: >>>+   drm_gpuva_ops_free(ops); >>>+   return ERR_PTR(ret); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); >>>+ >>>+/** >>>+ * drm_gpuva_ops_free - free the given &drm_gpuva_ops >>>+ * @ops: the &drm_gpuva_ops to free >>>+ * >>>+ * Frees the given &drm_gpuva_ops structure including all the ops >>>associated >>>+ * with it. >>>+ */ >>>+void >>>+drm_gpuva_ops_free(struct drm_gpuva_ops *ops) >>>+{ >>>+   struct drm_gpuva_op *op, *next; >>>+ >>>+   drm_gpuva_for_each_op_safe(op, next, ops) { >>>+       list_del(&op->entry); >>>+       if (op->op == DRM_GPUVA_OP_REMAP) { >>>+           if (op->remap.prev) >>>+               kfree(op->remap.prev); >>>+ >>>+           if (op->remap.next) >>>+               kfree(op->remap.next); >>>+ >>>+           kfree(op->remap.unmap); >>>+       } >>>+       kfree(op); >>>+   } >>>+ >>>+   kfree(ops); >>>+} >>>+EXPORT_SYMBOL(drm_gpuva_ops_free); >>>diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h >>>index d7c521e8860f..6feacd93aca6 100644 >>>--- a/include/drm/drm_drv.h >>>+++ b/include/drm/drm_drv.h >>>@@ -104,6 +104,12 @@ enum drm_driver_feature { >>>    * acceleration should be handled by two drivers that are >>>connected using auxiliary bus. >>>    */ >>>    DRIVER_COMPUTE_ACCEL           = BIT(7), >>>+   /** >>>+    * @DRIVER_GEM_GPUVA: >>>+    * >>>+    * Driver supports user defined GPU VA bindings for GEM objects. >>>+    */ >>>+   DRIVER_GEM_GPUVA       = BIT(8), >>> >>>    /* IMPORTANT: Below are all the legacy flags, add new ones above. */ >>> >>>diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h >>>index 772a4adf5287..4a3679034966 100644 >>>--- a/include/drm/drm_gem.h >>>+++ b/include/drm/drm_gem.h >>>@@ -36,6 +36,8 @@ >>> >>>#include <linux/kref.h> >>>#include <linux/dma-resv.h> >>>+#include <linux/list.h> >>>+#include <linux/mutex.h> >>> >>>#include <drm/drm_vma_manager.h> >>> >>>@@ -337,6 +339,17 @@ struct drm_gem_object { >>>    */ >>>    struct dma_resv _resv; >>> >>>+   /** >>>+    * @gpuva: >>>+    * >>>+    * Provides the list and list mutex of GPU VAs attached to this >>>+    * GEM object. >>>+    */ >>>+   struct { >>>+       struct list_head list; >>>+       struct mutex mutex; >>>+   } gpuva; >>>+ >>>    /** >>>    * @funcs: >>>    * >>>@@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru >>>*lru, struct drm_gem_object *obj); >>>unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned >>>nr_to_scan, >>>                  bool (*shrink)(struct drm_gem_object *obj)); >>> >>>+/** >>>+ * drm_gem_gpuva_init - initialize the gpuva list of a GEM object >>>+ * @obj: the &drm_gem_object >>>+ * >>>+ * This initializes the &drm_gem_object's &drm_gpuva list and the mutex >>>+ * protecting it. >>>+ * >>>+ * Calling this function is only necessary for drivers intending >>>to support the >>>+ * &drm_driver_feature DRIVER_GEM_GPUVA. >>>+ */ >>>+static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) >>>+{ >>>+   INIT_LIST_HEAD(&obj->gpuva.list); >>>+   mutex_init(&obj->gpuva.mutex); >>>+} >>>+ >>>+/** >>>+ * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex >>>+ * @obj: the &drm_gem_object >>>+ * >>>+ * This unlocks the mutex protecting the &drm_gem_object's >>>&drm_gpuva list. >>>+ */ >>>+static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) >>>+{ >>>+   mutex_lock(&obj->gpuva.mutex); >>>+} >>>+ >>>+/** >>>+ * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex >>>+ * @obj: the &drm_gem_object >>>+ * >>>+ * This unlocks the mutex protecting the &drm_gem_object's >>>&drm_gpuva list. >>>+ */ >>>+static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) >>>+{ >>>+   mutex_unlock(&obj->gpuva.mutex); >>>+} >>>+ >>>+/** >>>+ * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas >>>+ * @entry: &drm_gpuva structure to assign to in each iteration step >>>+ * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >>>+ * >>>+ * This iterator walks over all &drm_gpuva structures associated >>>with the >>>+ * &drm_gpuva_manager. >>>+ */ >>>+#define drm_gem_for_each_gpuva(entry, obj) \ >>>+   list_for_each_entry(entry, &obj->gpuva.list, head) >>>+ >>>+/** >>>+ * drm_gem_for_each_gpuva_safe - iternator to safely walk over a >>>list of gpuvas >>>+ * @entry: &drm_gpuva structure to assign to in each iteration step >>>+ * @next: &next &drm_gpuva to store the next step >>>+ * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >>>+ * >>>+ * This iterator walks over all &drm_gpuva structures associated >>>with the >>>+ * &drm_gem_object. It is implemented with >>>list_for_each_entry_safe(), hence >>>+ * it is save against removal of elements. >>>+ */ >>>+#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ >>>+   list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) >>>+ >>>#endif /* __DRM_GEM_H__ */ >>>diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h >>>new file mode 100644 >>>index 000000000000..adeb0c916e91 >>>--- /dev/null >>>+++ b/include/drm/drm_gpuva_mgr.h >>>@@ -0,0 +1,527 @@ >>>+// SPDX-License-Identifier: GPL-2.0 >>>+ >>>+#ifndef __DRM_GPUVA_MGR_H__ >>>+#define __DRM_GPUVA_MGR_H__ >>>+ >>>+/* >>>+ * Copyright (c) 2022 Red Hat. >>>+ * >>>+ * Permission is hereby granted, free of charge, to any person >>>obtaining a >>>+ * copy of this software and associated documentation files (the >>>"Software"), >>>+ * to deal in the Software without restriction, including without >>>limitation >>>+ * the rights to use, copy, modify, merge, publish, distribute, >>>sublicense, >>>+ * and/or sell copies of the Software, and to permit persons to whom the >>>+ * Software is furnished to do so, subject to the following conditions: >>>+ * >>>+ * The above copyright notice and this permission notice shall be >>>included in >>>+ * all copies or substantial portions of the Software. >>>+ * >>>+ * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY >>>KIND, EXPRESS OR >>>+ * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF >>>MERCHANTABILITY, >>>+ * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO >>>EVENT SHALL >>>+ * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, >>>DAMAGES OR >>>+ * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >>>+ * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >>>+ * OTHER DEALINGS IN THE SOFTWARE. >>>+ */ >>>+ >>>+#include <drm/drm_mm.h> >>>+#include <linux/mm.h> >>>+#include <linux/rbtree.h> >>>+#include <linux/spinlock.h> >>>+#include <linux/types.h> >>>+ >>>+struct drm_gpuva_region; >>>+struct drm_gpuva; >>>+struct drm_gpuva_ops; >>>+ >>>+/** >>>+ * struct drm_gpuva_manager - DRM GPU VA Manager >>>+ * >>>+ * The DRM GPU VA Manager keeps track of a GPU's virtual address >>>space by using >>>+ * the &drm_mm range allocator. Typically, this structure is >>>embedded in bigger >>>+ * driver structures. >>>+ * >>>+ * Drivers can pass addresses and ranges in an arbitrary unit, >>>e.g. bytes or >>>+ * pages. >>>+ * >>>+ * There should be one manager instance per GPU virtual address space. >>>+ */ >>>+struct drm_gpuva_manager { >>>+   /** >>>+    * @name: the name of the DRM GPU VA space >>>+    */ >>>+   const char *name; >>>+ >>>+   /** >>>+    * @mm_start: start of the VA space >>>+    */ >>>+   u64 mm_start; >>>+ >>>+   /** >>>+    * @mm_range: length of the VA space >>>+    */ >>>+   u64 mm_range; >>>+ >>>+   /** >>>+    * @region_mm: the &drm_mm range allocator to track GPU VA regions >>>+    */ >>>+   struct drm_mm region_mm; >>>+ >>>+   /** >>>+    * @va_mm: the &drm_mm range allocator to track GPU VA mappings >>>+    */ >>>+   struct drm_mm va_mm; >>>+ >>>+   /** >>>+    * @kernel_alloc_node: >>>+    * >>>+    * &drm_mm_node representing the address space cutout reserved for >>>+    * the kernel >>>+    */ >>>+   struct drm_mm_node kernel_alloc_node; >>>+}; >>>+ >>>+void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >>>+               const char *name, >>>+               u64 start_offset, u64 range, >>>+               u64 reserve_offset, u64 reserve_range); >>>+void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); >>>+ >>>+/** >>>+ * struct drm_gpuva_region - structure to track a portion of GPU >>>VA space >>>+ * >>>+ * This structure represents a portion of a GPUs VA space and is >>>associated >>>+ * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >>>+ * >>>+ * GPU VA mappings, represented by &drm_gpuva objects, are >>>restricted to be >>>+ * placed within a &drm_gpuva_region. >>>+ */ >>>+struct drm_gpuva_region { >>>+   /** >>>+    * @node: the &drm_mm_node to track the GPU VA region >>>+    */ >>>+   struct drm_mm_node node; >>>+ >>>+   /** >>>+    * @mgr: the &drm_gpuva_manager this object is associated with >>>+    */ >>>+   struct drm_gpuva_manager *mgr; >>>+ >>>+   /** >>>+    * @sparse: indicates whether this region is sparse >>>+    */ >>>+   bool sparse; >>>+}; >>>+ >>>+struct drm_gpuva_region * >>>+drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >>>+             u64 addr, u64 range); >>>+int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >>>+               struct drm_gpuva_region *reg, >>>+               u64 addr, u64 range); >>>+void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >>>+                 struct drm_gpuva_region *reg); >>>+ >>>+int drm_gpuva_insert(struct drm_gpuva_manager *mgr, >>>+            struct drm_gpuva *va, >>>+            u64 addr, u64 range); >>>+/** >>>+ * drm_gpuva_for_each_region_in_range - iternator to walk over a >>>range of nodes >>>+ * @node__: &drm_gpuva_region structure to assign to in each >>>iteration step >>>+ * @gpuva__: &drm_gpuva_manager structure to walk >>>+ * @start__: starting offset, the first node will overlap this >>>+ * @end__: ending offset, the last node will start before this >>>(but may overlap) >>>+ * >>>+ * This iterator walks over all nodes in the range allocator that lie >>>+ * between @start and @end. It is implemented similarly to >>>list_for_each(), >>>+ * but is using &drm_mm's internal interval tree to accelerate >>>the search for >>>+ * the starting node, and hence isn't safe against removal of >>>elements. It >>>+ * assumes that @end is within (or is the upper limit of) the >>>&drm_gpuva_manager. >>>+ * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >>>+ * iterator may walk over the special _unallocated_ >>>&drm_mm.head_node of the >>>+ * backing &drm_mm, and may even continue indefinitely. >>>+ */ >>>+#define drm_gpuva_for_each_region_in_range(node__, gpuva__, >>>start__, end__) \ >>>+   for (node__ = (struct drm_gpuva_region >>>*)__drm_mm_interval_first(&(gpuva__)->region_mm, \ >>>+                                    (start__), (end__)-1); \ >>>+        node__->node.start < (end__); \ >>>+        node__ = (struct drm_gpuva_region >>>*)list_next_entry(&node__->node, node_list)) >>>+ >>>+/** >>>+ * drm_gpuva_for_each_region - iternator to walk over a range of nodes >>>+ * @entry: &drm_gpuva_region structure to assign to in each >>>iteration step >>>+ * @gpuva: &drm_gpuva_manager structure to walk >>>+ * >>>+ * This iterator walks over all &drm_gpuva_region structures >>>associated with the >>>+ * &drm_gpuva_manager. >>>+ */ >>>+#define drm_gpuva_for_each_region(entry, gpuva) \ >>>+   list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), >>>node.node_list) >>>+ >>>+/** >>>+ * drm_gpuva_for_each_region_safe - iternator to safely walk over >>>a range of >>>+ * nodes >>>+ * @entry: &drm_gpuva_region structure to assign to in each >>>iteration step >>>+ * @next: &next &drm_gpuva_region to store the next step >>>+ * @gpuva: &drm_gpuva_manager structure to walk >>>+ * >>>+ * This iterator walks over all &drm_gpuva_region structures >>>associated with the >>>+ * &drm_gpuva_manager. It is implemented with >>>list_for_each_safe(), so save >>>+ * against removal of elements. >>>+ */ >>>+#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ >>>+   list_for_each_entry_safe(entry, next, >>>drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >>>+ >>>+ >>>+/** >>>+ * enum drm_gpuva_flags - flags for struct drm_gpuva >>>+ */ >>>+enum drm_gpuva_flags { >>>+   /** >>>+    * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is >>>swapped >>>+    */ >>>+   DRM_GPUVA_SWAPPED = (1 << 0), >>>+}; >>>+ >>>+/** >>>+ * struct drm_gpuva - structure to track a GPU VA mapping >>>+ * >>>+ * This structure represents a GPU VA mapping and is associated with a >>>+ * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >>>+ * >>>+ * Typically, this structure is embedded in bigger driver structures. >>>+ */ >>>+struct drm_gpuva { >>>+   /** >>>+    * @node: the &drm_mm_node to track the GPU VA mapping >>>+    */ >>>+   struct drm_mm_node node; >>>+ >>>+   /** >>>+    * @mgr: the &drm_gpuva_manager this object is associated with >>>+    */ >>>+   struct drm_gpuva_manager *mgr; >>>+ >>>+   /** >>>+    * @region: the &drm_gpuva_region the &drm_gpuva is mapped in >>>+    */ >>>+   struct drm_gpuva_region *region; >>>+ >>>+   /** >>>+    * @head: the &list_head to attach this object to a &drm_gem_object >>>+    */ >>>+   struct list_head head; >>>+ >>>+   /** >>>+    * @flags: the &drm_gpuva_flags for this mapping >>>+    */ >>>+   enum drm_gpuva_flags flags; >>>+ >>>+   /** >>>+    * @gem: structure containing the &drm_gem_object and it's offset >>>+    */ >>>+   struct { >>>+       /** >>>+        * @offset: the offset within the &drm_gem_object >>>+        */ >>>+       u64 offset; >>>+ >>>+       /** >>>+        * @obj: the mapped &drm_gem_object >>>+        */ >>>+       struct drm_gem_object *obj; >>>+   } gem; >>>+}; >>>+ >>>+void drm_gpuva_link_locked(struct drm_gpuva *va); >>>+void drm_gpuva_link_unlocked(struct drm_gpuva *va); >>>+void drm_gpuva_unlink_locked(struct drm_gpuva *va); >>>+void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); >>>+ >>>+void drm_gpuva_destroy_locked(struct drm_gpuva *va); >>>+void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); >>>+ >>>+struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, >>>+                u64 addr, u64 range); >>>+struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager >>>*mgr, u64 start); >>>+struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager >>>*mgr, u64 end); >>>+ >>>+/** >>>+ * drm_gpuva_swap - sets whether the backing BO of this >>>&drm_gpuva is swapped >>>+ * @va: the &drm_gpuva to set the swap flag of >>>+ * @swap: indicates whether the &drm_gpuva is swapped >>>+ */ >>>+static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) >>>+{ >>>+   if (swap) >>>+       va->flags |= DRM_GPUVA_SWAPPED; >>>+   else >>>+       va->flags &= ~DRM_GPUVA_SWAPPED; >>>+} >>>+ >>>+/** >>>+ * drm_gpuva_swapped - indicates whether the backing BO of this >>>&drm_gpuva >>>+ * is swapped >>>+ * @va: the &drm_gpuva to check >>>+ */ >>>+static inline bool drm_gpuva_swapped(struct drm_gpuva *va) >>>+{ >>>+   return va->flags & DRM_GPUVA_SWAPPED; >>>+} >>>+ >>>+/** >>>+ * drm_gpuva_for_each_va_in_range - iternator to walk over a >>>range of nodes >>>+ * @node__: &drm_gpuva structure to assign to in each iteration step >>>+ * @gpuva__: &drm_gpuva_manager structure to walk >>>+ * @start__: starting offset, the first node will overlap this >>>+ * @end__: ending offset, the last node will start before this >>>(but may overlap) >>>+ * >>>+ * This iterator walks over all nodes in the range allocator that lie >>>+ * between @start and @end. It is implemented similarly to >>>list_for_each(), >>>+ * but is using &drm_mm's internal interval tree to accelerate >>>the search for >>>+ * the starting node, and hence isn't safe against removal of >>>elements. It >>>+ * assumes that @end is within (or is the upper limit of) the >>>&drm_gpuva_manager. >>>+ * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >>>+ * iterator may walk over the special _unallocated_ >>>&drm_mm.head_node of the >>>+ * backing &drm_mm, and may even continue indefinitely. >>>+ */ >>>+#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, >>>end__) \ >>>+   for (node__ = (struct drm_gpuva >>>*)__drm_mm_interval_first(&(gpuva__)->va_mm, \ >>>+                                 (start__), (end__)-1); \ >>>+        node__->node.start < (end__); \ >>>+        node__ = (struct drm_gpuva >>>*)list_next_entry(&node__->node, node_list)) >>>+ >>>+/** >>>+ * drm_gpuva_for_each_va - iternator to walk over a range of nodes >>>+ * @entry: &drm_gpuva structure to assign to in each iteration step >>>+ * @gpuva: &drm_gpuva_manager structure to walk >>>+ * >>>+ * This iterator walks over all &drm_gpuva structures associated >>>with the >>>+ * &drm_gpuva_manager. >>>+ */ >>>+#define drm_gpuva_for_each_va(entry, gpuva) \ >>>+   list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), >>>node.node_list) >>>+ >>>+/** >>>+ * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of >>>+ * nodes >>>+ * @entry: &drm_gpuva structure to assign to in each iteration step >>>+ * @next: &next &drm_gpuva to store the next step >>>+ * @gpuva: &drm_gpuva_manager structure to walk >>>+ * >>>+ * This iterator walks over all &drm_gpuva structures associated >>>with the >>>+ * &drm_gpuva_manager. It is implemented with >>>list_for_each_safe(), so save >>>+ * against removal of elements. >>>+ */ >>>+#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ >>>+   list_for_each_entry_safe(entry, next, >>>drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >>>+ >>>+/** >>>+ * enum drm_gpuva_op_type - GPU VA operation type >>>+ * >>>+ * Operations to alter the GPU VA mappings tracked by the >>>&drm_gpuva_manager >>>+ * can be map, remap or unmap operations. >>>+ */ >>>+enum drm_gpuva_op_type { >>>+   /** >>>+    * @DRM_GPUVA_OP_MAP: the map op type >>>+    */ >>>+   DRM_GPUVA_OP_MAP, >>>+ >>>+   /** >>>+    * @DRM_GPUVA_OP_REMAP: the remap op type >>>+    */ >>>+   DRM_GPUVA_OP_REMAP, >>>+ >>>+   /** >>>+    * @DRM_GPUVA_OP_UNMAP: the unmap op type >>>+    */ >>>+   DRM_GPUVA_OP_UNMAP, >>>+}; >>>+ >>>+/** >>>+ * struct drm_gpuva_op_map - GPU VA map operation >>>+ * >>>+ * This structure represents a single map operation generated by the >>>+ * DRM GPU VA manager. >>>+ */ >>>+struct drm_gpuva_op_map { >>>+   /** >>>+    * @va: structure containing address and range of a map >>>+    * operation >>>+    */ >>>+   struct { >>>+       /** >>>+        * @addr: the base address of the new mapping >>>+        */ >>>+       u64 addr; >>>+ >>>+       /** >>>+        * @range: the range of the new mapping >>>+        */ >>>+       u64 range; >>>+   } va; >>>+ >>>+   /** >>>+    * @gem: structure containing the &drm_gem_object and it's offset >>>+    */ >>>+   struct { >>>+       /** >>>+        * @offset: the offset within the &drm_gem_object >>>+        */ >>>+       u64 offset; >>>+ >>>+       /** >>>+        * @obj: the &drm_gem_object to map >>>+        */ >>>+       struct drm_gem_object *obj; >>>+   } gem; >>>+}; >>>+ >>>+/** >>>+ * struct drm_gpuva_op_unmap - GPU VA unmap operation >>>+ * >>>+ * This structure represents a single unmap operation generated by the >>>+ * DRM GPU VA manager. >>>+ */ >>>+struct drm_gpuva_op_unmap { >>>+   /** >>>+    * @va: the &drm_gpuva to unmap >>>+    */ >>>+   struct drm_gpuva *va; >>>+ >>>+   /** >>>+    * @keep: >>>+    * >>>+    * Indicates whether this &drm_gpuva is physically contiguous >>>with the >>>+    * original mapping request. >>>+    * >>>+    * Optionally, if &keep is set, drivers may keep the actual >>>page table >>>+    * mappings for this &drm_gpuva, adding the missing page >>>table entries >>>+    * only and update the &drm_gpuva_manager accordingly. >>>+    */ >>>+   bool keep; >>>+}; >>>+ >>>+/** >>>+ * struct drm_gpuva_op_remap - GPU VA remap operation >>>+ * >>>+ * This represents a single remap operation generated by the DRM >>>GPU VA manager. >>>+ * >>>+ * A remap operation is generated when an existing GPU VA >>>mmapping is split up >>>+ * by inserting a new GPU VA mapping or by partially unmapping existent >>>+ * mapping(s), hence it consists of a maximum of two map and one unmap >>>+ * operation. >>>+ * >>>+ * The @unmap operation takes care of removing the original >>>existing mapping. >>>+ * @prev is used to remap the preceding part, @next the subsequent part. >>>+ * >>>+ * If either a new mapping's start address is aligned with the >>>start address >>>+ * of the old mapping or the new mapping's end address is aligned >>>with the >>>+ * end address of the old mapping, either @prev or @next is NULL. >>>+ * >>>+ * Note, the reason for a dedicated remap operation, rather than >>>arbitrary >>>+ * unmap and map operations, is to give drivers the chance of >>>extracting driver >>>+ * specific data for creating the new mappings from the unmap >>>operations's >>>+ * &drm_gpuva structure which typically is embedded in larger >>>driver specific >>>+ * structures. >>>+ */ >>>+struct drm_gpuva_op_remap { >>>+   /** >>>+    * @prev: the preceding part of a split mapping >>>+    */ >>>+   struct drm_gpuva_op_map *prev; >>>+ >>>+   /** >>>+    * @next: the subsequent part of a split mapping >>>+    */ >>>+   struct drm_gpuva_op_map *next; >>>+ >>>+   /** >>>+    * @unmap: the unmap operation for the original existing mapping >>>+    */ >>>+   struct drm_gpuva_op_unmap *unmap; >>>+}; >>>+ >>>+/** >>>+ * struct drm_gpuva_op - GPU VA operation >>>+ * >>>+ * This structure represents a single generic operation, which >>>can be either >>>+ * map, unmap or remap. >>>+ * >>>+ * The particular type of the operation is defined by @op. >>>+ */ >>>+struct drm_gpuva_op { >>>+   /** >>>+    * @entry: >>>+    * >>>+    * The &list_head used to distribute instances of this struct within >>>+    * &drm_gpuva_ops. >>>+    */ >>>+   struct list_head entry; >>>+ >>>+   /** >>>+    * @op: the type of the operation >>>+    */ >>>+   enum drm_gpuva_op_type op; >>>+ >>>+   union { >>>+       /** >>>+        * @map: the map operation >>>+        */ >>>+       struct drm_gpuva_op_map map; >>>+ >>>+       /** >>>+        * @unmap: the unmap operation >>>+        */ >>>+       struct drm_gpuva_op_unmap unmap; >>>+ >>>+       /** >>>+        * @remap: the remap operation >>>+        */ >>>+       struct drm_gpuva_op_remap remap; >>>+   }; >>>+}; >>>+ >>>+/** >>>+ * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op >>>+ */ >>>+struct drm_gpuva_ops { >>>+   /** >>>+    * @list: the &list_head >>>+    */ >>>+   struct list_head list; >>>+}; >>>+ >>>+/** >>>+ * drm_gpuva_for_each_op - iterator to walk over all ops >>>+ * @op: &drm_gpuva_op to assign in each iteration step >>>+ * @ops: &drm_gpuva_ops to walk >>>+ * >>>+ * This iterator walks over all ops within a given list of operations. >>>+ */ >>>+#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, >>>&(ops)->list, entry) >>>+ >>>+/** >>>+ * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops >>>+ * @op: &drm_gpuva_op to assign in each iteration step >>>+ * @next: &next &drm_gpuva_op to store the next step >>>+ * @ops: &drm_gpuva_ops to walk >>>+ * >>>+ * This iterator walks over all ops within a given list of >>>operations. It is >>>+ * implemented with list_for_each_safe(), so save against removal >>>of elements. >>>+ */ >>>+#define drm_gpuva_for_each_op_safe(op, next, ops) \ >>>+   list_for_each_entry_safe(op, next, &(ops)->list, entry) >>>+ >>>+struct drm_gpuva_ops * >>>+drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >>>+               u64 addr, u64 range, >>>+               struct drm_gem_object *obj, u64 offset); >>>+struct drm_gpuva_ops * >>>+drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >>>+                 u64 addr, u64 range); >>>+void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); >>>+ >>>+#endif /* __DRM_GPUVA_MGR_H__ */ >>>-- >>>2.39.0 >>> >> >
On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: > This adds the infrastructure for a manager implementation to keep track > of GPU virtual address (VA) mappings. > > New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers > start implementing, allow userspace applications to request multiple and > arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is > intended to serve the following purposes in this context. > > 1) Provide a dedicated range allocator to track GPU VA allocations and > mappings, making use of the drm_mm range allocator. > > 2) Generically connect GPU VA mappings to their backing buffers, in > particular DRM GEM objects. > > 3) Provide a common implementation to perform more complex mapping > operations on the GPU VA space. In particular splitting and merging > of GPU VA mappings, e.g. for intersecting mapping requests or partial > unmap requests. > > Idea-suggested-by: Dave Airlie <airlied@redhat.com> > Signed-off-by: Danilo Krummrich <dakr@redhat.com> <snip> > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h > new file mode 100644 > index 000000000000..adeb0c916e91 > --- /dev/null > +++ b/include/drm/drm_gpuva_mgr.h > @@ -0,0 +1,527 @@ > +// SPDX-License-Identifier: GPL-2.0 > + > +#ifndef __DRM_GPUVA_MGR_H__ > +#define __DRM_GPUVA_MGR_H__ > + > +/* > + * Copyright (c) 2022 Red Hat. > + * > + * Permission is hereby granted, free of charge, to any person obtaining a > + * copy of this software and associated documentation files (the "Software"), > + * to deal in the Software without restriction, including without limitation > + * the rights to use, copy, modify, merge, publish, distribute, sublicense, > + * and/or sell copies of the Software, and to permit persons to whom the > + * Software is furnished to do so, subject to the following conditions: > + * > + * The above copyright notice and this permission notice shall be included in > + * all copies or substantial portions of the Software. > + * > + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR > + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, > + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL > + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR > + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, > + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR > + * OTHER DEALINGS IN THE SOFTWARE. > + */ > + > +#include <drm/drm_mm.h> > +#include <linux/mm.h> > +#include <linux/rbtree.h> > +#include <linux/spinlock.h> > +#include <linux/types.h> > + > +struct drm_gpuva_region; > +struct drm_gpuva; > +struct drm_gpuva_ops; > + > +/** > + * struct drm_gpuva_manager - DRM GPU VA Manager > + * > + * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using > + * the &drm_mm range allocator. Typically, this structure is embedded in bigger > + * driver structures. > + * > + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or > + * pages. > + * > + * There should be one manager instance per GPU virtual address space. > + */ > +struct drm_gpuva_manager { > + /** > + * @name: the name of the DRM GPU VA space > + */ > + const char *name; > + > + /** > + * @mm_start: start of the VA space > + */ > + u64 mm_start; > + > + /** > + * @mm_range: length of the VA space > + */ > + u64 mm_range; > + > + /** > + * @region_mm: the &drm_mm range allocator to track GPU VA regions > + */ > + struct drm_mm region_mm; > + I'd suggest using a rb_tree rather than drm_mm, it should be quite a bit more light weight - that is what we currently use in Xe for VM / VMA management. See lines 994-1056 in the following file: https://cgit.freedesktop.org/drm/drm-xe/tree/drivers/gpu/drm/xe/xe_vm.c?h=drm-xe-next I'm pretty sure all of your magic marcos (drm_gpuva_for_each*) should be easily implemented using a rb_tree too. Matt > + /** > + * @va_mm: the &drm_mm range allocator to track GPU VA mappings > + */ > + struct drm_mm va_mm; > + > + /** > + * @kernel_alloc_node: > + * > + * &drm_mm_node representing the address space cutout reserved for > + * the kernel > + */ > + struct drm_mm_node kernel_alloc_node; > +}; > + > +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, > + const char *name, > + u64 start_offset, u64 range, > + u64 reserve_offset, u64 reserve_range); > +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); > + > +/** > + * struct drm_gpuva_region - structure to track a portion of GPU VA space > + * > + * This structure represents a portion of a GPUs VA space and is associated > + * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. > + * > + * GPU VA mappings, represented by &drm_gpuva objects, are restricted to be > + * placed within a &drm_gpuva_region. > + */ > +struct drm_gpuva_region { > + /** > + * @node: the &drm_mm_node to track the GPU VA region > + */ > + struct drm_mm_node node; > + > + /** > + * @mgr: the &drm_gpuva_manager this object is associated with > + */ > + struct drm_gpuva_manager *mgr; > + > + /** > + * @sparse: indicates whether this region is sparse > + */ > + bool sparse; > +}; > + > +struct drm_gpuva_region * > +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range); > +int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg, > + u64 addr, u64 range); > +void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg); > + > +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, > + struct drm_gpuva *va, > + u64 addr, u64 range); > +/** > + * drm_gpuva_for_each_region_in_range - iternator to walk over a range of nodes > + * @node__: &drm_gpuva_region structure to assign to in each iteration step > + * @gpuva__: &drm_gpuva_manager structure to walk > + * @start__: starting offset, the first node will overlap this > + * @end__: ending offset, the last node will start before this (but may overlap) > + * > + * This iterator walks over all nodes in the range allocator that lie > + * between @start and @end. It is implemented similarly to list_for_each(), > + * but is using &drm_mm's internal interval tree to accelerate the search for > + * the starting node, and hence isn't safe against removal of elements. It > + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. > + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the > + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the > + * backing &drm_mm, and may even continue indefinitely. > + */ > +#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, end__) \ > + for (node__ = (struct drm_gpuva_region *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ > + (start__), (end__)-1); \ > + node__->node.start < (end__); \ > + node__ = (struct drm_gpuva_region *)list_next_entry(&node__->node, node_list)) > + > +/** > + * drm_gpuva_for_each_region - iternator to walk over a range of nodes > + * @entry: &drm_gpuva_region structure to assign to in each iteration step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva_region structures associated with the > + * &drm_gpuva_manager. > + */ > +#define drm_gpuva_for_each_region(entry, gpuva) \ > + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) > + > +/** > + * drm_gpuva_for_each_region_safe - iternator to safely walk over a range of > + * nodes > + * @entry: &drm_gpuva_region structure to assign to in each iteration step > + * @next: &next &drm_gpuva_region to store the next step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva_region structures associated with the > + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save > + * against removal of elements. > + */ > +#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ > + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) > + > + > +/** > + * enum drm_gpuva_flags - flags for struct drm_gpuva > + */ > +enum drm_gpuva_flags { > + /** > + * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is swapped > + */ > + DRM_GPUVA_SWAPPED = (1 << 0), > +}; > + > +/** > + * struct drm_gpuva - structure to track a GPU VA mapping > + * > + * This structure represents a GPU VA mapping and is associated with a > + * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. > + * > + * Typically, this structure is embedded in bigger driver structures. > + */ > +struct drm_gpuva { > + /** > + * @node: the &drm_mm_node to track the GPU VA mapping > + */ > + struct drm_mm_node node; > + > + /** > + * @mgr: the &drm_gpuva_manager this object is associated with > + */ > + struct drm_gpuva_manager *mgr; > + > + /** > + * @region: the &drm_gpuva_region the &drm_gpuva is mapped in > + */ > + struct drm_gpuva_region *region; > + > + /** > + * @head: the &list_head to attach this object to a &drm_gem_object > + */ > + struct list_head head; > + > + /** > + * @flags: the &drm_gpuva_flags for this mapping > + */ > + enum drm_gpuva_flags flags; > + > + /** > + * @gem: structure containing the &drm_gem_object and it's offset > + */ > + struct { > + /** > + * @offset: the offset within the &drm_gem_object > + */ > + u64 offset; > + > + /** > + * @obj: the mapped &drm_gem_object > + */ > + struct drm_gem_object *obj; > + } gem; > +}; > + > +void drm_gpuva_link_locked(struct drm_gpuva *va); > +void drm_gpuva_link_unlocked(struct drm_gpuva *va); > +void drm_gpuva_unlink_locked(struct drm_gpuva *va); > +void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); > + > +void drm_gpuva_destroy_locked(struct drm_gpuva *va); > +void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); > + > +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range); > +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start); > +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end); > + > +/** > + * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva is swapped > + * @va: the &drm_gpuva to set the swap flag of > + * @swap: indicates whether the &drm_gpuva is swapped > + */ > +static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) > +{ > + if (swap) > + va->flags |= DRM_GPUVA_SWAPPED; > + else > + va->flags &= ~DRM_GPUVA_SWAPPED; > +} > + > +/** > + * drm_gpuva_swapped - indicates whether the backing BO of this &drm_gpuva > + * is swapped > + * @va: the &drm_gpuva to check > + */ > +static inline bool drm_gpuva_swapped(struct drm_gpuva *va) > +{ > + return va->flags & DRM_GPUVA_SWAPPED; > +} > + > +/** > + * drm_gpuva_for_each_va_in_range - iternator to walk over a range of nodes > + * @node__: &drm_gpuva structure to assign to in each iteration step > + * @gpuva__: &drm_gpuva_manager structure to walk > + * @start__: starting offset, the first node will overlap this > + * @end__: ending offset, the last node will start before this (but may overlap) > + * > + * This iterator walks over all nodes in the range allocator that lie > + * between @start and @end. It is implemented similarly to list_for_each(), > + * but is using &drm_mm's internal interval tree to accelerate the search for > + * the starting node, and hence isn't safe against removal of elements. It > + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. > + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the > + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the > + * backing &drm_mm, and may even continue indefinitely. > + */ > +#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, end__) \ > + for (node__ = (struct drm_gpuva *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ > + (start__), (end__)-1); \ > + node__->node.start < (end__); \ > + node__ = (struct drm_gpuva *)list_next_entry(&node__->node, node_list)) > + > +/** > + * drm_gpuva_for_each_va - iternator to walk over a range of nodes > + * @entry: &drm_gpuva structure to assign to in each iteration step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva structures associated with the > + * &drm_gpuva_manager. > + */ > +#define drm_gpuva_for_each_va(entry, gpuva) \ > + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) > + > +/** > + * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of > + * nodes > + * @entry: &drm_gpuva structure to assign to in each iteration step > + * @next: &next &drm_gpuva to store the next step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva structures associated with the > + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save > + * against removal of elements. > + */ > +#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ > + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) > + > +/** > + * enum drm_gpuva_op_type - GPU VA operation type > + * > + * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager > + * can be map, remap or unmap operations. > + */ > +enum drm_gpuva_op_type { > + /** > + * @DRM_GPUVA_OP_MAP: the map op type > + */ > + DRM_GPUVA_OP_MAP, > + > + /** > + * @DRM_GPUVA_OP_REMAP: the remap op type > + */ > + DRM_GPUVA_OP_REMAP, > + > + /** > + * @DRM_GPUVA_OP_UNMAP: the unmap op type > + */ > + DRM_GPUVA_OP_UNMAP, > +}; > + > +/** > + * struct drm_gpuva_op_map - GPU VA map operation > + * > + * This structure represents a single map operation generated by the > + * DRM GPU VA manager. > + */ > +struct drm_gpuva_op_map { > + /** > + * @va: structure containing address and range of a map > + * operation > + */ > + struct { > + /** > + * @addr: the base address of the new mapping > + */ > + u64 addr; > + > + /** > + * @range: the range of the new mapping > + */ > + u64 range; > + } va; > + > + /** > + * @gem: structure containing the &drm_gem_object and it's offset > + */ > + struct { > + /** > + * @offset: the offset within the &drm_gem_object > + */ > + u64 offset; > + > + /** > + * @obj: the &drm_gem_object to map > + */ > + struct drm_gem_object *obj; > + } gem; > +}; > + > +/** > + * struct drm_gpuva_op_unmap - GPU VA unmap operation > + * > + * This structure represents a single unmap operation generated by the > + * DRM GPU VA manager. > + */ > +struct drm_gpuva_op_unmap { > + /** > + * @va: the &drm_gpuva to unmap > + */ > + struct drm_gpuva *va; > + > + /** > + * @keep: > + * > + * Indicates whether this &drm_gpuva is physically contiguous with the > + * original mapping request. > + * > + * Optionally, if &keep is set, drivers may keep the actual page table > + * mappings for this &drm_gpuva, adding the missing page table entries > + * only and update the &drm_gpuva_manager accordingly. > + */ > + bool keep; > +}; > + > +/** > + * struct drm_gpuva_op_remap - GPU VA remap operation > + * > + * This represents a single remap operation generated by the DRM GPU VA manager. > + * > + * A remap operation is generated when an existing GPU VA mmapping is split up > + * by inserting a new GPU VA mapping or by partially unmapping existent > + * mapping(s), hence it consists of a maximum of two map and one unmap > + * operation. > + * > + * The @unmap operation takes care of removing the original existing mapping. > + * @prev is used to remap the preceding part, @next the subsequent part. > + * > + * If either a new mapping's start address is aligned with the start address > + * of the old mapping or the new mapping's end address is aligned with the > + * end address of the old mapping, either @prev or @next is NULL. > + * > + * Note, the reason for a dedicated remap operation, rather than arbitrary > + * unmap and map operations, is to give drivers the chance of extracting driver > + * specific data for creating the new mappings from the unmap operations's > + * &drm_gpuva structure which typically is embedded in larger driver specific > + * structures. > + */ > +struct drm_gpuva_op_remap { > + /** > + * @prev: the preceding part of a split mapping > + */ > + struct drm_gpuva_op_map *prev; > + > + /** > + * @next: the subsequent part of a split mapping > + */ > + struct drm_gpuva_op_map *next; > + > + /** > + * @unmap: the unmap operation for the original existing mapping > + */ > + struct drm_gpuva_op_unmap *unmap; > +}; > + > +/** > + * struct drm_gpuva_op - GPU VA operation > + * > + * This structure represents a single generic operation, which can be either > + * map, unmap or remap. > + * > + * The particular type of the operation is defined by @op. > + */ > +struct drm_gpuva_op { > + /** > + * @entry: > + * > + * The &list_head used to distribute instances of this struct within > + * &drm_gpuva_ops. > + */ > + struct list_head entry; > + > + /** > + * @op: the type of the operation > + */ > + enum drm_gpuva_op_type op; > + > + union { > + /** > + * @map: the map operation > + */ > + struct drm_gpuva_op_map map; > + > + /** > + * @unmap: the unmap operation > + */ > + struct drm_gpuva_op_unmap unmap; > + > + /** > + * @remap: the remap operation > + */ > + struct drm_gpuva_op_remap remap; > + }; > +}; > + > +/** > + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op > + */ > +struct drm_gpuva_ops { > + /** > + * @list: the &list_head > + */ > + struct list_head list; > +}; > + > +/** > + * drm_gpuva_for_each_op - iterator to walk over all ops > + * @op: &drm_gpuva_op to assign in each iteration step > + * @ops: &drm_gpuva_ops to walk > + * > + * This iterator walks over all ops within a given list of operations. > + */ > +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry) > + > +/** > + * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops > + * @op: &drm_gpuva_op to assign in each iteration step > + * @next: &next &drm_gpuva_op to store the next step > + * @ops: &drm_gpuva_ops to walk > + * > + * This iterator walks over all ops within a given list of operations. It is > + * implemented with list_for_each_safe(), so save against removal of elements. > + */ > +#define drm_gpuva_for_each_op_safe(op, next, ops) \ > + list_for_each_entry_safe(op, next, &(ops)->list, entry) > + > +struct drm_gpuva_ops * > +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range, > + struct drm_gem_object *obj, u64 offset); > +struct drm_gpuva_ops * > +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range); > +void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); > + > +#endif /* __DRM_GPUVA_MGR_H__ */ > -- > 2.39.0 >
On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: > This adds the infrastructure for a manager implementation to keep track > of GPU virtual address (VA) mappings. > > New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers > start implementing, allow userspace applications to request multiple and > arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is > intended to serve the following purposes in this context. > > 1) Provide a dedicated range allocator to track GPU VA allocations and > mappings, making use of the drm_mm range allocator. > > 2) Generically connect GPU VA mappings to their backing buffers, in > particular DRM GEM objects. > > 3) Provide a common implementation to perform more complex mapping > operations on the GPU VA space. In particular splitting and merging > of GPU VA mappings, e.g. for intersecting mapping requests or partial > unmap requests. > > Idea-suggested-by: Dave Airlie <airlied@redhat.com> > Signed-off-by: Danilo Krummrich <dakr@redhat.com> <snip> > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c <snip> > +struct drm_gpuva * > +drm_gpuva_find(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range) > +{ > + struct drm_gpuva *va; > + > + drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { Last argument should be: range + addr, right? > + if (va->node.start == addr && > + va->node.size == range) > + return va; > + } > + > + return NULL; > +} > +EXPORT_SYMBOL(drm_gpuva_find); > + > +/** > + * drm_gpuva_find_prev - find the &drm_gpuva before the given address > + * @mgr: the &drm_gpuva_manager to search in > + * @start: the given GPU VA's start address > + * > + * Find the adjacent &drm_gpuva before the GPU VA with given &start address. > + * > + * Note that if there is any free space between the GPU VA mappings no mapping > + * is returned. > + * > + * Returns: a pointer to the found &drm_gpuva or NULL if none was found > + */ > +struct drm_gpuva * > +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) > +{ > + struct drm_mm_node *node; > + > + if (start <= mgr->mm_start || > + start > (mgr->mm_start + mgr->mm_range)) > + return NULL; > + > + node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); > + if (node == &mgr->va_mm.head_node) > + return NULL; > + > + return (struct drm_gpuva *)node; > +} > +EXPORT_SYMBOL(drm_gpuva_find_prev); > + > +/** > + * drm_gpuva_find_next - find the &drm_gpuva after the given address > + * @mgr: the &drm_gpuva_manager to search in > + * @end: the given GPU VA's end address > + * > + * Find the adjacent &drm_gpuva after the GPU VA with given &end address. > + * > + * Note that if there is any free space between the GPU VA mappings no mapping > + * is returned. > + * > + * Returns: a pointer to the found &drm_gpuva or NULL if none was found > + */ > +struct drm_gpuva * > +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) > +{ > + struct drm_mm_node *node; > + > + if (end < mgr->mm_start || > + end >= (mgr->mm_start + mgr->mm_range)) > + return NULL; > + > + node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); > + if (node == &mgr->va_mm.head_node) > + return NULL; > + > + return (struct drm_gpuva *)node; > +} > +EXPORT_SYMBOL(drm_gpuva_find_next); > + > +/** > + * drm_gpuva_region_insert - insert a &drm_gpuva_region > + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in > + * @reg: the &drm_gpuva_region to insert > + * @addr: the start address of the GPU VA > + * @range: the range of the GPU VA > + * > + * Insert a &drm_gpuva_region with a given address and range into a > + * &drm_gpuva_manager. > + * > + * Returns: 0 on success, negative error code on failure. > + */ > +int > +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg, > + u64 addr, u64 range) > +{ > + int ret; > + > + ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, > + range, 0, > + 0, addr, > + addr + range, > + DRM_MM_INSERT_LOW| > + DRM_MM_INSERT_ONCE); > + if (ret) > + return ret; > + > + reg->mgr = mgr; > + > + return 0; > +} > +EXPORT_SYMBOL(drm_gpuva_region_insert); > + > +/** > + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region > + * @mgr: the &drm_gpuva_manager holding the region > + * @reg: the &drm_gpuva to destroy > + * > + * This removes the given ® from the underlaying range allocator. > + */ > +void > +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg) > +{ > + struct drm_gpuva *va; > + > + drm_gpuva_for_each_va_in_range(va, mgr, > + reg->node.start, > + reg->node.size) { Last argument should be: reg->node.start + reg->node.size, right? Matt > + WARN(1, "GPU VA region must be empty on destroy.\n"); > + return; > + } > + > + if (®->node == &mgr->kernel_alloc_node) { > + WARN(1, "Can't destroy kernel reserved region.\n"); > + return; > + } > + > + drm_mm_remove_node(®->node); > +} > +EXPORT_SYMBOL(drm_gpuva_region_destroy);
On Fri, Jan 27, 2023 at 12:24:07AM +0000, Matthew Brost wrote: > On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: > > This adds the infrastructure for a manager implementation to keep track > > of GPU virtual address (VA) mappings. > > > > New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers > > start implementing, allow userspace applications to request multiple and > > arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is > > intended to serve the following purposes in this context. > > > > 1) Provide a dedicated range allocator to track GPU VA allocations and > > mappings, making use of the drm_mm range allocator. > > > > 2) Generically connect GPU VA mappings to their backing buffers, in > > particular DRM GEM objects. > > > > 3) Provide a common implementation to perform more complex mapping > > operations on the GPU VA space. In particular splitting and merging > > of GPU VA mappings, e.g. for intersecting mapping requests or partial > > unmap requests. > > > > Idea-suggested-by: Dave Airlie <airlied@redhat.com> > > Signed-off-by: Danilo Krummrich <dakr@redhat.com> > > <snip> > > > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c > > <snip> > > > +struct drm_gpuva * > > +drm_gpuva_find(struct drm_gpuva_manager *mgr, > > + u64 addr, u64 range) > > +{ > > + struct drm_gpuva *va; > > + > > + drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { > > Last argument should be: range + addr, right? > Thanks, good catch. > > + if (va->node.start == addr && > > + va->node.size == range) > > + return va; > > + } > > + > > + return NULL; > > +} > > +EXPORT_SYMBOL(drm_gpuva_find); > > + > > +/** > > + * drm_gpuva_find_prev - find the &drm_gpuva before the given address > > + * @mgr: the &drm_gpuva_manager to search in > > + * @start: the given GPU VA's start address > > + * > > + * Find the adjacent &drm_gpuva before the GPU VA with given &start address. > > + * > > + * Note that if there is any free space between the GPU VA mappings no mapping > > + * is returned. > > + * > > + * Returns: a pointer to the found &drm_gpuva or NULL if none was found > > + */ > > +struct drm_gpuva * > > +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) > > +{ > > + struct drm_mm_node *node; > > + > > + if (start <= mgr->mm_start || > > + start > (mgr->mm_start + mgr->mm_range)) > > + return NULL; > > + > > + node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); > > + if (node == &mgr->va_mm.head_node) > > + return NULL; > > + > > + return (struct drm_gpuva *)node; > > +} > > +EXPORT_SYMBOL(drm_gpuva_find_prev); > > + > > +/** > > + * drm_gpuva_find_next - find the &drm_gpuva after the given address > > + * @mgr: the &drm_gpuva_manager to search in > > + * @end: the given GPU VA's end address > > + * > > + * Find the adjacent &drm_gpuva after the GPU VA with given &end address. > > + * > > + * Note that if there is any free space between the GPU VA mappings no mapping > > + * is returned. > > + * > > + * Returns: a pointer to the found &drm_gpuva or NULL if none was found > > + */ > > +struct drm_gpuva * > > +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) > > +{ > > + struct drm_mm_node *node; > > + > > + if (end < mgr->mm_start || > > + end >= (mgr->mm_start + mgr->mm_range)) > > + return NULL; > > + > > + node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); > > + if (node == &mgr->va_mm.head_node) > > + return NULL; > > + > > + return (struct drm_gpuva *)node; > > +} > > +EXPORT_SYMBOL(drm_gpuva_find_next); > > + > > +/** > > + * drm_gpuva_region_insert - insert a &drm_gpuva_region > > + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in > > + * @reg: the &drm_gpuva_region to insert > > + * @addr: the start address of the GPU VA > > + * @range: the range of the GPU VA > > + * > > + * Insert a &drm_gpuva_region with a given address and range into a > > + * &drm_gpuva_manager. > > + * > > + * Returns: 0 on success, negative error code on failure. > > + */ > > +int > > +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, > > + struct drm_gpuva_region *reg, > > + u64 addr, u64 range) > > +{ > > + int ret; > > + > > + ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, > > + range, 0, > > + 0, addr, > > + addr + range, > > + DRM_MM_INSERT_LOW| > > + DRM_MM_INSERT_ONCE); > > + if (ret) > > + return ret; > > + > > + reg->mgr = mgr; > > + > > + return 0; > > +} > > +EXPORT_SYMBOL(drm_gpuva_region_insert); > > + > > +/** > > + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region > > + * @mgr: the &drm_gpuva_manager holding the region > > + * @reg: the &drm_gpuva to destroy > > + * > > + * This removes the given ® from the underlaying range allocator. > > + */ > > +void > > +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, > > + struct drm_gpuva_region *reg) > > +{ > > + struct drm_gpuva *va; > > + > > + drm_gpuva_for_each_va_in_range(va, mgr, > > + reg->node.start, > > + reg->node.size) { > > Last argument should be: reg->node.start + reg->node.size, right? Thanks, pushed the fix to new-uapi-drm-next-fixes. > > Matt > > > + WARN(1, "GPU VA region must be empty on destroy.\n"); > > + return; > > + } > > + > > + if (®->node == &mgr->kernel_alloc_node) { > > + WARN(1, "Can't destroy kernel reserved region.\n"); > > + return; > > + } > > + > > + drm_mm_remove_node(®->node); > > +} > > +EXPORT_SYMBOL(drm_gpuva_region_destroy); >
On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: > This adds the infrastructure for a manager implementation to keep track > of GPU virtual address (VA) mappings. > > New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers > start implementing, allow userspace applications to request multiple and > arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is > intended to serve the following purposes in this context. > > 1) Provide a dedicated range allocator to track GPU VA allocations and > mappings, making use of the drm_mm range allocator. > > 2) Generically connect GPU VA mappings to their backing buffers, in > particular DRM GEM objects. > > 3) Provide a common implementation to perform more complex mapping > operations on the GPU VA space. In particular splitting and merging > of GPU VA mappings, e.g. for intersecting mapping requests or partial > unmap requests. > Over the past week I've hacked together a PoC port of Xe to GPUVA [1], so far it seems really promising. 95% of the way to being feature equivalent of the current Xe VM bind implementation and have line of sight to getting sparse bindings implemented on top of GPUVA too. IMO this has basically everything we need for Xe with a few tweaks. I am out until 2/14 but wanted to get my thoughts / suggestions out on the list before I leave. 1. GPUVA post didn't support the way Xe does userptrs - a NULL GEM. I believe with [2], [3], and [4] GPUVA will support NULL GEMs. Also my thinking sparse binds will also have NULL GEMs, more on sparse bindings below. 2. I agree with Christian that drm_mm probably isn't what we want to base the GPUVA implementation on, rather a RB tree or Maple tree has been discussed. The implementation should be fairly easy to tune once we have benchmarks running so not to concerned here as we can figure this out down the line. 3. In Xe we want create xe_vm_op list which inherits from drm_gpuva_op I've done this with a hack [5], I believe when we rebase we can do this with a custom callback to allocate a large op size. 4. I'd like add user bits to drm_gpuva_flags like I do in [6]. This is similar to DMA_FENCE_FLAG_USER_BITS. 5. In Xe we have VM prefetch operation which is needed for our compute UMD with page faults. I'd like add prefetch type of operation like we do in [7]. 6. In Xe we have VM unbind all mappings for a GEM IOCTL, I'd like to add support to generate this operation list to GPUVA like we do in [8]. 7. I've thought about how Xe will implement sparse mappings (read 0, writes dropped). My current thinking is a sparse mapping will be represented as a drm_gpuva rather than region like in Nouveau. Making regions optional to me seems likes good idea rather than forcing the user of GPUVA code to create 1 large region for the manager as I currently do in the Xe PoC. 8. Personally I'd like the caller to own the locking for GEM drm_gpuva list (drm_gpuva_link_*, drm_gpuva_unlink_* functions). In Xe we almost certainly will have the GEM dma-resv lock when we touch this list so an extra lock here is redundant. Also it kinda goofy that caller owns the for drm_gpuva insertion / removal but not the locking for this list. WRT to Christian thoughts on a common uAPI rules for VM binds, I kinda like that idea but I don't think that is necessary. All of pur uAPI should be close but also the GPUVA implementation should be flexible enough to fit all of our needs and I think for the most part it is. Let me know what everything thinks about this. It would be great if when I'm back on 2/14 I can rebase the Xe port to GPUVA on another version of the GPUVA code and get sparse binding support implementation. Also I'd like to get GPUVA merged in the Xe repo ASAP as our VM bind code badly needed to be cleaned and this was the push we needed to make this happen. Matt [1] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314 [2] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=2ae21d7a3f52e5eb2c105ed8ae231471274bdc36 [3] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=49fca9f5d96201f5cbd1b19c7ff17eedfac65cdc [4] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=61fa6b1e1f10e791ae82358fa971b04421d53024 [5] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=87fc08dcf0840e794b38269fe4c6a95d088d79ec [6] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=a4826c22f6788bc29906ffa263c1cd3c4661fa77 [7] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=f008bbb55b213868e52c7b9cda4c1bfb95af6aee [8] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=41f4f71c05d04d2b17d988dd95369b5df2d7f681 > Idea-suggested-by: Dave Airlie <airlied@redhat.com> > Signed-off-by: Danilo Krummrich <dakr@redhat.com> > --- > Documentation/gpu/drm-mm.rst | 31 + > drivers/gpu/drm/Makefile | 1 + > drivers/gpu/drm/drm_gem.c | 3 + > drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ > include/drm/drm_drv.h | 6 + > include/drm/drm_gem.h | 75 ++ > include/drm/drm_gpuva_mgr.h | 527 ++++++++++++ > 7 files changed, 1966 insertions(+) > create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c > create mode 100644 include/drm/drm_gpuva_mgr.h > > diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst > index a52e6f4117d6..c9f120cfe730 100644 > --- a/Documentation/gpu/drm-mm.rst > +++ b/Documentation/gpu/drm-mm.rst > @@ -466,6 +466,37 @@ DRM MM Range Allocator Function References > .. kernel-doc:: drivers/gpu/drm/drm_mm.c > :export: > > +DRM GPU VA Manager > +================== > + > +Overview > +-------- > + > +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c > + :doc: Overview > + > +Split and Merge > +--------------- > + > +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c > + :doc: Split and Merge > + > +Locking > +------- > + > +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c > + :doc: Locking > + > + > +DRM GPU VA Manager Function References > +-------------------------------------- > + > +.. kernel-doc:: include/drm/drm_gpuva_mgr.h > + :internal: > + > +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c > + :export: > + > DRM Buddy Allocator > =================== > > diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile > index 4fe190aee584..de2ffca3b6e4 100644 > --- a/drivers/gpu/drm/Makefile > +++ b/drivers/gpu/drm/Makefile > @@ -45,6 +45,7 @@ drm-y := \ > drm_vblank.o \ > drm_vblank_work.o \ > drm_vma_manager.o \ > + drm_gpuva_mgr.o \ > drm_writeback.o > drm-$(CONFIG_DRM_LEGACY) += \ > drm_agpsupport.o \ > diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c > index 59a0bb5ebd85..65115fe88627 100644 > --- a/drivers/gpu/drm/drm_gem.c > +++ b/drivers/gpu/drm/drm_gem.c > @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device *dev, > if (!obj->resv) > obj->resv = &obj->_resv; > > + if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) > + drm_gem_gpuva_init(obj); > + > drm_vma_node_reset(&obj->vma_node); > INIT_LIST_HEAD(&obj->lru_node); > } > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c > new file mode 100644 > index 000000000000..e665f642689d > --- /dev/null > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c > @@ -0,0 +1,1323 @@ > +// SPDX-License-Identifier: GPL-2.0 > +/* > + * Copyright (c) 2022 Red Hat. > + * > + * Permission is hereby granted, free of charge, to any person obtaining a > + * copy of this software and associated documentation files (the "Software"), > + * to deal in the Software without restriction, including without limitation > + * the rights to use, copy, modify, merge, publish, distribute, sublicense, > + * and/or sell copies of the Software, and to permit persons to whom the > + * Software is furnished to do so, subject to the following conditions: > + * > + * The above copyright notice and this permission notice shall be included in > + * all copies or substantial portions of the Software. > + * > + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR > + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, > + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL > + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR > + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, > + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR > + * OTHER DEALINGS IN THE SOFTWARE. > + * > + * Authors: > + * Danilo Krummrich <dakr@redhat.com> > + * > + */ > + > +#include <drm/drm_gem.h> > +#include <drm/drm_gpuva_mgr.h> > + > +/** > + * DOC: Overview > + * > + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager keeps track > + * of a GPU's virtual address (VA) space and manages the corresponding virtual > + * mappings represented by &drm_gpuva objects. It also keeps track of the > + * mapping's backing &drm_gem_object buffers. > + * > + * &drm_gem_object buffers maintain a list (and a corresponding list lock) of > + * &drm_gpuva objects representing all existent GPU VA mappings using this > + * &drm_gem_object as backing buffer. > + * > + * A GPU VA mapping can only be created within a previously allocated > + * &drm_gpuva_region, which represents a reserved portion of the GPU VA space. > + * GPU VA mappings are not allowed to span over a &drm_gpuva_region's boundary. > + * > + * GPU VA regions can also be flagged as sparse, which allows drivers to create > + * sparse mappings for a whole GPU VA region in order to support Vulkan > + * 'Sparse Resources'. > + * > + * The GPU VA manager internally uses the &drm_mm range allocator to manage the > + * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's virtual address > + * space. > + * > + * Besides the GPU VA space regions (&drm_gpuva_region) allocated by a driver > + * the &drm_gpuva_manager contains a special region representing the portion of > + * VA space reserved by the kernel. This node is initialized together with the > + * GPU VA manager instance and removed when the GPU VA manager is destroyed. > + * > + * In a typical application drivers would embed struct drm_gpuva_manager, > + * struct drm_gpuva_region and struct drm_gpuva within their own driver > + * specific structures, there won't be any memory allocations of it's own nor > + * memory allocations of &drm_gpuva or &drm_gpuva_region entries. > + */ > + > +/** > + * DOC: Split and Merge > + * > + * The DRM GPU VA manager also provides an algorithm implementing splitting and > + * merging of existent GPU VA mappings with the ones that are requested to be > + * mapped or unmapped. This feature is required by the Vulkan API to implement > + * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to this as > + * VM BIND. > + * > + * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a list of map, unmap > + * and remap operations for a given newly requested mapping. This list > + * represents the set of operations to execute in order to integrate the new > + * mapping cleanly into the current state of the GPU VA space. > + * > + * Depending on how the new GPU VA mapping intersects with the existent mappings > + * of the GPU VA space the &drm_gpuva_ops contain an arbitrary amount of unmap > + * operations, a maximum of two remap operations and a single map operation. > + * The set of operations can also be empty if no operation is required, e.g. if > + * the requested mapping already exists in the exact same way. > + * > + * The single map operation, if existent, represents the original map operation > + * requested by the caller. Please note that this operation might be altered > + * comparing it with the original map operation, e.g. because it was merged with > + * an already existent mapping. Hence, drivers must execute this map operation > + * instead of the original one they passed to drm_gpuva_sm_map_ops_create(). > + * > + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates whether the > + * &drm_gpuva to unmap is physically contiguous with the original mapping > + * request. Optionally, if 'keep' is set, drivers may keep the actual page table > + * entries for this &drm_gpuva, adding the missing page table entries only and > + * update the &drm_gpuva_manager's view of things accordingly. > + * > + * Drivers may do the same optimization, namely delta page table updates, also > + * for remap operations. This is possible since &drm_gpuva_op_remap consists of > + * one unmap operation and one or two map operations, such that drivers can > + * derive the page table update delta accordingly. > + * > + * Note that there can't be more than two existent mappings to split up, one at > + * the beginning and one at the end of the new mapping, hence there is a > + * maximum of two remap operations. > + * > + * Generally, the DRM GPU VA manager never merges mappings across the > + * boundaries of &drm_gpuva_regions. This is the case since merging between > + * GPU VA regions would result into unmap and map operations to be issued for > + * both regions involved although the original mapping request was referred to > + * one specific GPU VA region only. Since the other GPU VA region, the one not > + * explicitly requested to be altered, might be in use by the GPU, we are not > + * allowed to issue any map/unmap operations for this region. > + * > + * Note that before calling drm_gpuva_sm_map_ops_create() again with another > + * mapping request it is necessary to update the &drm_gpuva_manager's view of > + * the GPU VA space. The previously obtained operations must be either fully > + * processed or completely abandoned. > + * > + * To update the &drm_gpuva_manager's view of the GPU VA space > + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or > + * drm_gpuva_destroy_unlocked() should be used. > + * > + * Analogue to drm_gpuva_sm_map_ops_create() drm_gpuva_sm_unmap_ops_create() > + * provides drivers a the list of operations to be executed in order to unmap > + * a range of GPU VA space. The logic behind this functions is way simpler > + * though: For all existent mappings enclosed by the given range unmap > + * operations are created. For mappings which are only partically located within > + * the given range, remap operations are created such that those mappings are > + * split up and re-mapped partically. > + * > + * The following paragraph depicts the basic constellations of existent GPU VA > + * mappings, a newly requested mapping and the resulting mappings as implemented > + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations > + * of those constellations. > + * > + * :: > + * > + * 1) Existent mapping is kept. > + * ---------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * req: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 2) Existent mapping is replaced. > + * -------------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * req: |-----------| (bo_offset=m) > + * > + * 0 a 1 > + * new: |-----------| (bo_offset=m) > + * > + * > + * 3) Existent mapping is replaced. > + * -------------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 0 b 1 > + * req: |-----------| (bo_offset=n) > + * > + * 0 b 1 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 4) Existent mapping is replaced. > + * -------------------------------- > + * > + * 0 a 1 > + * old: |-----| (bo_offset=n) > + * > + * 0 a 2 > + * req: |-----------| (bo_offset=n) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or bo_offset. > + * > + * > + * 5) Existent mapping is split. > + * ----------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 0 b 1 > + * req: |-----| (bo_offset=n) > + * > + * 0 b 1 a' 2 > + * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > + * > + * > + * 6) Existent mapping is kept. > + * ---------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 0 a 1 > + * req: |-----| (bo_offset=n) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 7) Existent mapping is split. > + * ----------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 b 2 > + * req: |-----| (bo_offset=m) > + * > + * 0 a 1 b 2 > + * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) > + * > + * > + * 8) Existent mapping is kept. > + * ---------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 a 2 > + * req: |-----| (bo_offset=n+1) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * > + * 9) Existent mapping is split. > + * ----------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 b 3 > + * req: |-----------| (bo_offset=m) > + * > + * 0 a 1 b 3 > + * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) > + * > + * > + * 10) Existent mapping is merged. > + * ------------------------------- > + * > + * 0 a 2 > + * old: |-----------| (bo_offset=n) > + * > + * 1 a 3 > + * req: |-----------| (bo_offset=n+1) > + * > + * 0 a 3 > + * new: |-----------------| (bo_offset=n) > + * > + * > + * 11) Existent mapping is split. > + * ------------------------------ > + * > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > + * > + * 1 b 2 > + * req: |-----| (bo_offset=m) > + * > + * 0 a 1 b 2 a' 3 > + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) > + * > + * > + * 12) Existent mapping is kept. > + * ----------------------------- > + * > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > + * > + * 1 a 2 > + * req: |-----| (bo_offset=n+1) > + * > + * 0 a 3 > + * old: |-----------------| (bo_offset=n) > + * > + * > + * 13) Existent mapping is replaced. > + * --------------------------------- > + * > + * 1 a 2 > + * old: |-----| (bo_offset=n) > + * > + * 0 a 2 > + * req: |-----------| (bo_offset=n) > + * > + * 0 a 2 > + * new: |-----------| (bo_offset=n) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > + * > + * > + * 14) Existent mapping is replaced. > + * --------------------------------- > + * > + * 1 a 2 > + * old: |-----| (bo_offset=n) > + * > + * 0 a 3 > + * req: |----------------| (bo_offset=n) > + * > + * 0 a 3 > + * new: |----------------| (bo_offset=n) > + * > + * Note: We expect to see the same result for a request with a different bo > + * and/or non-contiguous bo_offset. > + * > + * > + * 15) Existent mapping is split. > + * ------------------------------ > + * > + * 1 a 3 > + * old: |-----------| (bo_offset=n) > + * > + * 0 b 2 > + * req: |-----------| (bo_offset=m) > + * > + * 0 b 2 a' 3 > + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) > + * > + * > + * 16) Existent mappings are merged. > + * --------------------------------- > + * > + * 0 a 1 > + * old: |-----------| (bo_offset=n) > + * > + * 2 a 3 > + * old': |-----------| (bo_offset=n+2) > + * > + * 1 a 2 > + * req: |-----------| (bo_offset=n+1) > + * > + * a > + * new: |----------------------------------| (bo_offset=n) > + */ > + > +/** > + * DOC: Locking > + * > + * Generally, the GPU VA manager does not take care of locking itself, it is > + * the drivers responsibility to take care about locking. Drivers might want to > + * protect the following operations: inserting, destroying and iterating > + * &drm_gpuva and &drm_gpuva_region objects as well as generating split and merge > + * operations. > + * > + * The GPU VA manager does take care of the locking of the backing > + * &drm_gem_object buffers GPU VA lists though, unless the provided functions > + * documentation claims otherwise. > + */ > + > +/** > + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager > + * @mgr: pointer to the &drm_gpuva_manager to initialize > + * @name: the name of the GPU VA space > + * @start_offset: the start offset of the GPU VA space > + * @range: the size of the GPU VA space > + * @reserve_offset: the start of the kernel reserved GPU VA area > + * @reserve_range: the size of the kernel reserved GPU VA area > + * > + * The &drm_gpuva_manager must be initialized with this function before use. > + * > + * Note that @mgr must be cleared to 0 before calling this function. The given > + * &name is expected to be managed by the surrounding driver structures. > + */ > +void > +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, > + const char *name, > + u64 start_offset, u64 range, > + u64 reserve_offset, u64 reserve_range) > +{ > + drm_mm_init(&mgr->va_mm, start_offset, range); > + drm_mm_init(&mgr->region_mm, start_offset, range); > + > + mgr->mm_start = start_offset; > + mgr->mm_range = range; > + > + mgr->name = name ? name : "unknown"; > + > + memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); > + mgr->kernel_alloc_node.start = reserve_offset; > + mgr->kernel_alloc_node.size = reserve_range; > + drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); > +} > +EXPORT_SYMBOL(drm_gpuva_manager_init); > + > +/** > + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager > + * @mgr: pointer to the &drm_gpuva_manager to clean up > + * > + * Note that it is a bug to call this function on a manager that still > + * holds GPU VA mappings. > + */ > +void > +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) > +{ > + mgr->name = NULL; > + drm_mm_remove_node(&mgr->kernel_alloc_node); > + drm_mm_takedown(&mgr->va_mm); > + drm_mm_takedown(&mgr->region_mm); > +} > +EXPORT_SYMBOL(drm_gpuva_manager_destroy); > + > +static struct drm_gpuva_region * > +drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) > +{ > + struct drm_gpuva_region *reg; > + > + /* Find the VA region the requested range is strictly enclosed by. */ > + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { > + if (reg->node.start <= addr && > + reg->node.start + reg->node.size >= addr + range && > + ®->node != &mgr->kernel_alloc_node) > + return reg; > + } > + > + return NULL; > +} > + > +static bool > +drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) > +{ > + return !!drm_gpuva_in_region(mgr, addr, range); > +} > + > +/** > + * drm_gpuva_insert - insert a &drm_gpuva > + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in > + * @va: the &drm_gpuva to insert > + * @addr: the start address of the GPU VA > + * @range: the range of the GPU VA > + * > + * Insert a &drm_gpuva with a given address and range into a > + * &drm_gpuva_manager. > + * > + * The function assumes the caller does not hold the &drm_gem_object's > + * GPU VA list mutex. > + * > + * Returns: 0 on success, negative error code on failure. > + */ > +int > +drm_gpuva_insert(struct drm_gpuva_manager *mgr, > + struct drm_gpuva *va, > + u64 addr, u64 range) > +{ > + struct drm_gpuva_region *reg; > + int ret; > + > + if (!va->gem.obj) > + return -EINVAL; > + > + reg = drm_gpuva_in_region(mgr, addr, range); > + if (!reg) > + return -EINVAL; > + > + ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, > + range, 0, > + 0, addr, > + addr + range, > + DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); > + if (ret) > + return ret; > + > + va->mgr = mgr; > + va->region = reg; > + > + return 0; > +} > +EXPORT_SYMBOL(drm_gpuva_insert); > + > +/** > + * drm_gpuva_link_locked - link a &drm_gpuva > + * @va: the &drm_gpuva to link > + * > + * This adds the given &va to the GPU VA list of the &drm_gem_object it is > + * associated with. > + * > + * The function assumes the caller already holds the &drm_gem_object's > + * GPU VA list mutex. > + */ > +void > +drm_gpuva_link_locked(struct drm_gpuva *va) > +{ > + lockdep_assert_held(&va->gem.obj->gpuva.mutex); > + list_add_tail(&va->head, &va->gem.obj->gpuva.list); > +} > +EXPORT_SYMBOL(drm_gpuva_link_locked); > + > +/** > + * drm_gpuva_link_unlocked - unlink a &drm_gpuva > + * @va: the &drm_gpuva to unlink > + * > + * This adds the given &va to the GPU VA list of the &drm_gem_object it is > + * associated with. > + * > + * The function assumes the caller does not hold the &drm_gem_object's > + * GPU VA list mutex. > + */ > +void > +drm_gpuva_link_unlocked(struct drm_gpuva *va) > +{ > + drm_gem_gpuva_lock(va->gem.obj); > + drm_gpuva_link_locked(va); > + drm_gem_gpuva_unlock(va->gem.obj); > +} > +EXPORT_SYMBOL(drm_gpuva_link_unlocked); > + > +/** > + * drm_gpuva_unlink_locked - unlink a &drm_gpuva > + * @va: the &drm_gpuva to unlink > + * > + * This removes the given &va from the GPU VA list of the &drm_gem_object it is > + * associated with. > + * > + * The function assumes the caller already holds the &drm_gem_object's > + * GPU VA list mutex. > + */ > +void > +drm_gpuva_unlink_locked(struct drm_gpuva *va) > +{ > + lockdep_assert_held(&va->gem.obj->gpuva.mutex); > + list_del_init(&va->head); > +} > +EXPORT_SYMBOL(drm_gpuva_unlink_locked); > + > +/** > + * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva > + * @va: the &drm_gpuva to unlink > + * > + * This removes the given &va from the GPU VA list of the &drm_gem_object it is > + * associated with. > + * > + * The function assumes the caller does not hold the &drm_gem_object's > + * GPU VA list mutex. > + */ > +void > +drm_gpuva_unlink_unlocked(struct drm_gpuva *va) > +{ > + drm_gem_gpuva_lock(va->gem.obj); > + drm_gpuva_unlink_locked(va); > + drm_gem_gpuva_unlock(va->gem.obj); > +} > +EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); > + > +/** > + * drm_gpuva_destroy_locked - destroy a &drm_gpuva > + * @va: the &drm_gpuva to destroy > + * > + * This removes the given &va from GPU VA list of the &drm_gem_object it is > + * associated with and removes it from the underlaying range allocator. > + * > + * The function assumes the caller already holds the &drm_gem_object's > + * GPU VA list mutex. > + */ > +void > +drm_gpuva_destroy_locked(struct drm_gpuva *va) > +{ > + lockdep_assert_held(&va->gem.obj->gpuva.mutex); > + > + list_del(&va->head); > + drm_mm_remove_node(&va->node); > +} > +EXPORT_SYMBOL(drm_gpuva_destroy_locked); > + > +/** > + * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva > + * @va: the &drm_gpuva to destroy > + * > + * This removes the given &va from GPU VA list of the &drm_gem_object it is > + * associated with and removes it from the underlaying range allocator. > + * > + * The function assumes the caller does not hold the &drm_gem_object's > + * GPU VA list mutex. > + */ > +void > +drm_gpuva_destroy_unlocked(struct drm_gpuva *va) > +{ > + drm_gem_gpuva_lock(va->gem.obj); > + list_del(&va->head); > + drm_gem_gpuva_unlock(va->gem.obj); > + > + drm_mm_remove_node(&va->node); > +} > +EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); > + > +/** > + * drm_gpuva_find - find a &drm_gpuva > + * @mgr: the &drm_gpuva_manager to search in > + * @addr: the &drm_gpuvas address > + * @range: the &drm_gpuvas range > + * > + * Returns: the &drm_gpuva at a given &addr and with a given &range > + */ > +struct drm_gpuva * > +drm_gpuva_find(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range) > +{ > + struct drm_gpuva *va; > + > + drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { > + if (va->node.start == addr && > + va->node.size == range) > + return va; > + } > + > + return NULL; > +} > +EXPORT_SYMBOL(drm_gpuva_find); > + > +/** > + * drm_gpuva_find_prev - find the &drm_gpuva before the given address > + * @mgr: the &drm_gpuva_manager to search in > + * @start: the given GPU VA's start address > + * > + * Find the adjacent &drm_gpuva before the GPU VA with given &start address. > + * > + * Note that if there is any free space between the GPU VA mappings no mapping > + * is returned. > + * > + * Returns: a pointer to the found &drm_gpuva or NULL if none was found > + */ > +struct drm_gpuva * > +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) > +{ > + struct drm_mm_node *node; > + > + if (start <= mgr->mm_start || > + start > (mgr->mm_start + mgr->mm_range)) > + return NULL; > + > + node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); > + if (node == &mgr->va_mm.head_node) > + return NULL; > + > + return (struct drm_gpuva *)node; > +} > +EXPORT_SYMBOL(drm_gpuva_find_prev); > + > +/** > + * drm_gpuva_find_next - find the &drm_gpuva after the given address > + * @mgr: the &drm_gpuva_manager to search in > + * @end: the given GPU VA's end address > + * > + * Find the adjacent &drm_gpuva after the GPU VA with given &end address. > + * > + * Note that if there is any free space between the GPU VA mappings no mapping > + * is returned. > + * > + * Returns: a pointer to the found &drm_gpuva or NULL if none was found > + */ > +struct drm_gpuva * > +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) > +{ > + struct drm_mm_node *node; > + > + if (end < mgr->mm_start || > + end >= (mgr->mm_start + mgr->mm_range)) > + return NULL; > + > + node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); > + if (node == &mgr->va_mm.head_node) > + return NULL; > + > + return (struct drm_gpuva *)node; > +} > +EXPORT_SYMBOL(drm_gpuva_find_next); > + > +/** > + * drm_gpuva_region_insert - insert a &drm_gpuva_region > + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in > + * @reg: the &drm_gpuva_region to insert > + * @addr: the start address of the GPU VA > + * @range: the range of the GPU VA > + * > + * Insert a &drm_gpuva_region with a given address and range into a > + * &drm_gpuva_manager. > + * > + * Returns: 0 on success, negative error code on failure. > + */ > +int > +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg, > + u64 addr, u64 range) > +{ > + int ret; > + > + ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, > + range, 0, > + 0, addr, > + addr + range, > + DRM_MM_INSERT_LOW| > + DRM_MM_INSERT_ONCE); > + if (ret) > + return ret; > + > + reg->mgr = mgr; > + > + return 0; > +} > +EXPORT_SYMBOL(drm_gpuva_region_insert); > + > +/** > + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region > + * @mgr: the &drm_gpuva_manager holding the region > + * @reg: the &drm_gpuva to destroy > + * > + * This removes the given ® from the underlaying range allocator. > + */ > +void > +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg) > +{ > + struct drm_gpuva *va; > + > + drm_gpuva_for_each_va_in_range(va, mgr, > + reg->node.start, > + reg->node.size) { > + WARN(1, "GPU VA region must be empty on destroy.\n"); > + return; > + } > + > + if (®->node == &mgr->kernel_alloc_node) { > + WARN(1, "Can't destroy kernel reserved region.\n"); > + return; > + } > + > + drm_mm_remove_node(®->node); > +} > +EXPORT_SYMBOL(drm_gpuva_region_destroy); > + > +/** > + * drm_gpuva_region_find - find a &drm_gpuva_region > + * @mgr: the &drm_gpuva_manager to search in > + * @addr: the &drm_gpuva_regions address > + * @range: the &drm_gpuva_regions range > + * > + * Returns: the &drm_gpuva_region at a given &addr and with a given &range > + */ > +struct drm_gpuva_region * > +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range) > +{ > + struct drm_gpuva_region *reg; > + > + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) > + if (reg->node.start == addr && > + reg->node.size == range) > + return reg; > + > + return NULL; > +} > +EXPORT_SYMBOL(drm_gpuva_region_find); > + > +static int > +gpuva_op_map_new(struct drm_gpuva_op **pop, > + u64 addr, u64 range, > + struct drm_gem_object *obj, u64 offset) > +{ > + struct drm_gpuva_op *op; > + > + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); > + if (!op) > + return -ENOMEM; > + > + op->op = DRM_GPUVA_OP_MAP; > + op->map.va.addr = addr; > + op->map.va.range = range; > + op->map.gem.obj = obj; > + op->map.gem.offset = offset; > + > + return 0; > +} > + > +static int > +gpuva_op_remap_new(struct drm_gpuva_op **pop, > + struct drm_gpuva_op_map *prev, > + struct drm_gpuva_op_map *next, > + struct drm_gpuva_op_unmap *unmap) > +{ > + struct drm_gpuva_op *op; > + struct drm_gpuva_op_remap *r; > + > + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); > + if (!op) > + return -ENOMEM; > + > + op->op = DRM_GPUVA_OP_REMAP; > + r = &op->remap; > + > + if (prev) { > + r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); > + if (!r->prev) > + goto err_free_op; > + } > + > + if (next) { > + r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); > + if (!r->next) > + goto err_free_prev; > + } > + > + r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); > + if (!r->unmap) > + goto err_free_next; > + > + return 0; > + > +err_free_next: > + if (next) > + kfree(r->next); > +err_free_prev: > + if (prev) > + kfree(r->prev); > +err_free_op: > + kfree(op); > + *pop = NULL; > + > + return -ENOMEM; > +} > + > +static int > +gpuva_op_unmap_new(struct drm_gpuva_op **pop, > + struct drm_gpuva *va, bool merge) > +{ > + struct drm_gpuva_op *op; > + > + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); > + if (!op) > + return -ENOMEM; > + > + op->op = DRM_GPUVA_OP_UNMAP; > + op->unmap.va = va; > + op->unmap.keep = merge; > + > + return 0; > +} > + > +#define op_map_new_to_list(_ops, _addr, _range, \ > + _obj, _offset) \ > +do { \ > + struct drm_gpuva_op *op; \ > + \ > + ret = gpuva_op_map_new(&op, _addr, _range, \ > + _obj, _offset); \ > + if (ret) \ > + goto err_free_ops; \ > + \ > + list_add_tail(&op->entry, _ops); \ > +} while (0) > + > +#define op_remap_new_to_list(_ops, _prev, _next, \ > + _unmap) \ > +do { \ > + struct drm_gpuva_op *op; \ > + \ > + ret = gpuva_op_remap_new(&op, _prev, _next, \ > + _unmap); \ > + if (ret) \ > + goto err_free_ops; \ > + \ > + list_add_tail(&op->entry, _ops); \ > +} while (0) > + > +#define op_unmap_new_to_list(_ops, _gpuva, _merge) \ > +do { \ > + struct drm_gpuva_op *op; \ > + \ > + ret = gpuva_op_unmap_new(&op, _gpuva, _merge); \ > + if (ret) \ > + goto err_free_ops; \ > + \ > + list_add_tail(&op->entry, _ops); \ > +} while (0) > + > +/** > + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split and merge > + * @mgr: the &drm_gpuva_manager representing the GPU VA space > + * @req_addr: the start address of the new mapping > + * @req_range: the range of the new mapping > + * @req_obj: the &drm_gem_object to map > + * @req_offset: the offset within the &drm_gem_object > + * > + * This function creates a list of operations to perform splitting and merging > + * of existent mapping(s) with the newly requested one. > + * > + * The list can be iterated with &drm_gpuva_for_each_op and must be processed > + * in the given order. It can contain map, unmap and remap operations, but it > + * also can be empty if no operation is required, e.g. if the requested mapping > + * already exists is the exact same way. > + * > + * There can be an arbitrary amount of unmap operations, a maximum of two remap > + * operations and a single map operation. The latter one, if existent, > + * represents the original map operation requested by the caller. Please note > + * that the map operation might has been modified, e.g. if it was > + * merged with an existent mapping. > + * > + * Note that before calling this function again with another mapping request it > + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. > + * The previously obtained operations must be either processed or abandoned. > + * To update the &drm_gpuva_manager's view of the GPU VA space > + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or > + * drm_gpuva_destroy_unlocked() should be used. > + * > + * After the caller finished processing the returned &drm_gpuva_ops, they must > + * be freed with &drm_gpuva_ops_free. > + * > + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure > + */ > +struct drm_gpuva_ops * > +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, > + u64 req_addr, u64 req_range, > + struct drm_gem_object *req_obj, u64 req_offset) > +{ > + struct drm_gpuva_ops *ops; > + struct drm_gpuva *va, *prev = NULL; > + u64 req_end = req_addr + req_range; > + bool skip_pmerge = false, skip_nmerge = false; > + int ret; > + > + if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) > + return ERR_PTR(-EINVAL); > + > + ops = kzalloc(sizeof(*ops), GFP_KERNEL); > + if (!ops) > + return ERR_PTR(-ENOMEM); > + > + INIT_LIST_HEAD(&ops->list); > + > + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { > + struct drm_gem_object *obj = va->gem.obj; > + u64 offset = va->gem.offset; > + u64 addr = va->node.start; > + u64 range = va->node.size; > + u64 end = addr + range; > + > + /* Generally, we want to skip merging with potential mappings > + * left and right of the requested one when we found a > + * collision, since merging happens in this loop already. > + * > + * However, there is one exception when the requested mapping > + * spans into a free VM area. If this is the case we might > + * still hit the boundary of another mapping before and/or > + * after the free VM area. > + */ > + skip_pmerge = true; > + skip_nmerge = true; > + > + if (addr == req_addr) { > + bool merge = obj == req_obj && > + offset == req_offset; > + if (end == req_end) { > + if (merge) > + goto done; > + > + op_unmap_new_to_list(&ops->list, va, false); > + break; > + } > + > + if (end < req_end) { > + skip_nmerge = false; > + op_unmap_new_to_list(&ops->list, va, merge); > + goto next; > + } > + > + if (end > req_end) { > + struct drm_gpuva_op_map n = { > + .va.addr = req_end, > + .va.range = range - req_range, > + .gem.obj = obj, > + .gem.offset = offset + req_range, > + }; > + struct drm_gpuva_op_unmap u = { .va = va }; > + > + if (merge) > + goto done; > + > + op_remap_new_to_list(&ops->list, NULL, &n, &u); > + break; > + } > + } else if (addr < req_addr) { > + u64 ls_range = req_addr - addr; > + struct drm_gpuva_op_map p = { > + .va.addr = addr, > + .va.range = ls_range, > + .gem.obj = obj, > + .gem.offset = offset, > + }; > + struct drm_gpuva_op_unmap u = { .va = va }; > + bool merge = obj == req_obj && > + offset + ls_range == req_offset; > + > + if (end == req_end) { > + if (merge) > + goto done; > + > + op_remap_new_to_list(&ops->list, &p, NULL, &u); > + break; > + } > + > + if (end < req_end) { > + u64 new_addr = addr; > + u64 new_range = req_range + ls_range; > + u64 new_offset = offset; > + > + /* We validated that the requested mapping is > + * within a single VA region already. > + * Since it overlaps the current mapping (which > + * can't cross a VA region boundary) we can be > + * sure that we're still within the boundaries > + * of the same VA region after merging. > + */ > + if (merge) { > + req_offset = new_offset; > + req_addr = new_addr; > + req_range = new_range; > + op_unmap_new_to_list(&ops->list, va, true); > + goto next; > + } > + > + op_remap_new_to_list(&ops->list, &p, NULL, &u); > + goto next; > + } > + > + if (end > req_end) { > + struct drm_gpuva_op_map n = { > + .va.addr = req_end, > + .va.range = end - req_end, > + .gem.obj = obj, > + .gem.offset = offset + ls_range + > + req_range, > + }; > + > + if (merge) > + goto done; > + > + op_remap_new_to_list(&ops->list, &p, &n, &u); > + break; > + } > + } else if (addr > req_addr) { > + bool merge = obj == req_obj && > + offset == req_offset + > + (addr - req_addr); > + if (!prev) > + skip_pmerge = false; > + > + if (end == req_end) { > + op_unmap_new_to_list(&ops->list, va, merge); > + break; > + } > + > + if (end < req_end) { > + skip_nmerge = false; > + op_unmap_new_to_list(&ops->list, va, merge); > + goto next; > + } > + > + if (end > req_end) { > + struct drm_gpuva_op_map n = { > + .va.addr = req_end, > + .va.range = end - req_end, > + .gem.obj = obj, > + .gem.offset = offset + req_end - addr, > + }; > + struct drm_gpuva_op_unmap u = { .va = va }; > + u64 new_end = end; > + u64 new_range = new_end - req_addr; > + > + /* We validated that the requested mapping is > + * within a single VA region already. > + * Since it overlaps the current mapping (which > + * can't cross a VA region boundary) we can be > + * sure that we're still within the boundaries > + * of the same VA region after merging. > + */ > + if (merge) { > + req_end = new_end; > + req_range = new_range; > + op_unmap_new_to_list(&ops->list, va, true); > + break; > + } > + > + op_remap_new_to_list(&ops->list, NULL, &n, &u); > + break; > + } > + } > +next: > + prev = va; > + } > + > + va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); > + if (va) { > + struct drm_gem_object *obj = va->gem.obj; > + u64 offset = va->gem.offset; > + u64 addr = va->node.start; > + u64 range = va->node.size; > + u64 new_offset = offset; > + u64 new_addr = addr; > + u64 new_range = req_range + range; > + bool merge = obj == req_obj && > + offset + range == req_offset; > + > + /* Don't merge over VA region boundaries. */ > + merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); > + if (merge) { > + op_unmap_new_to_list(&ops->list, va, true); > + > + req_offset = new_offset; > + req_addr = new_addr; > + req_range = new_range; > + } > + } > + > + va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); > + if (va) { > + struct drm_gem_object *obj = va->gem.obj; > + u64 offset = va->gem.offset; > + u64 addr = va->node.start; > + u64 range = va->node.size; > + u64 end = addr + range; > + u64 new_range = req_range + range; > + u64 new_end = end; > + bool merge = obj == req_obj && > + offset == req_offset + req_range; > + > + /* Don't merge over VA region boundaries. */ > + merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); > + if (merge) { > + op_unmap_new_to_list(&ops->list, va, true); > + > + req_range = new_range; > + req_end = new_end; > + } > + } > + > + op_map_new_to_list(&ops->list, > + req_addr, req_range, > + req_obj, req_offset); > + > +done: > + return ops; > + > +err_free_ops: > + drm_gpuva_ops_free(ops); > + return ERR_PTR(ret); > +} > +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); > + > +#undef op_map_new_to_list > +#undef op_remap_new_to_list > +#undef op_unmap_new_to_list > + > +/** > + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to split on unmap > + * @mgr: the &drm_gpuva_manager representing the GPU VA space > + * @req_addr: the start address of the range to unmap > + * @req_range: the range of the mappings to unmap > + * > + * This function creates a list of operations to perform unmapping and, if > + * required, splitting of the mappings overlapping the unmap range. > + * > + * The list can be iterated with &drm_gpuva_for_each_op and must be processed > + * in the given order. It can contain unmap and remap operations, depending on > + * whether there are actual overlapping mappings to split. > + * > + * There can be an arbitrary amount of unmap operations and a maximum of two > + * remap operations. > + * > + * Note that before calling this function again with another range to unmap it > + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. > + * The previously obtained operations must be processed or abandoned. > + * To update the &drm_gpuva_manager's view of the GPU VA space > + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or > + * drm_gpuva_destroy_unlocked() should be used. > + * > + * After the caller finished processing the returned &drm_gpuva_ops, they must > + * be freed with &drm_gpuva_ops_free. > + * > + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure > + */ > +struct drm_gpuva_ops * > +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, > + u64 req_addr, u64 req_range) > +{ > + struct drm_gpuva_ops *ops; > + struct drm_gpuva_op *op; > + struct drm_gpuva_op_remap *r; > + struct drm_gpuva *va; > + u64 req_end = req_addr + req_range; > + int ret; > + > + ops = kzalloc(sizeof(*ops), GFP_KERNEL); > + if (!ops) > + return ERR_PTR(-ENOMEM); > + > + INIT_LIST_HEAD(&ops->list); > + > + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { > + struct drm_gem_object *obj = va->gem.obj; > + u64 offset = va->gem.offset; > + u64 addr = va->node.start; > + u64 range = va->node.size; > + u64 end = addr + range; > + > + op = kzalloc(sizeof(*op), GFP_KERNEL); > + if (!op) { > + ret = -ENOMEM; > + goto err_free_ops; > + } > + > + r = &op->remap; > + > + if (addr < req_addr) { > + r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); > + if (!r->prev) { > + ret = -ENOMEM; > + goto err_free_op; > + } > + > + r->prev->va.addr = addr; > + r->prev->va.range = req_addr - addr; > + r->prev->gem.obj = obj; > + r->prev->gem.offset = offset; > + } > + > + if (end > req_end) { > + r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); > + if (!r->next) { > + ret = -ENOMEM; > + goto err_free_prev; > + } > + > + r->next->va.addr = req_end; > + r->next->va.range = end - req_end; > + r->next->gem.obj = obj; > + r->next->gem.offset = offset + (req_end - addr); > + } > + > + if (op->remap.prev || op->remap.next) { > + op->op = DRM_GPUVA_OP_REMAP; > + r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); > + if (!r->unmap) { > + ret = -ENOMEM; > + goto err_free_next; > + } > + > + r->unmap->va = va; > + } else { > + op->op = DRM_GPUVA_OP_UNMAP; > + op->unmap.va = va; > + } > + > + list_add_tail(&op->entry, &ops->list); > + } > + > + return ops; > + > +err_free_next: > + if (r->next) > + kfree(r->next); > +err_free_prev: > + if (r->prev) > + kfree(r->prev); > +err_free_op: > + kfree(op); > +err_free_ops: > + drm_gpuva_ops_free(ops); > + return ERR_PTR(ret); > +} > +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); > + > +/** > + * drm_gpuva_ops_free - free the given &drm_gpuva_ops > + * @ops: the &drm_gpuva_ops to free > + * > + * Frees the given &drm_gpuva_ops structure including all the ops associated > + * with it. > + */ > +void > +drm_gpuva_ops_free(struct drm_gpuva_ops *ops) > +{ > + struct drm_gpuva_op *op, *next; > + > + drm_gpuva_for_each_op_safe(op, next, ops) { > + list_del(&op->entry); > + if (op->op == DRM_GPUVA_OP_REMAP) { > + if (op->remap.prev) > + kfree(op->remap.prev); > + > + if (op->remap.next) > + kfree(op->remap.next); > + > + kfree(op->remap.unmap); > + } > + kfree(op); > + } > + > + kfree(ops); > +} > +EXPORT_SYMBOL(drm_gpuva_ops_free); > diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h > index d7c521e8860f..6feacd93aca6 100644 > --- a/include/drm/drm_drv.h > +++ b/include/drm/drm_drv.h > @@ -104,6 +104,12 @@ enum drm_driver_feature { > * acceleration should be handled by two drivers that are connected using auxiliary bus. > */ > DRIVER_COMPUTE_ACCEL = BIT(7), > + /** > + * @DRIVER_GEM_GPUVA: > + * > + * Driver supports user defined GPU VA bindings for GEM objects. > + */ > + DRIVER_GEM_GPUVA = BIT(8), > > /* IMPORTANT: Below are all the legacy flags, add new ones above. */ > > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h > index 772a4adf5287..4a3679034966 100644 > --- a/include/drm/drm_gem.h > +++ b/include/drm/drm_gem.h > @@ -36,6 +36,8 @@ > > #include <linux/kref.h> > #include <linux/dma-resv.h> > +#include <linux/list.h> > +#include <linux/mutex.h> > > #include <drm/drm_vma_manager.h> > > @@ -337,6 +339,17 @@ struct drm_gem_object { > */ > struct dma_resv _resv; > > + /** > + * @gpuva: > + * > + * Provides the list and list mutex of GPU VAs attached to this > + * GEM object. > + */ > + struct { > + struct list_head list; > + struct mutex mutex; > + } gpuva; > + > /** > * @funcs: > * > @@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru *lru, struct drm_gem_object *obj); > unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned nr_to_scan, > bool (*shrink)(struct drm_gem_object *obj)); > > +/** > + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object > + * @obj: the &drm_gem_object > + * > + * This initializes the &drm_gem_object's &drm_gpuva list and the mutex > + * protecting it. > + * > + * Calling this function is only necessary for drivers intending to support the > + * &drm_driver_feature DRIVER_GEM_GPUVA. > + */ > +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) > +{ > + INIT_LIST_HEAD(&obj->gpuva.list); > + mutex_init(&obj->gpuva.mutex); > +} > + > +/** > + * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex > + * @obj: the &drm_gem_object > + * > + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. > + */ > +static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) > +{ > + mutex_lock(&obj->gpuva.mutex); > +} > + > +/** > + * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex > + * @obj: the &drm_gem_object > + * > + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. > + */ > +static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) > +{ > + mutex_unlock(&obj->gpuva.mutex); > +} > + > +/** > + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas > + * @entry: &drm_gpuva structure to assign to in each iteration step > + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with > + * > + * This iterator walks over all &drm_gpuva structures associated with the > + * &drm_gpuva_manager. > + */ > +#define drm_gem_for_each_gpuva(entry, obj) \ > + list_for_each_entry(entry, &obj->gpuva.list, head) > + > +/** > + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list of gpuvas > + * @entry: &drm_gpuva structure to assign to in each iteration step > + * @next: &next &drm_gpuva to store the next step > + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with > + * > + * This iterator walks over all &drm_gpuva structures associated with the > + * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence > + * it is save against removal of elements. > + */ > +#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ > + list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) > + > #endif /* __DRM_GEM_H__ */ > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h > new file mode 100644 > index 000000000000..adeb0c916e91 > --- /dev/null > +++ b/include/drm/drm_gpuva_mgr.h > @@ -0,0 +1,527 @@ > +// SPDX-License-Identifier: GPL-2.0 > + > +#ifndef __DRM_GPUVA_MGR_H__ > +#define __DRM_GPUVA_MGR_H__ > + > +/* > + * Copyright (c) 2022 Red Hat. > + * > + * Permission is hereby granted, free of charge, to any person obtaining a > + * copy of this software and associated documentation files (the "Software"), > + * to deal in the Software without restriction, including without limitation > + * the rights to use, copy, modify, merge, publish, distribute, sublicense, > + * and/or sell copies of the Software, and to permit persons to whom the > + * Software is furnished to do so, subject to the following conditions: > + * > + * The above copyright notice and this permission notice shall be included in > + * all copies or substantial portions of the Software. > + * > + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR > + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, > + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL > + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR > + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, > + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR > + * OTHER DEALINGS IN THE SOFTWARE. > + */ > + > +#include <drm/drm_mm.h> > +#include <linux/mm.h> > +#include <linux/rbtree.h> > +#include <linux/spinlock.h> > +#include <linux/types.h> > + > +struct drm_gpuva_region; > +struct drm_gpuva; > +struct drm_gpuva_ops; > + > +/** > + * struct drm_gpuva_manager - DRM GPU VA Manager > + * > + * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using > + * the &drm_mm range allocator. Typically, this structure is embedded in bigger > + * driver structures. > + * > + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or > + * pages. > + * > + * There should be one manager instance per GPU virtual address space. > + */ > +struct drm_gpuva_manager { > + /** > + * @name: the name of the DRM GPU VA space > + */ > + const char *name; > + > + /** > + * @mm_start: start of the VA space > + */ > + u64 mm_start; > + > + /** > + * @mm_range: length of the VA space > + */ > + u64 mm_range; > + > + /** > + * @region_mm: the &drm_mm range allocator to track GPU VA regions > + */ > + struct drm_mm region_mm; > + > + /** > + * @va_mm: the &drm_mm range allocator to track GPU VA mappings > + */ > + struct drm_mm va_mm; > + > + /** > + * @kernel_alloc_node: > + * > + * &drm_mm_node representing the address space cutout reserved for > + * the kernel > + */ > + struct drm_mm_node kernel_alloc_node; > +}; > + > +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, > + const char *name, > + u64 start_offset, u64 range, > + u64 reserve_offset, u64 reserve_range); > +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); > + > +/** > + * struct drm_gpuva_region - structure to track a portion of GPU VA space > + * > + * This structure represents a portion of a GPUs VA space and is associated > + * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. > + * > + * GPU VA mappings, represented by &drm_gpuva objects, are restricted to be > + * placed within a &drm_gpuva_region. > + */ > +struct drm_gpuva_region { > + /** > + * @node: the &drm_mm_node to track the GPU VA region > + */ > + struct drm_mm_node node; > + > + /** > + * @mgr: the &drm_gpuva_manager this object is associated with > + */ > + struct drm_gpuva_manager *mgr; > + > + /** > + * @sparse: indicates whether this region is sparse > + */ > + bool sparse; > +}; > + > +struct drm_gpuva_region * > +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range); > +int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg, > + u64 addr, u64 range); > +void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, > + struct drm_gpuva_region *reg); > + > +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, > + struct drm_gpuva *va, > + u64 addr, u64 range); > +/** > + * drm_gpuva_for_each_region_in_range - iternator to walk over a range of nodes > + * @node__: &drm_gpuva_region structure to assign to in each iteration step > + * @gpuva__: &drm_gpuva_manager structure to walk > + * @start__: starting offset, the first node will overlap this > + * @end__: ending offset, the last node will start before this (but may overlap) > + * > + * This iterator walks over all nodes in the range allocator that lie > + * between @start and @end. It is implemented similarly to list_for_each(), > + * but is using &drm_mm's internal interval tree to accelerate the search for > + * the starting node, and hence isn't safe against removal of elements. It > + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. > + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the > + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the > + * backing &drm_mm, and may even continue indefinitely. > + */ > +#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, end__) \ > + for (node__ = (struct drm_gpuva_region *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ > + (start__), (end__)-1); \ > + node__->node.start < (end__); \ > + node__ = (struct drm_gpuva_region *)list_next_entry(&node__->node, node_list)) > + > +/** > + * drm_gpuva_for_each_region - iternator to walk over a range of nodes > + * @entry: &drm_gpuva_region structure to assign to in each iteration step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva_region structures associated with the > + * &drm_gpuva_manager. > + */ > +#define drm_gpuva_for_each_region(entry, gpuva) \ > + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) > + > +/** > + * drm_gpuva_for_each_region_safe - iternator to safely walk over a range of > + * nodes > + * @entry: &drm_gpuva_region structure to assign to in each iteration step > + * @next: &next &drm_gpuva_region to store the next step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva_region structures associated with the > + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save > + * against removal of elements. > + */ > +#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ > + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) > + > + > +/** > + * enum drm_gpuva_flags - flags for struct drm_gpuva > + */ > +enum drm_gpuva_flags { > + /** > + * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is swapped > + */ > + DRM_GPUVA_SWAPPED = (1 << 0), > +}; > + > +/** > + * struct drm_gpuva - structure to track a GPU VA mapping > + * > + * This structure represents a GPU VA mapping and is associated with a > + * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. > + * > + * Typically, this structure is embedded in bigger driver structures. > + */ > +struct drm_gpuva { > + /** > + * @node: the &drm_mm_node to track the GPU VA mapping > + */ > + struct drm_mm_node node; > + > + /** > + * @mgr: the &drm_gpuva_manager this object is associated with > + */ > + struct drm_gpuva_manager *mgr; > + > + /** > + * @region: the &drm_gpuva_region the &drm_gpuva is mapped in > + */ > + struct drm_gpuva_region *region; > + > + /** > + * @head: the &list_head to attach this object to a &drm_gem_object > + */ > + struct list_head head; > + > + /** > + * @flags: the &drm_gpuva_flags for this mapping > + */ > + enum drm_gpuva_flags flags; > + > + /** > + * @gem: structure containing the &drm_gem_object and it's offset > + */ > + struct { > + /** > + * @offset: the offset within the &drm_gem_object > + */ > + u64 offset; > + > + /** > + * @obj: the mapped &drm_gem_object > + */ > + struct drm_gem_object *obj; > + } gem; > +}; > + > +void drm_gpuva_link_locked(struct drm_gpuva *va); > +void drm_gpuva_link_unlocked(struct drm_gpuva *va); > +void drm_gpuva_unlink_locked(struct drm_gpuva *va); > +void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); > + > +void drm_gpuva_destroy_locked(struct drm_gpuva *va); > +void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); > + > +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range); > +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start); > +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end); > + > +/** > + * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva is swapped > + * @va: the &drm_gpuva to set the swap flag of > + * @swap: indicates whether the &drm_gpuva is swapped > + */ > +static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) > +{ > + if (swap) > + va->flags |= DRM_GPUVA_SWAPPED; > + else > + va->flags &= ~DRM_GPUVA_SWAPPED; > +} > + > +/** > + * drm_gpuva_swapped - indicates whether the backing BO of this &drm_gpuva > + * is swapped > + * @va: the &drm_gpuva to check > + */ > +static inline bool drm_gpuva_swapped(struct drm_gpuva *va) > +{ > + return va->flags & DRM_GPUVA_SWAPPED; > +} > + > +/** > + * drm_gpuva_for_each_va_in_range - iternator to walk over a range of nodes > + * @node__: &drm_gpuva structure to assign to in each iteration step > + * @gpuva__: &drm_gpuva_manager structure to walk > + * @start__: starting offset, the first node will overlap this > + * @end__: ending offset, the last node will start before this (but may overlap) > + * > + * This iterator walks over all nodes in the range allocator that lie > + * between @start and @end. It is implemented similarly to list_for_each(), > + * but is using &drm_mm's internal interval tree to accelerate the search for > + * the starting node, and hence isn't safe against removal of elements. It > + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. > + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the > + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the > + * backing &drm_mm, and may even continue indefinitely. > + */ > +#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, end__) \ > + for (node__ = (struct drm_gpuva *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ > + (start__), (end__)-1); \ > + node__->node.start < (end__); \ > + node__ = (struct drm_gpuva *)list_next_entry(&node__->node, node_list)) > + > +/** > + * drm_gpuva_for_each_va - iternator to walk over a range of nodes > + * @entry: &drm_gpuva structure to assign to in each iteration step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva structures associated with the > + * &drm_gpuva_manager. > + */ > +#define drm_gpuva_for_each_va(entry, gpuva) \ > + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) > + > +/** > + * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of > + * nodes > + * @entry: &drm_gpuva structure to assign to in each iteration step > + * @next: &next &drm_gpuva to store the next step > + * @gpuva: &drm_gpuva_manager structure to walk > + * > + * This iterator walks over all &drm_gpuva structures associated with the > + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save > + * against removal of elements. > + */ > +#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ > + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) > + > +/** > + * enum drm_gpuva_op_type - GPU VA operation type > + * > + * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager > + * can be map, remap or unmap operations. > + */ > +enum drm_gpuva_op_type { > + /** > + * @DRM_GPUVA_OP_MAP: the map op type > + */ > + DRM_GPUVA_OP_MAP, > + > + /** > + * @DRM_GPUVA_OP_REMAP: the remap op type > + */ > + DRM_GPUVA_OP_REMAP, > + > + /** > + * @DRM_GPUVA_OP_UNMAP: the unmap op type > + */ > + DRM_GPUVA_OP_UNMAP, > +}; > + > +/** > + * struct drm_gpuva_op_map - GPU VA map operation > + * > + * This structure represents a single map operation generated by the > + * DRM GPU VA manager. > + */ > +struct drm_gpuva_op_map { > + /** > + * @va: structure containing address and range of a map > + * operation > + */ > + struct { > + /** > + * @addr: the base address of the new mapping > + */ > + u64 addr; > + > + /** > + * @range: the range of the new mapping > + */ > + u64 range; > + } va; > + > + /** > + * @gem: structure containing the &drm_gem_object and it's offset > + */ > + struct { > + /** > + * @offset: the offset within the &drm_gem_object > + */ > + u64 offset; > + > + /** > + * @obj: the &drm_gem_object to map > + */ > + struct drm_gem_object *obj; > + } gem; > +}; > + > +/** > + * struct drm_gpuva_op_unmap - GPU VA unmap operation > + * > + * This structure represents a single unmap operation generated by the > + * DRM GPU VA manager. > + */ > +struct drm_gpuva_op_unmap { > + /** > + * @va: the &drm_gpuva to unmap > + */ > + struct drm_gpuva *va; > + > + /** > + * @keep: > + * > + * Indicates whether this &drm_gpuva is physically contiguous with the > + * original mapping request. > + * > + * Optionally, if &keep is set, drivers may keep the actual page table > + * mappings for this &drm_gpuva, adding the missing page table entries > + * only and update the &drm_gpuva_manager accordingly. > + */ > + bool keep; > +}; > + > +/** > + * struct drm_gpuva_op_remap - GPU VA remap operation > + * > + * This represents a single remap operation generated by the DRM GPU VA manager. > + * > + * A remap operation is generated when an existing GPU VA mmapping is split up > + * by inserting a new GPU VA mapping or by partially unmapping existent > + * mapping(s), hence it consists of a maximum of two map and one unmap > + * operation. > + * > + * The @unmap operation takes care of removing the original existing mapping. > + * @prev is used to remap the preceding part, @next the subsequent part. > + * > + * If either a new mapping's start address is aligned with the start address > + * of the old mapping or the new mapping's end address is aligned with the > + * end address of the old mapping, either @prev or @next is NULL. > + * > + * Note, the reason for a dedicated remap operation, rather than arbitrary > + * unmap and map operations, is to give drivers the chance of extracting driver > + * specific data for creating the new mappings from the unmap operations's > + * &drm_gpuva structure which typically is embedded in larger driver specific > + * structures. > + */ > +struct drm_gpuva_op_remap { > + /** > + * @prev: the preceding part of a split mapping > + */ > + struct drm_gpuva_op_map *prev; > + > + /** > + * @next: the subsequent part of a split mapping > + */ > + struct drm_gpuva_op_map *next; > + > + /** > + * @unmap: the unmap operation for the original existing mapping > + */ > + struct drm_gpuva_op_unmap *unmap; > +}; > + > +/** > + * struct drm_gpuva_op - GPU VA operation > + * > + * This structure represents a single generic operation, which can be either > + * map, unmap or remap. > + * > + * The particular type of the operation is defined by @op. > + */ > +struct drm_gpuva_op { > + /** > + * @entry: > + * > + * The &list_head used to distribute instances of this struct within > + * &drm_gpuva_ops. > + */ > + struct list_head entry; > + > + /** > + * @op: the type of the operation > + */ > + enum drm_gpuva_op_type op; > + > + union { > + /** > + * @map: the map operation > + */ > + struct drm_gpuva_op_map map; > + > + /** > + * @unmap: the unmap operation > + */ > + struct drm_gpuva_op_unmap unmap; > + > + /** > + * @remap: the remap operation > + */ > + struct drm_gpuva_op_remap remap; > + }; > +}; > + > +/** > + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op > + */ > +struct drm_gpuva_ops { > + /** > + * @list: the &list_head > + */ > + struct list_head list; > +}; > + > +/** > + * drm_gpuva_for_each_op - iterator to walk over all ops > + * @op: &drm_gpuva_op to assign in each iteration step > + * @ops: &drm_gpuva_ops to walk > + * > + * This iterator walks over all ops within a given list of operations. > + */ > +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry) > + > +/** > + * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops > + * @op: &drm_gpuva_op to assign in each iteration step > + * @next: &next &drm_gpuva_op to store the next step > + * @ops: &drm_gpuva_ops to walk > + * > + * This iterator walks over all ops within a given list of operations. It is > + * implemented with list_for_each_safe(), so save against removal of elements. > + */ > +#define drm_gpuva_for_each_op_safe(op, next, ops) \ > + list_for_each_entry_safe(op, next, &(ops)->list, entry) > + > +struct drm_gpuva_ops * > +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range, > + struct drm_gem_object *obj, u64 offset); > +struct drm_gpuva_ops * > +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, > + u64 addr, u64 range); > +void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); > + > +#endif /* __DRM_GPUVA_MGR_H__ */ > -- > 2.39.0 >
Am 03.02.23 um 18:37 schrieb Matthew Brost: > On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: >> This adds the infrastructure for a manager implementation to keep track >> of GPU virtual address (VA) mappings. >> >> New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers >> start implementing, allow userspace applications to request multiple and >> arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is >> intended to serve the following purposes in this context. >> >> 1) Provide a dedicated range allocator to track GPU VA allocations and >> mappings, making use of the drm_mm range allocator. >> >> 2) Generically connect GPU VA mappings to their backing buffers, in >> particular DRM GEM objects. >> >> 3) Provide a common implementation to perform more complex mapping >> operations on the GPU VA space. In particular splitting and merging >> of GPU VA mappings, e.g. for intersecting mapping requests or partial >> unmap requests. >> > Over the past week I've hacked together a PoC port of Xe to GPUVA [1], so > far it seems really promising. 95% of the way to being feature > equivalent of the current Xe VM bind implementation and have line of > sight to getting sparse bindings implemented on top of GPUVA too. IMO > this has basically everything we need for Xe with a few tweaks. > > I am out until 2/14 but wanted to get my thoughts / suggestions out on > the list before I leave. > > 1. GPUVA post didn't support the way Xe does userptrs - a NULL GEM. I > believe with [2], [3], and [4] GPUVA will support NULL GEMs. Also my > thinking sparse binds will also have NULL GEMs, more on sparse bindings > below. > > 2. I agree with Christian that drm_mm probably isn't what we want to > base the GPUVA implementation on, rather a RB tree or Maple tree has > been discussed. The implementation should be fairly easy to tune once we > have benchmarks running so not to concerned here as we can figure this > out down the line. > > 3. In Xe we want create xe_vm_op list which inherits from drm_gpuva_op > I've done this with a hack [5], I believe when we rebase we can do this > with a custom callback to allocate a large op size. > > 4. I'd like add user bits to drm_gpuva_flags like I do in [6]. This is > similar to DMA_FENCE_FLAG_USER_BITS. > > 5. In Xe we have VM prefetch operation which is needed for our compute > UMD with page faults. I'd like add prefetch type of operation like we do > in [7]. > > 6. In Xe we have VM unbind all mappings for a GEM IOCTL, I'd like to add > support to generate this operation list to GPUVA like we do in [8]. > > 7. I've thought about how Xe will implement sparse mappings (read 0, > writes dropped). My current thinking is a sparse mapping will be > represented as a drm_gpuva rather than region like in Nouveau. Making > regions optional to me seems likes good idea rather than forcing the > user of GPUVA code to create 1 large region for the manager as I > currently do in the Xe PoC. From Danilo's explanation I'm now pretty sure that regions won't work for Nouveau either. He seems to use an incorrect assumption about applications not changing the sparse mappings behind a VkBuffer while the GPU is using it. As far as I can see games like Forza won't work with this approach. > > 8. Personally I'd like the caller to own the locking for GEM drm_gpuva > list (drm_gpuva_link_*, drm_gpuva_unlink_* functions). In Xe we almost > certainly will have the GEM dma-resv lock when we touch this list so an > extra lock here is redundant. Also it kinda goofy that caller owns the > for drm_gpuva insertion / removal but not the locking for this list. > > WRT to Christian thoughts on a common uAPI rules for VM binds, I kinda > like that idea but I don't think that is necessary. All of pur uAPI > should be close but also the GPUVA implementation should be flexible > enough to fit all of our needs and I think for the most part it is. Maybe I should refine my concerns: A common component for GPUVM mappings is a good idea, but we should not expect that to define any driver independent UAPI. If we want to define driver independent UAPI we should do so explicitly. Christian. > > Let me know what everything thinks about this. It would be great if when > I'm back on 2/14 I can rebase the Xe port to GPUVA on another version of > the GPUVA code and get sparse binding support implementation. Also I'd > like to get GPUVA merged in the Xe repo ASAP as our VM bind code badly > needed to be cleaned and this was the push we needed to make this > happen. > > Matt > > [1] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314 > [2] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=2ae21d7a3f52e5eb2c105ed8ae231471274bdc36 > [3] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=49fca9f5d96201f5cbd1b19c7ff17eedfac65cdc > [4] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=61fa6b1e1f10e791ae82358fa971b04421d53024 > [5] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=87fc08dcf0840e794b38269fe4c6a95d088d79ec > [6] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=a4826c22f6788bc29906ffa263c1cd3c4661fa77 > [7] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=f008bbb55b213868e52c7b9cda4c1bfb95af6aee > [8] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=41f4f71c05d04d2b17d988dd95369b5df2d7f681 > >> Idea-suggested-by: Dave Airlie <airlied@redhat.com> >> Signed-off-by: Danilo Krummrich <dakr@redhat.com> >> --- >> Documentation/gpu/drm-mm.rst | 31 + >> drivers/gpu/drm/Makefile | 1 + >> drivers/gpu/drm/drm_gem.c | 3 + >> drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ >> include/drm/drm_drv.h | 6 + >> include/drm/drm_gem.h | 75 ++ >> include/drm/drm_gpuva_mgr.h | 527 ++++++++++++ >> 7 files changed, 1966 insertions(+) >> create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c >> create mode 100644 include/drm/drm_gpuva_mgr.h >> >> diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst >> index a52e6f4117d6..c9f120cfe730 100644 >> --- a/Documentation/gpu/drm-mm.rst >> +++ b/Documentation/gpu/drm-mm.rst >> @@ -466,6 +466,37 @@ DRM MM Range Allocator Function References >> .. kernel-doc:: drivers/gpu/drm/drm_mm.c >> :export: >> >> +DRM GPU VA Manager >> +================== >> + >> +Overview >> +-------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :doc: Overview >> + >> +Split and Merge >> +--------------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :doc: Split and Merge >> + >> +Locking >> +------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :doc: Locking >> + >> + >> +DRM GPU VA Manager Function References >> +-------------------------------------- >> + >> +.. kernel-doc:: include/drm/drm_gpuva_mgr.h >> + :internal: >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :export: >> + >> DRM Buddy Allocator >> =================== >> >> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile >> index 4fe190aee584..de2ffca3b6e4 100644 >> --- a/drivers/gpu/drm/Makefile >> +++ b/drivers/gpu/drm/Makefile >> @@ -45,6 +45,7 @@ drm-y := \ >> drm_vblank.o \ >> drm_vblank_work.o \ >> drm_vma_manager.o \ >> + drm_gpuva_mgr.o \ >> drm_writeback.o >> drm-$(CONFIG_DRM_LEGACY) += \ >> drm_agpsupport.o \ >> diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c >> index 59a0bb5ebd85..65115fe88627 100644 >> --- a/drivers/gpu/drm/drm_gem.c >> +++ b/drivers/gpu/drm/drm_gem.c >> @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device *dev, >> if (!obj->resv) >> obj->resv = &obj->_resv; >> >> + if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) >> + drm_gem_gpuva_init(obj); >> + >> drm_vma_node_reset(&obj->vma_node); >> INIT_LIST_HEAD(&obj->lru_node); >> } >> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c >> new file mode 100644 >> index 000000000000..e665f642689d >> --- /dev/null >> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c >> @@ -0,0 +1,1323 @@ >> +// SPDX-License-Identifier: GPL-2.0 >> +/* >> + * Copyright (c) 2022 Red Hat. >> + * >> + * Permission is hereby granted, free of charge, to any person obtaining a >> + * copy of this software and associated documentation files (the "Software"), >> + * to deal in the Software without restriction, including without limitation >> + * the rights to use, copy, modify, merge, publish, distribute, sublicense, >> + * and/or sell copies of the Software, and to permit persons to whom the >> + * Software is furnished to do so, subject to the following conditions: >> + * >> + * The above copyright notice and this permission notice shall be included in >> + * all copies or substantial portions of the Software. >> + * >> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR >> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, >> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL >> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR >> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >> + * OTHER DEALINGS IN THE SOFTWARE. >> + * >> + * Authors: >> + * Danilo Krummrich <dakr@redhat.com> >> + * >> + */ >> + >> +#include <drm/drm_gem.h> >> +#include <drm/drm_gpuva_mgr.h> >> + >> +/** >> + * DOC: Overview >> + * >> + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager keeps track >> + * of a GPU's virtual address (VA) space and manages the corresponding virtual >> + * mappings represented by &drm_gpuva objects. It also keeps track of the >> + * mapping's backing &drm_gem_object buffers. >> + * >> + * &drm_gem_object buffers maintain a list (and a corresponding list lock) of >> + * &drm_gpuva objects representing all existent GPU VA mappings using this >> + * &drm_gem_object as backing buffer. >> + * >> + * A GPU VA mapping can only be created within a previously allocated >> + * &drm_gpuva_region, which represents a reserved portion of the GPU VA space. >> + * GPU VA mappings are not allowed to span over a &drm_gpuva_region's boundary. >> + * >> + * GPU VA regions can also be flagged as sparse, which allows drivers to create >> + * sparse mappings for a whole GPU VA region in order to support Vulkan >> + * 'Sparse Resources'. >> + * >> + * The GPU VA manager internally uses the &drm_mm range allocator to manage the >> + * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's virtual address >> + * space. >> + * >> + * Besides the GPU VA space regions (&drm_gpuva_region) allocated by a driver >> + * the &drm_gpuva_manager contains a special region representing the portion of >> + * VA space reserved by the kernel. This node is initialized together with the >> + * GPU VA manager instance and removed when the GPU VA manager is destroyed. >> + * >> + * In a typical application drivers would embed struct drm_gpuva_manager, >> + * struct drm_gpuva_region and struct drm_gpuva within their own driver >> + * specific structures, there won't be any memory allocations of it's own nor >> + * memory allocations of &drm_gpuva or &drm_gpuva_region entries. >> + */ >> + >> +/** >> + * DOC: Split and Merge >> + * >> + * The DRM GPU VA manager also provides an algorithm implementing splitting and >> + * merging of existent GPU VA mappings with the ones that are requested to be >> + * mapped or unmapped. This feature is required by the Vulkan API to implement >> + * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to this as >> + * VM BIND. >> + * >> + * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a list of map, unmap >> + * and remap operations for a given newly requested mapping. This list >> + * represents the set of operations to execute in order to integrate the new >> + * mapping cleanly into the current state of the GPU VA space. >> + * >> + * Depending on how the new GPU VA mapping intersects with the existent mappings >> + * of the GPU VA space the &drm_gpuva_ops contain an arbitrary amount of unmap >> + * operations, a maximum of two remap operations and a single map operation. >> + * The set of operations can also be empty if no operation is required, e.g. if >> + * the requested mapping already exists in the exact same way. >> + * >> + * The single map operation, if existent, represents the original map operation >> + * requested by the caller. Please note that this operation might be altered >> + * comparing it with the original map operation, e.g. because it was merged with >> + * an already existent mapping. Hence, drivers must execute this map operation >> + * instead of the original one they passed to drm_gpuva_sm_map_ops_create(). >> + * >> + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates whether the >> + * &drm_gpuva to unmap is physically contiguous with the original mapping >> + * request. Optionally, if 'keep' is set, drivers may keep the actual page table >> + * entries for this &drm_gpuva, adding the missing page table entries only and >> + * update the &drm_gpuva_manager's view of things accordingly. >> + * >> + * Drivers may do the same optimization, namely delta page table updates, also >> + * for remap operations. This is possible since &drm_gpuva_op_remap consists of >> + * one unmap operation and one or two map operations, such that drivers can >> + * derive the page table update delta accordingly. >> + * >> + * Note that there can't be more than two existent mappings to split up, one at >> + * the beginning and one at the end of the new mapping, hence there is a >> + * maximum of two remap operations. >> + * >> + * Generally, the DRM GPU VA manager never merges mappings across the >> + * boundaries of &drm_gpuva_regions. This is the case since merging between >> + * GPU VA regions would result into unmap and map operations to be issued for >> + * both regions involved although the original mapping request was referred to >> + * one specific GPU VA region only. Since the other GPU VA region, the one not >> + * explicitly requested to be altered, might be in use by the GPU, we are not >> + * allowed to issue any map/unmap operations for this region. >> + * >> + * Note that before calling drm_gpuva_sm_map_ops_create() again with another >> + * mapping request it is necessary to update the &drm_gpuva_manager's view of >> + * the GPU VA space. The previously obtained operations must be either fully >> + * processed or completely abandoned. >> + * >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * Analogue to drm_gpuva_sm_map_ops_create() drm_gpuva_sm_unmap_ops_create() >> + * provides drivers a the list of operations to be executed in order to unmap >> + * a range of GPU VA space. The logic behind this functions is way simpler >> + * though: For all existent mappings enclosed by the given range unmap >> + * operations are created. For mappings which are only partically located within >> + * the given range, remap operations are created such that those mappings are >> + * split up and re-mapped partically. >> + * >> + * The following paragraph depicts the basic constellations of existent GPU VA >> + * mappings, a newly requested mapping and the resulting mappings as implemented >> + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations >> + * of those constellations. >> + * >> + * :: >> + * >> + * 1) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 2) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 a 1 >> + * new: |-----------| (bo_offset=m) >> + * >> + * >> + * 3) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 4) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or bo_offset. >> + * >> + * >> + * 5) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * req: |-----| (bo_offset=n) >> + * >> + * 0 b 1 a' 2 >> + * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 6) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 7) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 b 2 >> + * req: |-----| (bo_offset=m) >> + * >> + * 0 a 1 b 2 >> + * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + * 8) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 a 2 >> + * req: |-----| (bo_offset=n+1) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 9) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 b 3 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 a 1 b 3 >> + * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + * 10) Existent mapping is merged. >> + * ------------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 a 3 >> + * req: |-----------| (bo_offset=n+1) >> + * >> + * 0 a 3 >> + * new: |-----------------| (bo_offset=n) >> + * >> + * >> + * 11) Existent mapping is split. >> + * ------------------------------ >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * 1 b 2 >> + * req: |-----| (bo_offset=m) >> + * >> + * 0 a 1 b 2 a' 3 >> + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) >> + * >> + * >> + * 12) Existent mapping is kept. >> + * ----------------------------- >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * 1 a 2 >> + * req: |-----| (bo_offset=n+1) >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * >> + * 13) Existent mapping is replaced. >> + * --------------------------------- >> + * >> + * 1 a 2 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 14) Existent mapping is replaced. >> + * --------------------------------- >> + * >> + * 1 a 2 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 3 >> + * req: |----------------| (bo_offset=n) >> + * >> + * 0 a 3 >> + * new: |----------------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 15) Existent mapping is split. >> + * ------------------------------ >> + * >> + * 1 a 3 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 2 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 b 2 a' 3 >> + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) >> + * >> + * >> + * 16) Existent mappings are merged. >> + * --------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 2 a 3 >> + * old': |-----------| (bo_offset=n+2) >> + * >> + * 1 a 2 >> + * req: |-----------| (bo_offset=n+1) >> + * >> + * a >> + * new: |----------------------------------| (bo_offset=n) >> + */ >> + >> +/** >> + * DOC: Locking >> + * >> + * Generally, the GPU VA manager does not take care of locking itself, it is >> + * the drivers responsibility to take care about locking. Drivers might want to >> + * protect the following operations: inserting, destroying and iterating >> + * &drm_gpuva and &drm_gpuva_region objects as well as generating split and merge >> + * operations. >> + * >> + * The GPU VA manager does take care of the locking of the backing >> + * &drm_gem_object buffers GPU VA lists though, unless the provided functions >> + * documentation claims otherwise. >> + */ >> + >> +/** >> + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager >> + * @mgr: pointer to the &drm_gpuva_manager to initialize >> + * @name: the name of the GPU VA space >> + * @start_offset: the start offset of the GPU VA space >> + * @range: the size of the GPU VA space >> + * @reserve_offset: the start of the kernel reserved GPU VA area >> + * @reserve_range: the size of the kernel reserved GPU VA area >> + * >> + * The &drm_gpuva_manager must be initialized with this function before use. >> + * >> + * Note that @mgr must be cleared to 0 before calling this function. The given >> + * &name is expected to be managed by the surrounding driver structures. >> + */ >> +void >> +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >> + const char *name, >> + u64 start_offset, u64 range, >> + u64 reserve_offset, u64 reserve_range) >> +{ >> + drm_mm_init(&mgr->va_mm, start_offset, range); >> + drm_mm_init(&mgr->region_mm, start_offset, range); >> + >> + mgr->mm_start = start_offset; >> + mgr->mm_range = range; >> + >> + mgr->name = name ? name : "unknown"; >> + >> + memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); >> + mgr->kernel_alloc_node.start = reserve_offset; >> + mgr->kernel_alloc_node.size = reserve_range; >> + drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_manager_init); >> + >> +/** >> + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager >> + * @mgr: pointer to the &drm_gpuva_manager to clean up >> + * >> + * Note that it is a bug to call this function on a manager that still >> + * holds GPU VA mappings. >> + */ >> +void >> +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) >> +{ >> + mgr->name = NULL; >> + drm_mm_remove_node(&mgr->kernel_alloc_node); >> + drm_mm_takedown(&mgr->va_mm); >> + drm_mm_takedown(&mgr->region_mm); >> +} >> +EXPORT_SYMBOL(drm_gpuva_manager_destroy); >> + >> +static struct drm_gpuva_region * >> +drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >> +{ >> + struct drm_gpuva_region *reg; >> + >> + /* Find the VA region the requested range is strictly enclosed by. */ >> + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { >> + if (reg->node.start <= addr && >> + reg->node.start + reg->node.size >= addr + range && >> + ®->node != &mgr->kernel_alloc_node) >> + return reg; >> + } >> + >> + return NULL; >> +} >> + >> +static bool >> +drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >> +{ >> + return !!drm_gpuva_in_region(mgr, addr, range); >> +} >> + >> +/** >> + * drm_gpuva_insert - insert a &drm_gpuva >> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >> + * @va: the &drm_gpuva to insert >> + * @addr: the start address of the GPU VA >> + * @range: the range of the GPU VA >> + * >> + * Insert a &drm_gpuva with a given address and range into a >> + * &drm_gpuva_manager. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + * >> + * Returns: 0 on success, negative error code on failure. >> + */ >> +int >> +drm_gpuva_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva *va, >> + u64 addr, u64 range) >> +{ >> + struct drm_gpuva_region *reg; >> + int ret; >> + >> + if (!va->gem.obj) >> + return -EINVAL; >> + >> + reg = drm_gpuva_in_region(mgr, addr, range); >> + if (!reg) >> + return -EINVAL; >> + >> + ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, >> + range, 0, >> + 0, addr, >> + addr + range, >> + DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); >> + if (ret) >> + return ret; >> + >> + va->mgr = mgr; >> + va->region = reg; >> + >> + return 0; >> +} >> +EXPORT_SYMBOL(drm_gpuva_insert); >> + >> +/** >> + * drm_gpuva_link_locked - link a &drm_gpuva >> + * @va: the &drm_gpuva to link >> + * >> + * This adds the given &va to the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_link_locked(struct drm_gpuva *va) >> +{ >> + lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> + list_add_tail(&va->head, &va->gem.obj->gpuva.list); >> +} >> +EXPORT_SYMBOL(drm_gpuva_link_locked); >> + >> +/** >> + * drm_gpuva_link_unlocked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This adds the given &va to the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_link_unlocked(struct drm_gpuva *va) >> +{ >> + drm_gem_gpuva_lock(va->gem.obj); >> + drm_gpuva_link_locked(va); >> + drm_gem_gpuva_unlock(va->gem.obj); >> +} >> +EXPORT_SYMBOL(drm_gpuva_link_unlocked); >> + >> +/** >> + * drm_gpuva_unlink_locked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This removes the given &va from the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_unlink_locked(struct drm_gpuva *va) >> +{ >> + lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> + list_del_init(&va->head); >> +} >> +EXPORT_SYMBOL(drm_gpuva_unlink_locked); >> + >> +/** >> + * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This removes the given &va from the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_unlink_unlocked(struct drm_gpuva *va) >> +{ >> + drm_gem_gpuva_lock(va->gem.obj); >> + drm_gpuva_unlink_locked(va); >> + drm_gem_gpuva_unlock(va->gem.obj); >> +} >> +EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); >> + >> +/** >> + * drm_gpuva_destroy_locked - destroy a &drm_gpuva >> + * @va: the &drm_gpuva to destroy >> + * >> + * This removes the given &va from GPU VA list of the &drm_gem_object it is >> + * associated with and removes it from the underlaying range allocator. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_destroy_locked(struct drm_gpuva *va) >> +{ >> + lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> + >> + list_del(&va->head); >> + drm_mm_remove_node(&va->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_destroy_locked); >> + >> +/** >> + * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva >> + * @va: the &drm_gpuva to destroy >> + * >> + * This removes the given &va from GPU VA list of the &drm_gem_object it is >> + * associated with and removes it from the underlaying range allocator. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_destroy_unlocked(struct drm_gpuva *va) >> +{ >> + drm_gem_gpuva_lock(va->gem.obj); >> + list_del(&va->head); >> + drm_gem_gpuva_unlock(va->gem.obj); >> + >> + drm_mm_remove_node(&va->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); >> + >> +/** >> + * drm_gpuva_find - find a &drm_gpuva >> + * @mgr: the &drm_gpuva_manager to search in >> + * @addr: the &drm_gpuvas address >> + * @range: the &drm_gpuvas range >> + * >> + * Returns: the &drm_gpuva at a given &addr and with a given &range >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range) >> +{ >> + struct drm_gpuva *va; >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { >> + if (va->node.start == addr && >> + va->node.size == range) >> + return va; >> + } >> + >> + return NULL; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find); >> + >> +/** >> + * drm_gpuva_find_prev - find the &drm_gpuva before the given address >> + * @mgr: the &drm_gpuva_manager to search in >> + * @start: the given GPU VA's start address >> + * >> + * Find the adjacent &drm_gpuva before the GPU VA with given &start address. >> + * >> + * Note that if there is any free space between the GPU VA mappings no mapping >> + * is returned. >> + * >> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) >> +{ >> + struct drm_mm_node *node; >> + >> + if (start <= mgr->mm_start || >> + start > (mgr->mm_start + mgr->mm_range)) >> + return NULL; >> + >> + node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); >> + if (node == &mgr->va_mm.head_node) >> + return NULL; >> + >> + return (struct drm_gpuva *)node; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find_prev); >> + >> +/** >> + * drm_gpuva_find_next - find the &drm_gpuva after the given address >> + * @mgr: the &drm_gpuva_manager to search in >> + * @end: the given GPU VA's end address >> + * >> + * Find the adjacent &drm_gpuva after the GPU VA with given &end address. >> + * >> + * Note that if there is any free space between the GPU VA mappings no mapping >> + * is returned. >> + * >> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) >> +{ >> + struct drm_mm_node *node; >> + >> + if (end < mgr->mm_start || >> + end >= (mgr->mm_start + mgr->mm_range)) >> + return NULL; >> + >> + node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); >> + if (node == &mgr->va_mm.head_node) >> + return NULL; >> + >> + return (struct drm_gpuva *)node; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find_next); >> + >> +/** >> + * drm_gpuva_region_insert - insert a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >> + * @reg: the &drm_gpuva_region to insert >> + * @addr: the start address of the GPU VA >> + * @range: the range of the GPU VA >> + * >> + * Insert a &drm_gpuva_region with a given address and range into a >> + * &drm_gpuva_manager. >> + * >> + * Returns: 0 on success, negative error code on failure. >> + */ >> +int >> +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg, >> + u64 addr, u64 range) >> +{ >> + int ret; >> + >> + ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, >> + range, 0, >> + 0, addr, >> + addr + range, >> + DRM_MM_INSERT_LOW| >> + DRM_MM_INSERT_ONCE); >> + if (ret) >> + return ret; >> + >> + reg->mgr = mgr; >> + >> + return 0; >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_insert); >> + >> +/** >> + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager holding the region >> + * @reg: the &drm_gpuva to destroy >> + * >> + * This removes the given ® from the underlaying range allocator. >> + */ >> +void >> +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg) >> +{ >> + struct drm_gpuva *va; >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, >> + reg->node.start, >> + reg->node.size) { >> + WARN(1, "GPU VA region must be empty on destroy.\n"); >> + return; >> + } >> + >> + if (®->node == &mgr->kernel_alloc_node) { >> + WARN(1, "Can't destroy kernel reserved region.\n"); >> + return; >> + } >> + >> + drm_mm_remove_node(®->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_destroy); >> + >> +/** >> + * drm_gpuva_region_find - find a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager to search in >> + * @addr: the &drm_gpuva_regions address >> + * @range: the &drm_gpuva_regions range >> + * >> + * Returns: the &drm_gpuva_region at a given &addr and with a given &range >> + */ >> +struct drm_gpuva_region * >> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range) >> +{ >> + struct drm_gpuva_region *reg; >> + >> + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) >> + if (reg->node.start == addr && >> + reg->node.size == range) >> + return reg; >> + >> + return NULL; >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_find); >> + >> +static int >> +gpuva_op_map_new(struct drm_gpuva_op **pop, >> + u64 addr, u64 range, >> + struct drm_gem_object *obj, u64 offset) >> +{ >> + struct drm_gpuva_op *op; >> + >> + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) >> + return -ENOMEM; >> + >> + op->op = DRM_GPUVA_OP_MAP; >> + op->map.va.addr = addr; >> + op->map.va.range = range; >> + op->map.gem.obj = obj; >> + op->map.gem.offset = offset; >> + >> + return 0; >> +} >> + >> +static int >> +gpuva_op_remap_new(struct drm_gpuva_op **pop, >> + struct drm_gpuva_op_map *prev, >> + struct drm_gpuva_op_map *next, >> + struct drm_gpuva_op_unmap *unmap) >> +{ >> + struct drm_gpuva_op *op; >> + struct drm_gpuva_op_remap *r; >> + >> + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) >> + return -ENOMEM; >> + >> + op->op = DRM_GPUVA_OP_REMAP; >> + r = &op->remap; >> + >> + if (prev) { >> + r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); >> + if (!r->prev) >> + goto err_free_op; >> + } >> + >> + if (next) { >> + r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); >> + if (!r->next) >> + goto err_free_prev; >> + } >> + >> + r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); >> + if (!r->unmap) >> + goto err_free_next; >> + >> + return 0; >> + >> +err_free_next: >> + if (next) >> + kfree(r->next); >> +err_free_prev: >> + if (prev) >> + kfree(r->prev); >> +err_free_op: >> + kfree(op); >> + *pop = NULL; >> + >> + return -ENOMEM; >> +} >> + >> +static int >> +gpuva_op_unmap_new(struct drm_gpuva_op **pop, >> + struct drm_gpuva *va, bool merge) >> +{ >> + struct drm_gpuva_op *op; >> + >> + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) >> + return -ENOMEM; >> + >> + op->op = DRM_GPUVA_OP_UNMAP; >> + op->unmap.va = va; >> + op->unmap.keep = merge; >> + >> + return 0; >> +} >> + >> +#define op_map_new_to_list(_ops, _addr, _range, \ >> + _obj, _offset) \ >> +do { \ >> + struct drm_gpuva_op *op; \ >> + \ >> + ret = gpuva_op_map_new(&op, _addr, _range, \ >> + _obj, _offset); \ >> + if (ret) \ >> + goto err_free_ops; \ >> + \ >> + list_add_tail(&op->entry, _ops); \ >> +} while (0) >> + >> +#define op_remap_new_to_list(_ops, _prev, _next, \ >> + _unmap) \ >> +do { \ >> + struct drm_gpuva_op *op; \ >> + \ >> + ret = gpuva_op_remap_new(&op, _prev, _next, \ >> + _unmap); \ >> + if (ret) \ >> + goto err_free_ops; \ >> + \ >> + list_add_tail(&op->entry, _ops); \ >> +} while (0) >> + >> +#define op_unmap_new_to_list(_ops, _gpuva, _merge) \ >> +do { \ >> + struct drm_gpuva_op *op; \ >> + \ >> + ret = gpuva_op_unmap_new(&op, _gpuva, _merge); \ >> + if (ret) \ >> + goto err_free_ops; \ >> + \ >> + list_add_tail(&op->entry, _ops); \ >> +} while (0) >> + >> +/** >> + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split and merge >> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >> + * @req_addr: the start address of the new mapping >> + * @req_range: the range of the new mapping >> + * @req_obj: the &drm_gem_object to map >> + * @req_offset: the offset within the &drm_gem_object >> + * >> + * This function creates a list of operations to perform splitting and merging >> + * of existent mapping(s) with the newly requested one. >> + * >> + * The list can be iterated with &drm_gpuva_for_each_op and must be processed >> + * in the given order. It can contain map, unmap and remap operations, but it >> + * also can be empty if no operation is required, e.g. if the requested mapping >> + * already exists is the exact same way. >> + * >> + * There can be an arbitrary amount of unmap operations, a maximum of two remap >> + * operations and a single map operation. The latter one, if existent, >> + * represents the original map operation requested by the caller. Please note >> + * that the map operation might has been modified, e.g. if it was >> + * merged with an existent mapping. >> + * >> + * Note that before calling this function again with another mapping request it >> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. >> + * The previously obtained operations must be either processed or abandoned. >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * After the caller finished processing the returned &drm_gpuva_ops, they must >> + * be freed with &drm_gpuva_ops_free. >> + * >> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure >> + */ >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >> + u64 req_addr, u64 req_range, >> + struct drm_gem_object *req_obj, u64 req_offset) >> +{ >> + struct drm_gpuva_ops *ops; >> + struct drm_gpuva *va, *prev = NULL; >> + u64 req_end = req_addr + req_range; >> + bool skip_pmerge = false, skip_nmerge = false; >> + int ret; >> + >> + if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) >> + return ERR_PTR(-EINVAL); >> + >> + ops = kzalloc(sizeof(*ops), GFP_KERNEL); >> + if (!ops) >> + return ERR_PTR(-ENOMEM); >> + >> + INIT_LIST_HEAD(&ops->list); >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 end = addr + range; >> + >> + /* Generally, we want to skip merging with potential mappings >> + * left and right of the requested one when we found a >> + * collision, since merging happens in this loop already. >> + * >> + * However, there is one exception when the requested mapping >> + * spans into a free VM area. If this is the case we might >> + * still hit the boundary of another mapping before and/or >> + * after the free VM area. >> + */ >> + skip_pmerge = true; >> + skip_nmerge = true; >> + >> + if (addr == req_addr) { >> + bool merge = obj == req_obj && >> + offset == req_offset; >> + if (end == req_end) { >> + if (merge) >> + goto done; >> + >> + op_unmap_new_to_list(&ops->list, va, false); >> + break; >> + } >> + >> + if (end < req_end) { >> + skip_nmerge = false; >> + op_unmap_new_to_list(&ops->list, va, merge); >> + goto next; >> + } >> + >> + if (end > req_end) { >> + struct drm_gpuva_op_map n = { >> + .va.addr = req_end, >> + .va.range = range - req_range, >> + .gem.obj = obj, >> + .gem.offset = offset + req_range, >> + }; >> + struct drm_gpuva_op_unmap u = { .va = va }; >> + >> + if (merge) >> + goto done; >> + >> + op_remap_new_to_list(&ops->list, NULL, &n, &u); >> + break; >> + } >> + } else if (addr < req_addr) { >> + u64 ls_range = req_addr - addr; >> + struct drm_gpuva_op_map p = { >> + .va.addr = addr, >> + .va.range = ls_range, >> + .gem.obj = obj, >> + .gem.offset = offset, >> + }; >> + struct drm_gpuva_op_unmap u = { .va = va }; >> + bool merge = obj == req_obj && >> + offset + ls_range == req_offset; >> + >> + if (end == req_end) { >> + if (merge) >> + goto done; >> + >> + op_remap_new_to_list(&ops->list, &p, NULL, &u); >> + break; >> + } >> + >> + if (end < req_end) { >> + u64 new_addr = addr; >> + u64 new_range = req_range + ls_range; >> + u64 new_offset = offset; >> + >> + /* We validated that the requested mapping is >> + * within a single VA region already. >> + * Since it overlaps the current mapping (which >> + * can't cross a VA region boundary) we can be >> + * sure that we're still within the boundaries >> + * of the same VA region after merging. >> + */ >> + if (merge) { >> + req_offset = new_offset; >> + req_addr = new_addr; >> + req_range = new_range; >> + op_unmap_new_to_list(&ops->list, va, true); >> + goto next; >> + } >> + >> + op_remap_new_to_list(&ops->list, &p, NULL, &u); >> + goto next; >> + } >> + >> + if (end > req_end) { >> + struct drm_gpuva_op_map n = { >> + .va.addr = req_end, >> + .va.range = end - req_end, >> + .gem.obj = obj, >> + .gem.offset = offset + ls_range + >> + req_range, >> + }; >> + >> + if (merge) >> + goto done; >> + >> + op_remap_new_to_list(&ops->list, &p, &n, &u); >> + break; >> + } >> + } else if (addr > req_addr) { >> + bool merge = obj == req_obj && >> + offset == req_offset + >> + (addr - req_addr); >> + if (!prev) >> + skip_pmerge = false; >> + >> + if (end == req_end) { >> + op_unmap_new_to_list(&ops->list, va, merge); >> + break; >> + } >> + >> + if (end < req_end) { >> + skip_nmerge = false; >> + op_unmap_new_to_list(&ops->list, va, merge); >> + goto next; >> + } >> + >> + if (end > req_end) { >> + struct drm_gpuva_op_map n = { >> + .va.addr = req_end, >> + .va.range = end - req_end, >> + .gem.obj = obj, >> + .gem.offset = offset + req_end - addr, >> + }; >> + struct drm_gpuva_op_unmap u = { .va = va }; >> + u64 new_end = end; >> + u64 new_range = new_end - req_addr; >> + >> + /* We validated that the requested mapping is >> + * within a single VA region already. >> + * Since it overlaps the current mapping (which >> + * can't cross a VA region boundary) we can be >> + * sure that we're still within the boundaries >> + * of the same VA region after merging. >> + */ >> + if (merge) { >> + req_end = new_end; >> + req_range = new_range; >> + op_unmap_new_to_list(&ops->list, va, true); >> + break; >> + } >> + >> + op_remap_new_to_list(&ops->list, NULL, &n, &u); >> + break; >> + } >> + } >> +next: >> + prev = va; >> + } >> + >> + va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); >> + if (va) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 new_offset = offset; >> + u64 new_addr = addr; >> + u64 new_range = req_range + range; >> + bool merge = obj == req_obj && >> + offset + range == req_offset; >> + >> + /* Don't merge over VA region boundaries. */ >> + merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); >> + if (merge) { >> + op_unmap_new_to_list(&ops->list, va, true); >> + >> + req_offset = new_offset; >> + req_addr = new_addr; >> + req_range = new_range; >> + } >> + } >> + >> + va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); >> + if (va) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 end = addr + range; >> + u64 new_range = req_range + range; >> + u64 new_end = end; >> + bool merge = obj == req_obj && >> + offset == req_offset + req_range; >> + >> + /* Don't merge over VA region boundaries. */ >> + merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); >> + if (merge) { >> + op_unmap_new_to_list(&ops->list, va, true); >> + >> + req_range = new_range; >> + req_end = new_end; >> + } >> + } >> + >> + op_map_new_to_list(&ops->list, >> + req_addr, req_range, >> + req_obj, req_offset); >> + >> +done: >> + return ops; >> + >> +err_free_ops: >> + drm_gpuva_ops_free(ops); >> + return ERR_PTR(ret); >> +} >> +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); >> + >> +#undef op_map_new_to_list >> +#undef op_remap_new_to_list >> +#undef op_unmap_new_to_list >> + >> +/** >> + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to split on unmap >> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >> + * @req_addr: the start address of the range to unmap >> + * @req_range: the range of the mappings to unmap >> + * >> + * This function creates a list of operations to perform unmapping and, if >> + * required, splitting of the mappings overlapping the unmap range. >> + * >> + * The list can be iterated with &drm_gpuva_for_each_op and must be processed >> + * in the given order. It can contain unmap and remap operations, depending on >> + * whether there are actual overlapping mappings to split. >> + * >> + * There can be an arbitrary amount of unmap operations and a maximum of two >> + * remap operations. >> + * >> + * Note that before calling this function again with another range to unmap it >> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. >> + * The previously obtained operations must be processed or abandoned. >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * After the caller finished processing the returned &drm_gpuva_ops, they must >> + * be freed with &drm_gpuva_ops_free. >> + * >> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure >> + */ >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >> + u64 req_addr, u64 req_range) >> +{ >> + struct drm_gpuva_ops *ops; >> + struct drm_gpuva_op *op; >> + struct drm_gpuva_op_remap *r; >> + struct drm_gpuva *va; >> + u64 req_end = req_addr + req_range; >> + int ret; >> + >> + ops = kzalloc(sizeof(*ops), GFP_KERNEL); >> + if (!ops) >> + return ERR_PTR(-ENOMEM); >> + >> + INIT_LIST_HEAD(&ops->list); >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 end = addr + range; >> + >> + op = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) { >> + ret = -ENOMEM; >> + goto err_free_ops; >> + } >> + >> + r = &op->remap; >> + >> + if (addr < req_addr) { >> + r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); >> + if (!r->prev) { >> + ret = -ENOMEM; >> + goto err_free_op; >> + } >> + >> + r->prev->va.addr = addr; >> + r->prev->va.range = req_addr - addr; >> + r->prev->gem.obj = obj; >> + r->prev->gem.offset = offset; >> + } >> + >> + if (end > req_end) { >> + r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); >> + if (!r->next) { >> + ret = -ENOMEM; >> + goto err_free_prev; >> + } >> + >> + r->next->va.addr = req_end; >> + r->next->va.range = end - req_end; >> + r->next->gem.obj = obj; >> + r->next->gem.offset = offset + (req_end - addr); >> + } >> + >> + if (op->remap.prev || op->remap.next) { >> + op->op = DRM_GPUVA_OP_REMAP; >> + r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); >> + if (!r->unmap) { >> + ret = -ENOMEM; >> + goto err_free_next; >> + } >> + >> + r->unmap->va = va; >> + } else { >> + op->op = DRM_GPUVA_OP_UNMAP; >> + op->unmap.va = va; >> + } >> + >> + list_add_tail(&op->entry, &ops->list); >> + } >> + >> + return ops; >> + >> +err_free_next: >> + if (r->next) >> + kfree(r->next); >> +err_free_prev: >> + if (r->prev) >> + kfree(r->prev); >> +err_free_op: >> + kfree(op); >> +err_free_ops: >> + drm_gpuva_ops_free(ops); >> + return ERR_PTR(ret); >> +} >> +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); >> + >> +/** >> + * drm_gpuva_ops_free - free the given &drm_gpuva_ops >> + * @ops: the &drm_gpuva_ops to free >> + * >> + * Frees the given &drm_gpuva_ops structure including all the ops associated >> + * with it. >> + */ >> +void >> +drm_gpuva_ops_free(struct drm_gpuva_ops *ops) >> +{ >> + struct drm_gpuva_op *op, *next; >> + >> + drm_gpuva_for_each_op_safe(op, next, ops) { >> + list_del(&op->entry); >> + if (op->op == DRM_GPUVA_OP_REMAP) { >> + if (op->remap.prev) >> + kfree(op->remap.prev); >> + >> + if (op->remap.next) >> + kfree(op->remap.next); >> + >> + kfree(op->remap.unmap); >> + } >> + kfree(op); >> + } >> + >> + kfree(ops); >> +} >> +EXPORT_SYMBOL(drm_gpuva_ops_free); >> diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h >> index d7c521e8860f..6feacd93aca6 100644 >> --- a/include/drm/drm_drv.h >> +++ b/include/drm/drm_drv.h >> @@ -104,6 +104,12 @@ enum drm_driver_feature { >> * acceleration should be handled by two drivers that are connected using auxiliary bus. >> */ >> DRIVER_COMPUTE_ACCEL = BIT(7), >> + /** >> + * @DRIVER_GEM_GPUVA: >> + * >> + * Driver supports user defined GPU VA bindings for GEM objects. >> + */ >> + DRIVER_GEM_GPUVA = BIT(8), >> >> /* IMPORTANT: Below are all the legacy flags, add new ones above. */ >> >> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h >> index 772a4adf5287..4a3679034966 100644 >> --- a/include/drm/drm_gem.h >> +++ b/include/drm/drm_gem.h >> @@ -36,6 +36,8 @@ >> >> #include <linux/kref.h> >> #include <linux/dma-resv.h> >> +#include <linux/list.h> >> +#include <linux/mutex.h> >> >> #include <drm/drm_vma_manager.h> >> >> @@ -337,6 +339,17 @@ struct drm_gem_object { >> */ >> struct dma_resv _resv; >> >> + /** >> + * @gpuva: >> + * >> + * Provides the list and list mutex of GPU VAs attached to this >> + * GEM object. >> + */ >> + struct { >> + struct list_head list; >> + struct mutex mutex; >> + } gpuva; >> + >> /** >> * @funcs: >> * >> @@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru *lru, struct drm_gem_object *obj); >> unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned nr_to_scan, >> bool (*shrink)(struct drm_gem_object *obj)); >> >> +/** >> + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object >> + * @obj: the &drm_gem_object >> + * >> + * This initializes the &drm_gem_object's &drm_gpuva list and the mutex >> + * protecting it. >> + * >> + * Calling this function is only necessary for drivers intending to support the >> + * &drm_driver_feature DRIVER_GEM_GPUVA. >> + */ >> +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) >> +{ >> + INIT_LIST_HEAD(&obj->gpuva.list); >> + mutex_init(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex >> + * @obj: the &drm_gem_object >> + * >> + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. >> + */ >> +static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) >> +{ >> + mutex_lock(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex >> + * @obj: the &drm_gem_object >> + * >> + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. >> + */ >> +static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) >> +{ >> + mutex_unlock(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gem_for_each_gpuva(entry, obj) \ >> + list_for_each_entry(entry, &obj->gpuva.list, head) >> + >> +/** >> + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list of gpuvas >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @next: &next &drm_gpuva to store the next step >> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence >> + * it is save against removal of elements. >> + */ >> +#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ >> + list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) >> + >> #endif /* __DRM_GEM_H__ */ >> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h >> new file mode 100644 >> index 000000000000..adeb0c916e91 >> --- /dev/null >> +++ b/include/drm/drm_gpuva_mgr.h >> @@ -0,0 +1,527 @@ >> +// SPDX-License-Identifier: GPL-2.0 >> + >> +#ifndef __DRM_GPUVA_MGR_H__ >> +#define __DRM_GPUVA_MGR_H__ >> + >> +/* >> + * Copyright (c) 2022 Red Hat. >> + * >> + * Permission is hereby granted, free of charge, to any person obtaining a >> + * copy of this software and associated documentation files (the "Software"), >> + * to deal in the Software without restriction, including without limitation >> + * the rights to use, copy, modify, merge, publish, distribute, sublicense, >> + * and/or sell copies of the Software, and to permit persons to whom the >> + * Software is furnished to do so, subject to the following conditions: >> + * >> + * The above copyright notice and this permission notice shall be included in >> + * all copies or substantial portions of the Software. >> + * >> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR >> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, >> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL >> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR >> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >> + * OTHER DEALINGS IN THE SOFTWARE. >> + */ >> + >> +#include <drm/drm_mm.h> >> +#include <linux/mm.h> >> +#include <linux/rbtree.h> >> +#include <linux/spinlock.h> >> +#include <linux/types.h> >> + >> +struct drm_gpuva_region; >> +struct drm_gpuva; >> +struct drm_gpuva_ops; >> + >> +/** >> + * struct drm_gpuva_manager - DRM GPU VA Manager >> + * >> + * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using >> + * the &drm_mm range allocator. Typically, this structure is embedded in bigger >> + * driver structures. >> + * >> + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or >> + * pages. >> + * >> + * There should be one manager instance per GPU virtual address space. >> + */ >> +struct drm_gpuva_manager { >> + /** >> + * @name: the name of the DRM GPU VA space >> + */ >> + const char *name; >> + >> + /** >> + * @mm_start: start of the VA space >> + */ >> + u64 mm_start; >> + >> + /** >> + * @mm_range: length of the VA space >> + */ >> + u64 mm_range; >> + >> + /** >> + * @region_mm: the &drm_mm range allocator to track GPU VA regions >> + */ >> + struct drm_mm region_mm; >> + >> + /** >> + * @va_mm: the &drm_mm range allocator to track GPU VA mappings >> + */ >> + struct drm_mm va_mm; >> + >> + /** >> + * @kernel_alloc_node: >> + * >> + * &drm_mm_node representing the address space cutout reserved for >> + * the kernel >> + */ >> + struct drm_mm_node kernel_alloc_node; >> +}; >> + >> +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >> + const char *name, >> + u64 start_offset, u64 range, >> + u64 reserve_offset, u64 reserve_range); >> +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); >> + >> +/** >> + * struct drm_gpuva_region - structure to track a portion of GPU VA space >> + * >> + * This structure represents a portion of a GPUs VA space and is associated >> + * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >> + * >> + * GPU VA mappings, represented by &drm_gpuva objects, are restricted to be >> + * placed within a &drm_gpuva_region. >> + */ >> +struct drm_gpuva_region { >> + /** >> + * @node: the &drm_mm_node to track the GPU VA region >> + */ >> + struct drm_mm_node node; >> + >> + /** >> + * @mgr: the &drm_gpuva_manager this object is associated with >> + */ >> + struct drm_gpuva_manager *mgr; >> + >> + /** >> + * @sparse: indicates whether this region is sparse >> + */ >> + bool sparse; >> +}; >> + >> +struct drm_gpuva_region * >> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range); >> +int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg, >> + u64 addr, u64 range); >> +void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg); >> + >> +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva *va, >> + u64 addr, u64 range); >> +/** >> + * drm_gpuva_for_each_region_in_range - iternator to walk over a range of nodes >> + * @node__: &drm_gpuva_region structure to assign to in each iteration step >> + * @gpuva__: &drm_gpuva_manager structure to walk >> + * @start__: starting offset, the first node will overlap this >> + * @end__: ending offset, the last node will start before this (but may overlap) >> + * >> + * This iterator walks over all nodes in the range allocator that lie >> + * between @start and @end. It is implemented similarly to list_for_each(), >> + * but is using &drm_mm's internal interval tree to accelerate the search for >> + * the starting node, and hence isn't safe against removal of elements. It >> + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. >> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >> + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the >> + * backing &drm_mm, and may even continue indefinitely. >> + */ >> +#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, end__) \ >> + for (node__ = (struct drm_gpuva_region *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ >> + (start__), (end__)-1); \ >> + node__->node.start < (end__); \ >> + node__ = (struct drm_gpuva_region *)list_next_entry(&node__->node, node_list)) >> + >> +/** >> + * drm_gpuva_for_each_region - iternator to walk over a range of nodes >> + * @entry: &drm_gpuva_region structure to assign to in each iteration step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva_region structures associated with the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gpuva_for_each_region(entry, gpuva) \ >> + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >> + >> +/** >> + * drm_gpuva_for_each_region_safe - iternator to safely walk over a range of >> + * nodes >> + * @entry: &drm_gpuva_region structure to assign to in each iteration step >> + * @next: &next &drm_gpuva_region to store the next step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva_region structures associated with the >> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save >> + * against removal of elements. >> + */ >> +#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ >> + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >> + >> + >> +/** >> + * enum drm_gpuva_flags - flags for struct drm_gpuva >> + */ >> +enum drm_gpuva_flags { >> + /** >> + * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is swapped >> + */ >> + DRM_GPUVA_SWAPPED = (1 << 0), >> +}; >> + >> +/** >> + * struct drm_gpuva - structure to track a GPU VA mapping >> + * >> + * This structure represents a GPU VA mapping and is associated with a >> + * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >> + * >> + * Typically, this structure is embedded in bigger driver structures. >> + */ >> +struct drm_gpuva { >> + /** >> + * @node: the &drm_mm_node to track the GPU VA mapping >> + */ >> + struct drm_mm_node node; >> + >> + /** >> + * @mgr: the &drm_gpuva_manager this object is associated with >> + */ >> + struct drm_gpuva_manager *mgr; >> + >> + /** >> + * @region: the &drm_gpuva_region the &drm_gpuva is mapped in >> + */ >> + struct drm_gpuva_region *region; >> + >> + /** >> + * @head: the &list_head to attach this object to a &drm_gem_object >> + */ >> + struct list_head head; >> + >> + /** >> + * @flags: the &drm_gpuva_flags for this mapping >> + */ >> + enum drm_gpuva_flags flags; >> + >> + /** >> + * @gem: structure containing the &drm_gem_object and it's offset >> + */ >> + struct { >> + /** >> + * @offset: the offset within the &drm_gem_object >> + */ >> + u64 offset; >> + >> + /** >> + * @obj: the mapped &drm_gem_object >> + */ >> + struct drm_gem_object *obj; >> + } gem; >> +}; >> + >> +void drm_gpuva_link_locked(struct drm_gpuva *va); >> +void drm_gpuva_link_unlocked(struct drm_gpuva *va); >> +void drm_gpuva_unlink_locked(struct drm_gpuva *va); >> +void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); >> + >> +void drm_gpuva_destroy_locked(struct drm_gpuva *va); >> +void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); >> + >> +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range); >> +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start); >> +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end); >> + >> +/** >> + * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva is swapped >> + * @va: the &drm_gpuva to set the swap flag of >> + * @swap: indicates whether the &drm_gpuva is swapped >> + */ >> +static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) >> +{ >> + if (swap) >> + va->flags |= DRM_GPUVA_SWAPPED; >> + else >> + va->flags &= ~DRM_GPUVA_SWAPPED; >> +} >> + >> +/** >> + * drm_gpuva_swapped - indicates whether the backing BO of this &drm_gpuva >> + * is swapped >> + * @va: the &drm_gpuva to check >> + */ >> +static inline bool drm_gpuva_swapped(struct drm_gpuva *va) >> +{ >> + return va->flags & DRM_GPUVA_SWAPPED; >> +} >> + >> +/** >> + * drm_gpuva_for_each_va_in_range - iternator to walk over a range of nodes >> + * @node__: &drm_gpuva structure to assign to in each iteration step >> + * @gpuva__: &drm_gpuva_manager structure to walk >> + * @start__: starting offset, the first node will overlap this >> + * @end__: ending offset, the last node will start before this (but may overlap) >> + * >> + * This iterator walks over all nodes in the range allocator that lie >> + * between @start and @end. It is implemented similarly to list_for_each(), >> + * but is using &drm_mm's internal interval tree to accelerate the search for >> + * the starting node, and hence isn't safe against removal of elements. It >> + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. >> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >> + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the >> + * backing &drm_mm, and may even continue indefinitely. >> + */ >> +#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, end__) \ >> + for (node__ = (struct drm_gpuva *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ >> + (start__), (end__)-1); \ >> + node__->node.start < (end__); \ >> + node__ = (struct drm_gpuva *)list_next_entry(&node__->node, node_list)) >> + >> +/** >> + * drm_gpuva_for_each_va - iternator to walk over a range of nodes >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gpuva_for_each_va(entry, gpuva) \ >> + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >> + >> +/** >> + * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of >> + * nodes >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @next: &next &drm_gpuva to store the next step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save >> + * against removal of elements. >> + */ >> +#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ >> + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >> + >> +/** >> + * enum drm_gpuva_op_type - GPU VA operation type >> + * >> + * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager >> + * can be map, remap or unmap operations. >> + */ >> +enum drm_gpuva_op_type { >> + /** >> + * @DRM_GPUVA_OP_MAP: the map op type >> + */ >> + DRM_GPUVA_OP_MAP, >> + >> + /** >> + * @DRM_GPUVA_OP_REMAP: the remap op type >> + */ >> + DRM_GPUVA_OP_REMAP, >> + >> + /** >> + * @DRM_GPUVA_OP_UNMAP: the unmap op type >> + */ >> + DRM_GPUVA_OP_UNMAP, >> +}; >> + >> +/** >> + * struct drm_gpuva_op_map - GPU VA map operation >> + * >> + * This structure represents a single map operation generated by the >> + * DRM GPU VA manager. >> + */ >> +struct drm_gpuva_op_map { >> + /** >> + * @va: structure containing address and range of a map >> + * operation >> + */ >> + struct { >> + /** >> + * @addr: the base address of the new mapping >> + */ >> + u64 addr; >> + >> + /** >> + * @range: the range of the new mapping >> + */ >> + u64 range; >> + } va; >> + >> + /** >> + * @gem: structure containing the &drm_gem_object and it's offset >> + */ >> + struct { >> + /** >> + * @offset: the offset within the &drm_gem_object >> + */ >> + u64 offset; >> + >> + /** >> + * @obj: the &drm_gem_object to map >> + */ >> + struct drm_gem_object *obj; >> + } gem; >> +}; >> + >> +/** >> + * struct drm_gpuva_op_unmap - GPU VA unmap operation >> + * >> + * This structure represents a single unmap operation generated by the >> + * DRM GPU VA manager. >> + */ >> +struct drm_gpuva_op_unmap { >> + /** >> + * @va: the &drm_gpuva to unmap >> + */ >> + struct drm_gpuva *va; >> + >> + /** >> + * @keep: >> + * >> + * Indicates whether this &drm_gpuva is physically contiguous with the >> + * original mapping request. >> + * >> + * Optionally, if &keep is set, drivers may keep the actual page table >> + * mappings for this &drm_gpuva, adding the missing page table entries >> + * only and update the &drm_gpuva_manager accordingly. >> + */ >> + bool keep; >> +}; >> + >> +/** >> + * struct drm_gpuva_op_remap - GPU VA remap operation >> + * >> + * This represents a single remap operation generated by the DRM GPU VA manager. >> + * >> + * A remap operation is generated when an existing GPU VA mmapping is split up >> + * by inserting a new GPU VA mapping or by partially unmapping existent >> + * mapping(s), hence it consists of a maximum of two map and one unmap >> + * operation. >> + * >> + * The @unmap operation takes care of removing the original existing mapping. >> + * @prev is used to remap the preceding part, @next the subsequent part. >> + * >> + * If either a new mapping's start address is aligned with the start address >> + * of the old mapping or the new mapping's end address is aligned with the >> + * end address of the old mapping, either @prev or @next is NULL. >> + * >> + * Note, the reason for a dedicated remap operation, rather than arbitrary >> + * unmap and map operations, is to give drivers the chance of extracting driver >> + * specific data for creating the new mappings from the unmap operations's >> + * &drm_gpuva structure which typically is embedded in larger driver specific >> + * structures. >> + */ >> +struct drm_gpuva_op_remap { >> + /** >> + * @prev: the preceding part of a split mapping >> + */ >> + struct drm_gpuva_op_map *prev; >> + >> + /** >> + * @next: the subsequent part of a split mapping >> + */ >> + struct drm_gpuva_op_map *next; >> + >> + /** >> + * @unmap: the unmap operation for the original existing mapping >> + */ >> + struct drm_gpuva_op_unmap *unmap; >> +}; >> + >> +/** >> + * struct drm_gpuva_op - GPU VA operation >> + * >> + * This structure represents a single generic operation, which can be either >> + * map, unmap or remap. >> + * >> + * The particular type of the operation is defined by @op. >> + */ >> +struct drm_gpuva_op { >> + /** >> + * @entry: >> + * >> + * The &list_head used to distribute instances of this struct within >> + * &drm_gpuva_ops. >> + */ >> + struct list_head entry; >> + >> + /** >> + * @op: the type of the operation >> + */ >> + enum drm_gpuva_op_type op; >> + >> + union { >> + /** >> + * @map: the map operation >> + */ >> + struct drm_gpuva_op_map map; >> + >> + /** >> + * @unmap: the unmap operation >> + */ >> + struct drm_gpuva_op_unmap unmap; >> + >> + /** >> + * @remap: the remap operation >> + */ >> + struct drm_gpuva_op_remap remap; >> + }; >> +}; >> + >> +/** >> + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op >> + */ >> +struct drm_gpuva_ops { >> + /** >> + * @list: the &list_head >> + */ >> + struct list_head list; >> +}; >> + >> +/** >> + * drm_gpuva_for_each_op - iterator to walk over all ops >> + * @op: &drm_gpuva_op to assign in each iteration step >> + * @ops: &drm_gpuva_ops to walk >> + * >> + * This iterator walks over all ops within a given list of operations. >> + */ >> +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry) >> + >> +/** >> + * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops >> + * @op: &drm_gpuva_op to assign in each iteration step >> + * @next: &next &drm_gpuva_op to store the next step >> + * @ops: &drm_gpuva_ops to walk >> + * >> + * This iterator walks over all ops within a given list of operations. It is >> + * implemented with list_for_each_safe(), so save against removal of elements. >> + */ >> +#define drm_gpuva_for_each_op_safe(op, next, ops) \ >> + list_for_each_entry_safe(op, next, &(ops)->list, entry) >> + >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range, >> + struct drm_gem_object *obj, u64 offset); >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range); >> +void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); >> + >> +#endif /* __DRM_GPUVA_MGR_H__ */ >> -- >> 2.39.0 >>
On 2/6/23 14:35, Christian König wrote: > Am 03.02.23 um 18:37 schrieb Matthew Brost: >> On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: >>> This adds the infrastructure for a manager implementation to keep track >>> of GPU virtual address (VA) mappings. >>> >>> New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers >>> start implementing, allow userspace applications to request multiple and >>> arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is >>> intended to serve the following purposes in this context. >>> >>> 1) Provide a dedicated range allocator to track GPU VA allocations and >>>    mappings, making use of the drm_mm range allocator. >>> >>> 2) Generically connect GPU VA mappings to their backing buffers, in >>>    particular DRM GEM objects. >>> >>> 3) Provide a common implementation to perform more complex mapping >>>    operations on the GPU VA space. In particular splitting and merging >>>    of GPU VA mappings, e.g. for intersecting mapping requests or >>> partial >>>    unmap requests. >>> >> Over the past week I've hacked together a PoC port of Xe to GPUVA [1], so >> far it seems really promising. 95% of the way to being feature >> equivalent of the current Xe VM bind implementation and have line of >> sight to getting sparse bindings implemented on top of GPUVA too. IMO >> this has basically everything we need for Xe with a few tweaks. >> >> I am out until 2/14 but wanted to get my thoughts / suggestions out on >> the list before I leave. >> >> 1. GPUVA post didn't support the way Xe does userptrs - a NULL GEM. I >> believe with [2], [3], and [4] GPUVA will support NULL GEMs. Also my >> thinking sparse binds will also have NULL GEMs, more on sparse bindings >> below. >> >> 2. I agree with Christian that drm_mm probably isn't what we want to >> base the GPUVA implementation on, rather a RB tree or Maple tree has >> been discussed. The implementation should be fairly easy to tune once we >> have benchmarks running so not to concerned here as we can figure this >> out down the line. >> >> 3. In Xe we want create xe_vm_op list which inherits from drm_gpuva_op >> I've done this with a hack [5], I believe when we rebase we can do this >> with a custom callback to allocate a large op size. >> >> 4. I'd like add user bits to drm_gpuva_flags like I do in [6]. This is >> similar to DMA_FENCE_FLAG_USER_BITS. >> >> 5. In Xe we have VM prefetch operation which is needed for our compute >> UMD with page faults. I'd like add prefetch type of operation like we do >> in [7]. >> >> 6. In Xe we have VM unbind all mappings for a GEM IOCTL, I'd like to add >> support to generate this operation list to GPUVA like we do in [8]. >> >> 7. I've thought about how Xe will implement sparse mappings (read 0, >> writes dropped). My current thinking is a sparse mapping will be >> represented as a drm_gpuva rather than region like in Nouveau. Making >> regions optional to me seems likes good idea rather than forcing the >> user of GPUVA code to create 1 large region for the manager as I >> currently do in the Xe PoC. > > From Danilo's explanation I'm now pretty sure that regions won't work > for Nouveau either. > > He seems to use an incorrect assumption about applications not changing > the sparse mappings behind a VkBuffer while the GPU is using it. > > As far as I can see games like Forza won't work with this approach. > I appreciate sharing your concerns since they're seriously helping me to improve things and consider things I missed before. However, I'd prefer to wait for clarification before distributing them to other sub-threads. Depending on whether the understanding of where those concerns arise from turns out to be right or wrong, this might cause confusion for people not following *all* sub-threads. >> >> 8. Personally I'd like the caller to own the locking for GEM drm_gpuva >> list (drm_gpuva_link_*, drm_gpuva_unlink_* functions). In Xe we almost >> certainly will have the GEM dma-resv lock when we touch this list so an >> extra lock here is redundant. Also it kinda goofy that caller owns the >> for drm_gpuva insertion / removal but not the locking for this list. >> >> WRT to Christian thoughts on a common uAPI rules for VM binds, I kinda >> like that idea but I don't think that is necessary. All of pur uAPI >> should be close but also the GPUVA implementation should be flexible >> enough to fit all of our needs and I think for the most part it is. > > Maybe I should refine my concerns: A common component for GPUVM mappings > is a good idea, but we should not expect that to define any driver > independent UAPI. > > If we want to define driver independent UAPI we should do so explicitly. > > Christian. > >> >> Let me know what everything thinks about this. It would be great if when >> I'm back on 2/14 I can rebase the Xe port to GPUVA on another version of >> the GPUVA code and get sparse binding support implementation. Also I'd >> like to get GPUVA merged in the Xe repo ASAP as our VM bind code badly >> needed to be cleaned and this was the push we needed to make this >> happen. >> >> Matt >> >> [1] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314 >> [2] >> https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=2ae21d7a3f52e5eb2c105ed8ae231471274bdc36 >> [3] >> https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=49fca9f5d96201f5cbd1b19c7ff17eedfac65cdc >> [4] >> https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=61fa6b1e1f10e791ae82358fa971b04421d53024 >> [5] >> https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=87fc08dcf0840e794b38269fe4c6a95d088d79ec >> [6] >> https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=a4826c22f6788bc29906ffa263c1cd3c4661fa77 >> [7] >> https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=f008bbb55b213868e52c7b9cda4c1bfb95af6aee >> [8] >> https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=41f4f71c05d04d2b17d988dd95369b5df2d7f681 >> >>> Idea-suggested-by: Dave Airlie <airlied@redhat.com> >>> Signed-off-by: Danilo Krummrich <dakr@redhat.com> >>> --- >>>  Documentation/gpu/drm-mm.rst   |  31 + >>>  drivers/gpu/drm/Makefile       |   1 + >>>  drivers/gpu/drm/drm_gem.c      |   3 + >>>  drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ >>>  include/drm/drm_drv.h          |   6 + >>>  include/drm/drm_gem.h          |  75 ++ >>>  include/drm/drm_gpuva_mgr.h    | 527 ++++++++++++ >>>  7 files changed, 1966 insertions(+) >>>  create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c >>>  create mode 100644 include/drm/drm_gpuva_mgr.h >>> >>> diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst >>> index a52e6f4117d6..c9f120cfe730 100644 >>> --- a/Documentation/gpu/drm-mm.rst >>> +++ b/Documentation/gpu/drm-mm.rst >>> @@ -466,6 +466,37 @@ DRM MM Range Allocator Function References >>>  .. kernel-doc:: drivers/gpu/drm/drm_mm.c >>>     :export: >>> +DRM GPU VA Manager >>> +================== >>> + >>> +Overview >>> +-------- >>> + >>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>> +  :doc: Overview >>> + >>> +Split and Merge >>> +--------------- >>> + >>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>> +  :doc: Split and Merge >>> + >>> +Locking >>> +------- >>> + >>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>> +  :doc: Locking >>> + >>> + >>> +DRM GPU VA Manager Function References >>> +-------------------------------------- >>> + >>> +.. kernel-doc:: include/drm/drm_gpuva_mgr.h >>> +  :internal: >>> + >>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >>> +  :export: >>> + >>>  DRM Buddy Allocator >>>  =================== >>> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile >>> index 4fe190aee584..de2ffca3b6e4 100644 >>> --- a/drivers/gpu/drm/Makefile >>> +++ b/drivers/gpu/drm/Makefile >>> @@ -45,6 +45,7 @@ drm-y := \ >>>      drm_vblank.o \ >>>      drm_vblank_work.o \ >>>      drm_vma_manager.o \ >>> +   drm_gpuva_mgr.o \ >>>      drm_writeback.o >>>  drm-$(CONFIG_DRM_LEGACY) += \ >>>      drm_agpsupport.o \ >>> diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c >>> index 59a0bb5ebd85..65115fe88627 100644 >>> --- a/drivers/gpu/drm/drm_gem.c >>> +++ b/drivers/gpu/drm/drm_gem.c >>> @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct >>> drm_device *dev, >>>      if (!obj->resv) >>>          obj->resv = &obj->_resv; >>> +   if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) >>> +       drm_gem_gpuva_init(obj); >>> + >>>      drm_vma_node_reset(&obj->vma_node); >>>      INIT_LIST_HEAD(&obj->lru_node); >>>  } >>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c >>> b/drivers/gpu/drm/drm_gpuva_mgr.c >>> new file mode 100644 >>> index 000000000000..e665f642689d >>> --- /dev/null >>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c >>> @@ -0,0 +1,1323 @@ >>> +// SPDX-License-Identifier: GPL-2.0 >>> +/* >>> + * Copyright (c) 2022 Red Hat. >>> + * >>> + * Permission is hereby granted, free of charge, to any person >>> obtaining a >>> + * copy of this software and associated documentation files (the >>> "Software"), >>> + * to deal in the Software without restriction, including without >>> limitation >>> + * the rights to use, copy, modify, merge, publish, distribute, >>> sublicense, >>> + * and/or sell copies of the Software, and to permit persons to whom >>> the >>> + * Software is furnished to do so, subject to the following conditions: >>> + * >>> + * The above copyright notice and this permission notice shall be >>> included in >>> + * all copies or substantial portions of the Software. >>> + * >>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, >>> EXPRESS OR >>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF >>> MERCHANTABILITY, >>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO >>> EVENT SHALL >>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, >>> DAMAGES OR >>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR >>> OTHERWISE, >>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE >>> USE OR >>> + * OTHER DEALINGS IN THE SOFTWARE. >>> + * >>> + * Authors: >>> + *    Danilo Krummrich <dakr@redhat.com> >>> + * >>> + */ >>> + >>> +#include <drm/drm_gem.h> >>> +#include <drm/drm_gpuva_mgr.h> >>> + >>> +/** >>> + * DOC: Overview >>> + * >>> + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager >>> keeps track >>> + * of a GPU's virtual address (VA) space and manages the >>> corresponding virtual >>> + * mappings represented by &drm_gpuva objects. It also keeps track >>> of the >>> + * mapping's backing &drm_gem_object buffers. >>> + * >>> + * &drm_gem_object buffers maintain a list (and a corresponding list >>> lock) of >>> + * &drm_gpuva objects representing all existent GPU VA mappings >>> using this >>> + * &drm_gem_object as backing buffer. >>> + * >>> + * A GPU VA mapping can only be created within a previously allocated >>> + * &drm_gpuva_region, which represents a reserved portion of the GPU >>> VA space. >>> + * GPU VA mappings are not allowed to span over a >>> &drm_gpuva_region's boundary. >>> + * >>> + * GPU VA regions can also be flagged as sparse, which allows >>> drivers to create >>> + * sparse mappings for a whole GPU VA region in order to support Vulkan >>> + * 'Sparse Resources'. >>> + * >>> + * The GPU VA manager internally uses the &drm_mm range allocator to >>> manage the >>> + * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's >>> virtual address >>> + * space. >>> + * >>> + * Besides the GPU VA space regions (&drm_gpuva_region) allocated by >>> a driver >>> + * the &drm_gpuva_manager contains a special region representing the >>> portion of >>> + * VA space reserved by the kernel. This node is initialized >>> together with the >>> + * GPU VA manager instance and removed when the GPU VA manager is >>> destroyed. >>> + * >>> + * In a typical application drivers would embed struct >>> drm_gpuva_manager, >>> + * struct drm_gpuva_region and struct drm_gpuva within their own driver >>> + * specific structures, there won't be any memory allocations of >>> it's own nor >>> + * memory allocations of &drm_gpuva or &drm_gpuva_region entries. >>> + */ >>> + >>> +/** >>> + * DOC: Split and Merge >>> + * >>> + * The DRM GPU VA manager also provides an algorithm implementing >>> splitting and >>> + * merging of existent GPU VA mappings with the ones that are >>> requested to be >>> + * mapped or unmapped. This feature is required by the Vulkan API to >>> implement >>> + * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to >>> this as >>> + * VM BIND. >>> + * >>> + * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a list >>> of map, unmap >>> + * and remap operations for a given newly requested mapping. This list >>> + * represents the set of operations to execute in order to integrate >>> the new >>> + * mapping cleanly into the current state of the GPU VA space. >>> + * >>> + * Depending on how the new GPU VA mapping intersects with the >>> existent mappings >>> + * of the GPU VA space the &drm_gpuva_ops contain an arbitrary >>> amount of unmap >>> + * operations, a maximum of two remap operations and a single map >>> operation. >>> + * The set of operations can also be empty if no operation is >>> required, e.g. if >>> + * the requested mapping already exists in the exact same way. >>> + * >>> + * The single map operation, if existent, represents the original >>> map operation >>> + * requested by the caller. Please note that this operation might be >>> altered >>> + * comparing it with the original map operation, e.g. because it was >>> merged with >>> + * an already existent mapping. Hence, drivers must execute this >>> map operation >>> + * instead of the original one they passed to >>> drm_gpuva_sm_map_ops_create(). >>> + * >>> + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates >>> whether the >>> + * &drm_gpuva to unmap is physically contiguous with the original >>> mapping >>> + * request. Optionally, if 'keep' is set, drivers may keep the >>> actual page table >>> + * entries for this &drm_gpuva, adding the missing page table >>> entries only and >>> + * update the &drm_gpuva_manager's view of things accordingly. >>> + * >>> + * Drivers may do the same optimization, namely delta page table >>> updates, also >>> + * for remap operations. This is possible since &drm_gpuva_op_remap >>> consists of >>> + * one unmap operation and one or two map operations, such that >>> drivers can >>> + * derive the page table update delta accordingly. >>> + * >>> + * Note that there can't be more than two existent mappings to split >>> up, one at >>> + * the beginning and one at the end of the new mapping, hence there >>> is a >>> + * maximum of two remap operations. >>> + * >>> + * Generally, the DRM GPU VA manager never merges mappings across the >>> + * boundaries of &drm_gpuva_regions. This is the case since merging >>> between >>> + * GPU VA regions would result into unmap and map operations to be >>> issued for >>> + * both regions involved although the original mapping request was >>> referred to >>> + * one specific GPU VA region only. Since the other GPU VA region, >>> the one not >>> + * explicitly requested to be altered, might be in use by the GPU, >>> we are not >>> + * allowed to issue any map/unmap operations for this region. >>> + * >>> + * Note that before calling drm_gpuva_sm_map_ops_create() again with >>> another >>> + * mapping request it is necessary to update the >>> &drm_gpuva_manager's view of >>> + * the GPU VA space. The previously obtained operations must be >>> either fully >>> + * processed or completely abandoned. >>> + * >>> + * To update the &drm_gpuva_manager's view of the GPU VA space >>> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >>> + * drm_gpuva_destroy_unlocked() should be used. >>> + * >>> + * Analogue to drm_gpuva_sm_map_ops_create() >>> drm_gpuva_sm_unmap_ops_create() >>> + * provides drivers a the list of operations to be executed in order >>> to unmap >>> + * a range of GPU VA space. The logic behind this functions is way >>> simpler >>> + * though: For all existent mappings enclosed by the given range unmap >>> + * operations are created. For mappings which are only partically >>> located within >>> + * the given range, remap operations are created such that those >>> mappings are >>> + * split up and re-mapped partically. >>> + * >>> + * The following paragraph depicts the basic constellations of >>> existent GPU VA >>> + * mappings, a newly requested mapping and the resulting mappings as >>> implemented >>> + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary >>> combinations >>> + * of those constellations. >>> + * >>> + * :: >>> + * >>> + *   1) Existent mapping is kept. >>> + *   ---------------------------- >>> + * >>> + *        0    a    1 >>> + *   old: |-----------| (bo_offset=n) >>> + * >>> + *        0    a    1 >>> + *   req: |-----------| (bo_offset=n) >>> + * >>> + *        0    a    1 >>> + *   new: |-----------| (bo_offset=n) >>> + * >>> + * >>> + *   2) Existent mapping is replaced. >>> + *   -------------------------------- >>> + * >>> + *        0    a    1 >>> + *   old: |-----------| (bo_offset=n) >>> + * >>> + *        0    a    1 >>> + *   req: |-----------| (bo_offset=m) >>> + * >>> + *        0    a    1 >>> + *   new: |-----------| (bo_offset=m) >>> + * >>> + * >>> + *   3) Existent mapping is replaced. >>> + *   -------------------------------- >>> + * >>> + *        0    a    1 >>> + *   old: |-----------| (bo_offset=n) >>> + * >>> + *        0    b    1 >>> + *   req: |-----------| (bo_offset=n) >>> + * >>> + *        0    b    1 >>> + *   new: |-----------| (bo_offset=n) >>> + * >>> + * >>> + *   4) Existent mapping is replaced. >>> + *   -------------------------------- >>> + * >>> + *        0 a 1 >>> + *   old: |-----|      (bo_offset=n) >>> + * >>> + *        0    a    2 >>> + *   req: |-----------| (bo_offset=n) >>> + * >>> + *        0    a    2 >>> + *   new: |-----------| (bo_offset=n) >>> + * >>> + *   Note: We expect to see the same result for a request with a >>> different bo >>> + *         and/or bo_offset. >>> + * >>> + * >>> + *   5) Existent mapping is split. >>> + *   ----------------------------- >>> + * >>> + *        0    a    2 >>> + *   old: |-----------| (bo_offset=n) >>> + * >>> + *        0 b 1 >>> + *   req: |-----|      (bo_offset=n) >>> + * >>> + *        0 b 1 a' 2 >>> + *   new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) >>> + * >>> + *   Note: We expect to see the same result for a request with a >>> different bo >>> + *         and/or non-contiguous bo_offset. >>> + * >>> + * >>> + *   6) Existent mapping is kept. >>> + *   ---------------------------- >>> + * >>> + *        0    a    2 >>> + *   old: |-----------| (bo_offset=n) >>> + * >>> + *        0 a 1 >>> + *   req: |-----|      (bo_offset=n) >>> + * >>> + *        0    a    2 >>> + *   new: |-----------| (bo_offset=n) >>> + * >>> + * >>> + *   7) Existent mapping is split. >>> + *   ----------------------------- >>> + * >>> + *        0    a    2 >>> + *   old: |-----------| (bo_offset=n) >>> + * >>> + *              1 b 2 >>> + *   req:      |-----| (bo_offset=m) >>> + * >>> + *        0 a 1 b 2 >>> + *   new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) >>> + * >>> + * >>> + *   8) Existent mapping is kept. >>> + *   ---------------------------- >>> + * >>> + *         0    a    2 >>> + *   old: |-----------| (bo_offset=n) >>> + * >>> + *              1 a 2 >>> + *   req:      |-----| (bo_offset=n+1) >>> + * >>> + *        0    a    2 >>> + *   new: |-----------| (bo_offset=n) >>> + * >>> + * >>> + *   9) Existent mapping is split. >>> + *   ----------------------------- >>> + * >>> + *        0    a    2 >>> + *   old: |-----------|      (bo_offset=n) >>> + * >>> + *              1    b    3 >>> + *   req:      |-----------| (bo_offset=m) >>> + * >>> + *        0 a 1    b    3 >>> + *   new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) >>> + * >>> + * >>> + *   10) Existent mapping is merged. >>> + *   ------------------------------- >>> + * >>> + *        0    a    2 >>> + *   old: |-----------|      (bo_offset=n) >>> + * >>> + *              1    a    3 >>> + *   req:      |-----------| (bo_offset=n+1) >>> + * >>> + *        0       a       3 >>> + *   new: |-----------------| (bo_offset=n) >>> + * >>> + * >>> + *   11) Existent mapping is split. >>> + *   ------------------------------ >>> + * >>> + *        0       a       3 >>> + *   old: |-----------------| (bo_offset=n) >>> + * >>> + *              1 b 2 >>> + *   req:      |-----|      (bo_offset=m) >>> + * >>> + *        0 a 1 b 2 a' 3 >>> + *   new: |-----|-----|-----| >>> (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) >>> + * >>> + * >>> + *   12) Existent mapping is kept. >>> + *   ----------------------------- >>> + * >>> + *        0       a       3 >>> + *   old: |-----------------| (bo_offset=n) >>> + * >>> + *              1 a 2 >>> + *   req:      |-----|      (bo_offset=n+1) >>> + * >>> + *        0       a       3 >>> + *   old: |-----------------| (bo_offset=n) >>> + * >>> + * >>> + *   13) Existent mapping is replaced. >>> + *   --------------------------------- >>> + * >>> + *              1 a 2 >>> + *   old:      |-----| (bo_offset=n) >>> + * >>> + *        0    a    2 >>> + *   req: |-----------| (bo_offset=n) >>> + * >>> + *        0    a    2 >>> + *   new: |-----------| (bo_offset=n) >>> + * >>> + *   Note: We expect to see the same result for a request with a >>> different bo >>> + *         and/or non-contiguous bo_offset. >>> + * >>> + * >>> + *   14) Existent mapping is replaced. >>> + *   --------------------------------- >>> + * >>> + *              1 a 2 >>> + *   old:      |-----| (bo_offset=n) >>> + * >>> + *        0       a      3 >>> + *   req: |----------------| (bo_offset=n) >>> + * >>> + *        0       a      3 >>> + *   new: |----------------| (bo_offset=n) >>> + * >>> + *   Note: We expect to see the same result for a request with a >>> different bo >>> + *         and/or non-contiguous bo_offset. >>> + * >>> + * >>> + *   15) Existent mapping is split. >>> + *   ------------------------------ >>> + * >>> + *              1    a    3 >>> + *   old:      |-----------| (bo_offset=n) >>> + * >>> + *        0    b    2 >>> + *   req: |-----------|      (bo_offset=m) >>> + * >>> + *        0    b    2 a' 3 >>> + *   new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) >>> + * >>> + * >>> + *   16) Existent mappings are merged. >>> + *   --------------------------------- >>> + * >>> + *        0    a    1 >>> + *   old: |-----------|                       (bo_offset=n) >>> + * >>> + *                               2    a    3 >>> + *   old':                      |-----------| (bo_offset=n+2) >>> + * >>> + *                   1    a    2 >>> + *   req:           |-----------|            (bo_offset=n+1) >>> + * >>> + *                         a >>> + *   new: |----------------------------------| (bo_offset=n) >>> + */ >>> + >>> +/** >>> + * DOC: Locking >>> + * >>> + * Generally, the GPU VA manager does not take care of locking >>> itself, it is >>> + * the drivers responsibility to take care about locking. Drivers >>> might want to >>> + * protect the following operations: inserting, destroying and >>> iterating >>> + * &drm_gpuva and &drm_gpuva_region objects as well as generating >>> split and merge >>> + * operations. >>> + * >>> + * The GPU VA manager does take care of the locking of the backing >>> + * &drm_gem_object buffers GPU VA lists though, unless the provided >>> functions >>> + * documentation claims otherwise. >>> + */ >>> + >>> +/** >>> + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager >>> + * @mgr: pointer to the &drm_gpuva_manager to initialize >>> + * @name: the name of the GPU VA space >>> + * @start_offset: the start offset of the GPU VA space >>> + * @range: the size of the GPU VA space >>> + * @reserve_offset: the start of the kernel reserved GPU VA area >>> + * @reserve_range: the size of the kernel reserved GPU VA area >>> + * >>> + * The &drm_gpuva_manager must be initialized with this function >>> before use. >>> + * >>> + * Note that @mgr must be cleared to 0 before calling this function. >>> The given >>> + * &name is expected to be managed by the surrounding driver >>> structures. >>> + */ >>> +void >>> +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >>> +              const char *name, >>> +              u64 start_offset, u64 range, >>> +              u64 reserve_offset, u64 reserve_range) >>> +{ >>> +   drm_mm_init(&mgr->va_mm, start_offset, range); >>> +   drm_mm_init(&mgr->region_mm, start_offset, range); >>> + >>> +   mgr->mm_start = start_offset; >>> +   mgr->mm_range = range; >>> + >>> +   mgr->name = name ? name : "unknown"; >>> + >>> +   memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); >>> +   mgr->kernel_alloc_node.start = reserve_offset; >>> +   mgr->kernel_alloc_node.size = reserve_range; >>> +   drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_manager_init); >>> + >>> +/** >>> + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager >>> + * @mgr: pointer to the &drm_gpuva_manager to clean up >>> + * >>> + * Note that it is a bug to call this function on a manager that still >>> + * holds GPU VA mappings. >>> + */ >>> +void >>> +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) >>> +{ >>> +   mgr->name = NULL; >>> +   drm_mm_remove_node(&mgr->kernel_alloc_node); >>> +   drm_mm_takedown(&mgr->va_mm); >>> +   drm_mm_takedown(&mgr->region_mm); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_manager_destroy); >>> + >>> +static struct drm_gpuva_region * >>> +drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >>> +{ >>> +   struct drm_gpuva_region *reg; >>> + >>> +   /* Find the VA region the requested range is strictly enclosed >>> by. */ >>> +   drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { >>> +       if (reg->node.start <= addr && >>> +           reg->node.start + reg->node.size >= addr + range && >>> +           ®->node != &mgr->kernel_alloc_node) >>> +           return reg; >>> +   } >>> + >>> +   return NULL; >>> +} >>> + >>> +static bool >>> +drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, u64 >>> range) >>> +{ >>> +   return !!drm_gpuva_in_region(mgr, addr, range); >>> +} >>> + >>> +/** >>> + * drm_gpuva_insert - insert a &drm_gpuva >>> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >>> + * @va: the &drm_gpuva to insert >>> + * @addr: the start address of the GPU VA >>> + * @range: the range of the GPU VA >>> + * >>> + * Insert a &drm_gpuva with a given address and range into a >>> + * &drm_gpuva_manager. >>> + * >>> + * The function assumes the caller does not hold the &drm_gem_object's >>> + * GPU VA list mutex. >>> + * >>> + * Returns: 0 on success, negative error code on failure. >>> + */ >>> +int >>> +drm_gpuva_insert(struct drm_gpuva_manager *mgr, >>> +        struct drm_gpuva *va, >>> +        u64 addr, u64 range) >>> +{ >>> +   struct drm_gpuva_region *reg; >>> +   int ret; >>> + >>> +   if (!va->gem.obj) >>> +       return -EINVAL; >>> + >>> +   reg = drm_gpuva_in_region(mgr, addr, range); >>> +   if (!reg) >>> +       return -EINVAL; >>> + >>> +   ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, >>> +                     range, 0, >>> +                     0, addr, >>> +                     addr + range, >>> +                     DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); >>> +   if (ret) >>> +       return ret; >>> + >>> +   va->mgr = mgr; >>> +   va->region = reg; >>> + >>> +   return 0; >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_insert); >>> + >>> +/** >>> + * drm_gpuva_link_locked - link a &drm_gpuva >>> + * @va: the &drm_gpuva to link >>> + * >>> + * This adds the given &va to the GPU VA list of the &drm_gem_object >>> it is >>> + * associated with. >>> + * >>> + * The function assumes the caller already holds the &drm_gem_object's >>> + * GPU VA list mutex. >>> + */ >>> +void >>> +drm_gpuva_link_locked(struct drm_gpuva *va) >>> +{ >>> +   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >>> +   list_add_tail(&va->head, &va->gem.obj->gpuva.list); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_link_locked); >>> + >>> +/** >>> + * drm_gpuva_link_unlocked - unlink a &drm_gpuva >>> + * @va: the &drm_gpuva to unlink >>> + * >>> + * This adds the given &va to the GPU VA list of the &drm_gem_object >>> it is >>> + * associated with. >>> + * >>> + * The function assumes the caller does not hold the &drm_gem_object's >>> + * GPU VA list mutex. >>> + */ >>> +void >>> +drm_gpuva_link_unlocked(struct drm_gpuva *va) >>> +{ >>> +   drm_gem_gpuva_lock(va->gem.obj); >>> +   drm_gpuva_link_locked(va); >>> +   drm_gem_gpuva_unlock(va->gem.obj); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_link_unlocked); >>> + >>> +/** >>> + * drm_gpuva_unlink_locked - unlink a &drm_gpuva >>> + * @va: the &drm_gpuva to unlink >>> + * >>> + * This removes the given &va from the GPU VA list of the >>> &drm_gem_object it is >>> + * associated with. >>> + * >>> + * The function assumes the caller already holds the &drm_gem_object's >>> + * GPU VA list mutex. >>> + */ >>> +void >>> +drm_gpuva_unlink_locked(struct drm_gpuva *va) >>> +{ >>> +   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >>> +   list_del_init(&va->head); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_unlink_locked); >>> + >>> +/** >>> + * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva >>> + * @va: the &drm_gpuva to unlink >>> + * >>> + * This removes the given &va from the GPU VA list of the >>> &drm_gem_object it is >>> + * associated with. >>> + * >>> + * The function assumes the caller does not hold the &drm_gem_object's >>> + * GPU VA list mutex. >>> + */ >>> +void >>> +drm_gpuva_unlink_unlocked(struct drm_gpuva *va) >>> +{ >>> +   drm_gem_gpuva_lock(va->gem.obj); >>> +   drm_gpuva_unlink_locked(va); >>> +   drm_gem_gpuva_unlock(va->gem.obj); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); >>> + >>> +/** >>> + * drm_gpuva_destroy_locked - destroy a &drm_gpuva >>> + * @va: the &drm_gpuva to destroy >>> + * >>> + * This removes the given &va from GPU VA list of the >>> &drm_gem_object it is >>> + * associated with and removes it from the underlaying range allocator. >>> + * >>> + * The function assumes the caller already holds the &drm_gem_object's >>> + * GPU VA list mutex. >>> + */ >>> +void >>> +drm_gpuva_destroy_locked(struct drm_gpuva *va) >>> +{ >>> +   lockdep_assert_held(&va->gem.obj->gpuva.mutex); >>> + >>> +   list_del(&va->head); >>> +   drm_mm_remove_node(&va->node); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_destroy_locked); >>> + >>> +/** >>> + * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva >>> + * @va: the &drm_gpuva to destroy >>> + * >>> + * This removes the given &va from GPU VA list of the >>> &drm_gem_object it is >>> + * associated with and removes it from the underlaying range allocator. >>> + * >>> + * The function assumes the caller does not hold the &drm_gem_object's >>> + * GPU VA list mutex. >>> + */ >>> +void >>> +drm_gpuva_destroy_unlocked(struct drm_gpuva *va) >>> +{ >>> +   drm_gem_gpuva_lock(va->gem.obj); >>> +   list_del(&va->head); >>> +   drm_gem_gpuva_unlock(va->gem.obj); >>> + >>> +   drm_mm_remove_node(&va->node); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); >>> + >>> +/** >>> + * drm_gpuva_find - find a &drm_gpuva >>> + * @mgr: the &drm_gpuva_manager to search in >>> + * @addr: the &drm_gpuvas address >>> + * @range: the &drm_gpuvas range >>> + * >>> + * Returns: the &drm_gpuva at a given &addr and with a given &range >>> + */ >>> +struct drm_gpuva * >>> +drm_gpuva_find(struct drm_gpuva_manager *mgr, >>> +          u64 addr, u64 range) >>> +{ >>> +   struct drm_gpuva *va; >>> + >>> +   drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { >>> +       if (va->node.start == addr && >>> +           va->node.size == range) >>> +           return va; >>> +   } >>> + >>> +   return NULL; >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_find); >>> + >>> +/** >>> + * drm_gpuva_find_prev - find the &drm_gpuva before the given address >>> + * @mgr: the &drm_gpuva_manager to search in >>> + * @start: the given GPU VA's start address >>> + * >>> + * Find the adjacent &drm_gpuva before the GPU VA with given &start >>> address. >>> + * >>> + * Note that if there is any free space between the GPU VA mappings >>> no mapping >>> + * is returned. >>> + * >>> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >>> + */ >>> +struct drm_gpuva * >>> +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) >>> +{ >>> +   struct drm_mm_node *node; >>> + >>> +   if (start <= mgr->mm_start || >>> +       start > (mgr->mm_start + mgr->mm_range)) >>> +       return NULL; >>> + >>> +   node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); >>> +   if (node == &mgr->va_mm.head_node) >>> +       return NULL; >>> + >>> +   return (struct drm_gpuva *)node; >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_find_prev); >>> + >>> +/** >>> + * drm_gpuva_find_next - find the &drm_gpuva after the given address >>> + * @mgr: the &drm_gpuva_manager to search in >>> + * @end: the given GPU VA's end address >>> + * >>> + * Find the adjacent &drm_gpuva after the GPU VA with given &end >>> address. >>> + * >>> + * Note that if there is any free space between the GPU VA mappings >>> no mapping >>> + * is returned. >>> + * >>> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >>> + */ >>> +struct drm_gpuva * >>> +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) >>> +{ >>> +   struct drm_mm_node *node; >>> + >>> +   if (end < mgr->mm_start || >>> +       end >= (mgr->mm_start + mgr->mm_range)) >>> +       return NULL; >>> + >>> +   node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); >>> +   if (node == &mgr->va_mm.head_node) >>> +       return NULL; >>> + >>> +   return (struct drm_gpuva *)node; >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_find_next); >>> + >>> +/** >>> + * drm_gpuva_region_insert - insert a &drm_gpuva_region >>> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >>> + * @reg: the &drm_gpuva_region to insert >>> + * @addr: the start address of the GPU VA >>> + * @range: the range of the GPU VA >>> + * >>> + * Insert a &drm_gpuva_region with a given address and range into a >>> + * &drm_gpuva_manager. >>> + * >>> + * Returns: 0 on success, negative error code on failure. >>> + */ >>> +int >>> +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >>> +           struct drm_gpuva_region *reg, >>> +           u64 addr, u64 range) >>> +{ >>> +   int ret; >>> + >>> +   ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, >>> +                     range, 0, >>> +                     0, addr, >>> +                     addr + range, >>> +                     DRM_MM_INSERT_LOW| >>> +                     DRM_MM_INSERT_ONCE); >>> +   if (ret) >>> +       return ret; >>> + >>> +   reg->mgr = mgr; >>> + >>> +   return 0; >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_region_insert); >>> + >>> +/** >>> + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region >>> + * @mgr: the &drm_gpuva_manager holding the region >>> + * @reg: the &drm_gpuva to destroy >>> + * >>> + * This removes the given ® from the underlaying range allocator. >>> + */ >>> +void >>> +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >>> +            struct drm_gpuva_region *reg) >>> +{ >>> +   struct drm_gpuva *va; >>> + >>> +   drm_gpuva_for_each_va_in_range(va, mgr, >>> +                      reg->node.start, >>> +                      reg->node.size) { >>> +       WARN(1, "GPU VA region must be empty on destroy.\n"); >>> +       return; >>> +   } >>> + >>> +   if (®->node == &mgr->kernel_alloc_node) { >>> +       WARN(1, "Can't destroy kernel reserved region.\n"); >>> +       return; >>> +   } >>> + >>> +   drm_mm_remove_node(®->node); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_region_destroy); >>> + >>> +/** >>> + * drm_gpuva_region_find - find a &drm_gpuva_region >>> + * @mgr: the &drm_gpuva_manager to search in >>> + * @addr: the &drm_gpuva_regions address >>> + * @range: the &drm_gpuva_regions range >>> + * >>> + * Returns: the &drm_gpuva_region at a given &addr and with a given >>> &range >>> + */ >>> +struct drm_gpuva_region * >>> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >>> +             u64 addr, u64 range) >>> +{ >>> +   struct drm_gpuva_region *reg; >>> + >>> +   drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) >>> +       if (reg->node.start == addr && >>> +           reg->node.size == range) >>> +           return reg; >>> + >>> +   return NULL; >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_region_find); >>> + >>> +static int >>> +gpuva_op_map_new(struct drm_gpuva_op **pop, >>> +        u64 addr, u64 range, >>> +        struct drm_gem_object *obj, u64 offset) >>> +{ >>> +   struct drm_gpuva_op *op; >>> + >>> +   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >>> +   if (!op) >>> +       return -ENOMEM; >>> + >>> +   op->op = DRM_GPUVA_OP_MAP; >>> +   op->map.va.addr = addr; >>> +   op->map.va.range = range; >>> +   op->map.gem.obj = obj; >>> +   op->map.gem.offset = offset; >>> + >>> +   return 0; >>> +} >>> + >>> +static int >>> +gpuva_op_remap_new(struct drm_gpuva_op **pop, >>> +          struct drm_gpuva_op_map *prev, >>> +          struct drm_gpuva_op_map *next, >>> +          struct drm_gpuva_op_unmap *unmap) >>> +{ >>> +   struct drm_gpuva_op *op; >>> +   struct drm_gpuva_op_remap *r; >>> + >>> +   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >>> +   if (!op) >>> +       return -ENOMEM; >>> + >>> +   op->op = DRM_GPUVA_OP_REMAP; >>> +   r = &op->remap; >>> + >>> +   if (prev) { >>> +       r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); >>> +       if (!r->prev) >>> +           goto err_free_op; >>> +   } >>> + >>> +   if (next) { >>> +       r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); >>> +       if (!r->next) >>> +           goto err_free_prev; >>> +   } >>> + >>> +   r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); >>> +   if (!r->unmap) >>> +       goto err_free_next; >>> + >>> +   return 0; >>> + >>> +err_free_next: >>> +   if (next) >>> +       kfree(r->next); >>> +err_free_prev: >>> +   if (prev) >>> +       kfree(r->prev); >>> +err_free_op: >>> +   kfree(op); >>> +   *pop = NULL; >>> + >>> +   return -ENOMEM; >>> +} >>> + >>> +static int >>> +gpuva_op_unmap_new(struct drm_gpuva_op **pop, >>> +          struct drm_gpuva *va, bool merge) >>> +{ >>> +   struct drm_gpuva_op *op; >>> + >>> +   op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >>> +   if (!op) >>> +       return -ENOMEM; >>> + >>> +   op->op = DRM_GPUVA_OP_UNMAP; >>> +   op->unmap.va = va; >>> +   op->unmap.keep = merge; >>> + >>> +   return 0; >>> +} >>> + >>> +#define op_map_new_to_list(_ops, _addr, _range,       \ >>> +              _obj, _offset)       \ >>> +do {                           \ >>> +   struct drm_gpuva_op *op;           \ >>> +                           \ >>> +   ret = gpuva_op_map_new(&op, _addr, _range,   \ >>> +                  _obj, _offset);       \ >>> +   if (ret)                   \ >>> +       goto err_free_ops;           \ >>> +                           \ >>> +   list_add_tail(&op->entry, _ops);       \ >>> +} while (0) >>> + >>> +#define op_remap_new_to_list(_ops, _prev, _next,   \ >>> +                _unmap)           \ >>> +do {                           \ >>> +   struct drm_gpuva_op *op;           \ >>> +                           \ >>> +   ret = gpuva_op_remap_new(&op, _prev, _next,   \ >>> +                _unmap);       \ >>> +   if (ret)                   \ >>> +       goto err_free_ops;           \ >>> +                           \ >>> +   list_add_tail(&op->entry, _ops);       \ >>> +} while (0) >>> + >>> +#define op_unmap_new_to_list(_ops, _gpuva, _merge)   \ >>> +do {                           \ >>> +   struct drm_gpuva_op *op;           \ >>> +                           \ >>> +   ret = gpuva_op_unmap_new(&op, _gpuva, _merge);   \ >>> +   if (ret)                   \ >>> +       goto err_free_ops;           \ >>> +                           \ >>> +   list_add_tail(&op->entry, _ops);       \ >>> +} while (0) >>> + >>> +/** >>> + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split >>> and merge >>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >>> + * @req_addr: the start address of the new mapping >>> + * @req_range: the range of the new mapping >>> + * @req_obj: the &drm_gem_object to map >>> + * @req_offset: the offset within the &drm_gem_object >>> + * >>> + * This function creates a list of operations to perform splitting >>> and merging >>> + * of existent mapping(s) with the newly requested one. >>> + * >>> + * The list can be iterated with &drm_gpuva_for_each_op and must be >>> processed >>> + * in the given order. It can contain map, unmap and remap >>> operations, but it >>> + * also can be empty if no operation is required, e.g. if the >>> requested mapping >>> + * already exists is the exact same way. >>> + * >>> + * There can be an arbitrary amount of unmap operations, a maximum >>> of two remap >>> + * operations and a single map operation. The latter one, if existent, >>> + * represents the original map operation requested by the caller. >>> Please note >>> + * that the map operation might has been modified, e.g. if it was >>> + * merged with an existent mapping. >>> + * >>> + * Note that before calling this function again with another mapping >>> request it >>> + * is necessary to update the &drm_gpuva_manager's view of the GPU >>> VA space. >>> + * The previously obtained operations must be either processed or >>> abandoned. >>> + * To update the &drm_gpuva_manager's view of the GPU VA space >>> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >>> + * drm_gpuva_destroy_unlocked() should be used. >>> + * >>> + * After the caller finished processing the returned &drm_gpuva_ops, >>> they must >>> + * be freed with &drm_gpuva_ops_free. >>> + * >>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR >>> on failure >>> + */ >>> +struct drm_gpuva_ops * >>> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >>> +               u64 req_addr, u64 req_range, >>> +               struct drm_gem_object *req_obj, u64 req_offset) >>> +{ >>> +   struct drm_gpuva_ops *ops; >>> +   struct drm_gpuva *va, *prev = NULL; >>> +   u64 req_end = req_addr + req_range; >>> +   bool skip_pmerge = false, skip_nmerge = false; >>> +   int ret; >>> + >>> +   if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) >>> +       return ERR_PTR(-EINVAL); >>> + >>> +   ops = kzalloc(sizeof(*ops), GFP_KERNEL); >>> +   if (!ops) >>> +       return ERR_PTR(-ENOMEM); >>> + >>> +   INIT_LIST_HEAD(&ops->list); >>> + >>> +   drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >>> +       struct drm_gem_object *obj = va->gem.obj; >>> +       u64 offset = va->gem.offset; >>> +       u64 addr = va->node.start; >>> +       u64 range = va->node.size; >>> +       u64 end = addr + range; >>> + >>> +       /* Generally, we want to skip merging with potential mappings >>> +        * left and right of the requested one when we found a >>> +        * collision, since merging happens in this loop already. >>> +        * >>> +        * However, there is one exception when the requested mapping >>> +        * spans into a free VM area. If this is the case we might >>> +        * still hit the boundary of another mapping before and/or >>> +        * after the free VM area. >>> +        */ >>> +       skip_pmerge = true; >>> +       skip_nmerge = true; >>> + >>> +       if (addr == req_addr) { >>> +           bool merge = obj == req_obj && >>> +                    offset == req_offset; >>> +           if (end == req_end) { >>> +               if (merge) >>> +                   goto done; >>> + >>> +               op_unmap_new_to_list(&ops->list, va, false); >>> +               break; >>> +           } >>> + >>> +           if (end < req_end) { >>> +               skip_nmerge = false; >>> +               op_unmap_new_to_list(&ops->list, va, merge); >>> +               goto next; >>> +           } >>> + >>> +           if (end > req_end) { >>> +               struct drm_gpuva_op_map n = { >>> +                   .va.addr = req_end, >>> +                   .va.range = range - req_range, >>> +                   .gem.obj = obj, >>> +                   .gem.offset = offset + req_range, >>> +               }; >>> +               struct drm_gpuva_op_unmap u = { .va = va }; >>> + >>> +               if (merge) >>> +                   goto done; >>> + >>> +               op_remap_new_to_list(&ops->list, NULL, &n, &u); >>> +               break; >>> +           } >>> +       } else if (addr < req_addr) { >>> +           u64 ls_range = req_addr - addr; >>> +           struct drm_gpuva_op_map p = { >>> +               .va.addr = addr, >>> +               .va.range = ls_range, >>> +               .gem.obj = obj, >>> +               .gem.offset = offset, >>> +           }; >>> +           struct drm_gpuva_op_unmap u = { .va = va }; >>> +           bool merge = obj == req_obj && >>> +                    offset + ls_range == req_offset; >>> + >>> +           if (end == req_end) { >>> +               if (merge) >>> +                   goto done; >>> + >>> +               op_remap_new_to_list(&ops->list, &p, NULL, &u); >>> +               break; >>> +           } >>> + >>> +           if (end < req_end) { >>> +               u64 new_addr = addr; >>> +               u64 new_range = req_range + ls_range; >>> +               u64 new_offset = offset; >>> + >>> +               /* We validated that the requested mapping is >>> +                * within a single VA region already. >>> +                * Since it overlaps the current mapping (which >>> +                * can't cross a VA region boundary) we can be >>> +                * sure that we're still within the boundaries >>> +                * of the same VA region after merging. >>> +                */ >>> +               if (merge) { >>> +                   req_offset = new_offset; >>> +                   req_addr = new_addr; >>> +                   req_range = new_range; >>> +                   op_unmap_new_to_list(&ops->list, va, true); >>> +                   goto next; >>> +               } >>> + >>> +               op_remap_new_to_list(&ops->list, &p, NULL, &u); >>> +               goto next; >>> +           } >>> + >>> +           if (end > req_end) { >>> +               struct drm_gpuva_op_map n = { >>> +                   .va.addr = req_end, >>> +                   .va.range = end - req_end, >>> +                   .gem.obj = obj, >>> +                   .gem.offset = offset + ls_range + >>> +                             req_range, >>> +               }; >>> + >>> +               if (merge) >>> +                   goto done; >>> + >>> +               op_remap_new_to_list(&ops->list, &p, &n, &u); >>> +               break; >>> +           } >>> +       } else if (addr > req_addr) { >>> +           bool merge = obj == req_obj && >>> +                    offset == req_offset + >>> +                          (addr - req_addr); >>> +           if (!prev) >>> +               skip_pmerge = false; >>> + >>> +           if (end == req_end) { >>> +               op_unmap_new_to_list(&ops->list, va, merge); >>> +               break; >>> +           } >>> + >>> +           if (end < req_end) { >>> +               skip_nmerge = false; >>> +               op_unmap_new_to_list(&ops->list, va, merge); >>> +               goto next; >>> +           } >>> + >>> +           if (end > req_end) { >>> +               struct drm_gpuva_op_map n = { >>> +                   .va.addr = req_end, >>> +                   .va.range = end - req_end, >>> +                   .gem.obj = obj, >>> +                   .gem.offset = offset + req_end - addr, >>> +               }; >>> +               struct drm_gpuva_op_unmap u = { .va = va }; >>> +               u64 new_end = end; >>> +               u64 new_range = new_end - req_addr; >>> + >>> +               /* We validated that the requested mapping is >>> +                * within a single VA region already. >>> +                * Since it overlaps the current mapping (which >>> +                * can't cross a VA region boundary) we can be >>> +                * sure that we're still within the boundaries >>> +                * of the same VA region after merging. >>> +                */ >>> +               if (merge) { >>> +                   req_end = new_end; >>> +                   req_range = new_range; >>> +                   op_unmap_new_to_list(&ops->list, va, true); >>> +                   break; >>> +               } >>> + >>> +               op_remap_new_to_list(&ops->list, NULL, &n, &u); >>> +               break; >>> +           } >>> +       } >>> +next: >>> +       prev = va; >>> +   } >>> + >>> +   va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); >>> +   if (va) { >>> +       struct drm_gem_object *obj = va->gem.obj; >>> +       u64 offset = va->gem.offset; >>> +       u64 addr = va->node.start; >>> +       u64 range = va->node.size; >>> +       u64 new_offset = offset; >>> +       u64 new_addr = addr; >>> +       u64 new_range = req_range + range; >>> +       bool merge = obj == req_obj && >>> +                offset + range == req_offset; >>> + >>> +       /* Don't merge over VA region boundaries. */ >>> +       merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); >>> +       if (merge) { >>> +           op_unmap_new_to_list(&ops->list, va, true); >>> + >>> +           req_offset = new_offset; >>> +           req_addr = new_addr; >>> +           req_range = new_range; >>> +       } >>> +   } >>> + >>> +   va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); >>> +   if (va) { >>> +       struct drm_gem_object *obj = va->gem.obj; >>> +       u64 offset = va->gem.offset; >>> +       u64 addr = va->node.start; >>> +       u64 range = va->node.size; >>> +       u64 end = addr + range; >>> +       u64 new_range = req_range + range; >>> +       u64 new_end = end; >>> +       bool merge = obj == req_obj && >>> +                offset == req_offset + req_range; >>> + >>> +       /* Don't merge over VA region boundaries. */ >>> +       merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); >>> +       if (merge) { >>> +           op_unmap_new_to_list(&ops->list, va, true); >>> + >>> +           req_range = new_range; >>> +           req_end = new_end; >>> +       } >>> +   } >>> + >>> +   op_map_new_to_list(&ops->list, >>> +              req_addr, req_range, >>> +              req_obj, req_offset); >>> + >>> +done: >>> +   return ops; >>> + >>> +err_free_ops: >>> +   drm_gpuva_ops_free(ops); >>> +   return ERR_PTR(ret); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); >>> + >>> +#undef op_map_new_to_list >>> +#undef op_remap_new_to_list >>> +#undef op_unmap_new_to_list >>> + >>> +/** >>> + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to >>> split on unmap >>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >>> + * @req_addr: the start address of the range to unmap >>> + * @req_range: the range of the mappings to unmap >>> + * >>> + * This function creates a list of operations to perform unmapping >>> and, if >>> + * required, splitting of the mappings overlapping the unmap range. >>> + * >>> + * The list can be iterated with &drm_gpuva_for_each_op and must be >>> processed >>> + * in the given order. It can contain unmap and remap operations, >>> depending on >>> + * whether there are actual overlapping mappings to split. >>> + * >>> + * There can be an arbitrary amount of unmap operations and a >>> maximum of two >>> + * remap operations. >>> + * >>> + * Note that before calling this function again with another range >>> to unmap it >>> + * is necessary to update the &drm_gpuva_manager's view of the GPU >>> VA space. >>> + * The previously obtained operations must be processed or abandoned. >>> + * To update the &drm_gpuva_manager's view of the GPU VA space >>> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >>> + * drm_gpuva_destroy_unlocked() should be used. >>> + * >>> + * After the caller finished processing the returned &drm_gpuva_ops, >>> they must >>> + * be freed with &drm_gpuva_ops_free. >>> + * >>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR >>> on failure >>> + */ >>> +struct drm_gpuva_ops * >>> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >>> +                 u64 req_addr, u64 req_range) >>> +{ >>> +   struct drm_gpuva_ops *ops; >>> +   struct drm_gpuva_op *op; >>> +   struct drm_gpuva_op_remap *r; >>> +   struct drm_gpuva *va; >>> +   u64 req_end = req_addr + req_range; >>> +   int ret; >>> + >>> +   ops = kzalloc(sizeof(*ops), GFP_KERNEL); >>> +   if (!ops) >>> +       return ERR_PTR(-ENOMEM); >>> + >>> +   INIT_LIST_HEAD(&ops->list); >>> + >>> +   drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >>> +       struct drm_gem_object *obj = va->gem.obj; >>> +       u64 offset = va->gem.offset; >>> +       u64 addr = va->node.start; >>> +       u64 range = va->node.size; >>> +       u64 end = addr + range; >>> + >>> +       op = kzalloc(sizeof(*op), GFP_KERNEL); >>> +       if (!op) { >>> +           ret = -ENOMEM; >>> +           goto err_free_ops; >>> +       } >>> + >>> +       r = &op->remap; >>> + >>> +       if (addr < req_addr) { >>> +           r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); >>> +           if (!r->prev) { >>> +               ret = -ENOMEM; >>> +               goto err_free_op; >>> +           } >>> + >>> +           r->prev->va.addr = addr; >>> +           r->prev->va.range = req_addr - addr; >>> +           r->prev->gem.obj = obj; >>> +           r->prev->gem.offset = offset; >>> +       } >>> + >>> +       if (end > req_end) { >>> +           r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); >>> +           if (!r->next) { >>> +               ret = -ENOMEM; >>> +               goto err_free_prev; >>> +           } >>> + >>> +           r->next->va.addr = req_end; >>> +           r->next->va.range = end - req_end; >>> +           r->next->gem.obj = obj; >>> +           r->next->gem.offset = offset + (req_end - addr); >>> +       } >>> + >>> +       if (op->remap.prev || op->remap.next) { >>> +           op->op = DRM_GPUVA_OP_REMAP; >>> +           r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); >>> +           if (!r->unmap) { >>> +               ret = -ENOMEM; >>> +               goto err_free_next; >>> +           } >>> + >>> +           r->unmap->va = va; >>> +       } else { >>> +           op->op = DRM_GPUVA_OP_UNMAP; >>> +           op->unmap.va = va; >>> +       } >>> + >>> +       list_add_tail(&op->entry, &ops->list); >>> +   } >>> + >>> +   return ops; >>> + >>> +err_free_next: >>> +   if (r->next) >>> +       kfree(r->next); >>> +err_free_prev: >>> +   if (r->prev) >>> +       kfree(r->prev); >>> +err_free_op: >>> +   kfree(op); >>> +err_free_ops: >>> +   drm_gpuva_ops_free(ops); >>> +   return ERR_PTR(ret); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); >>> + >>> +/** >>> + * drm_gpuva_ops_free - free the given &drm_gpuva_ops >>> + * @ops: the &drm_gpuva_ops to free >>> + * >>> + * Frees the given &drm_gpuva_ops structure including all the ops >>> associated >>> + * with it. >>> + */ >>> +void >>> +drm_gpuva_ops_free(struct drm_gpuva_ops *ops) >>> +{ >>> +   struct drm_gpuva_op *op, *next; >>> + >>> +   drm_gpuva_for_each_op_safe(op, next, ops) { >>> +       list_del(&op->entry); >>> +       if (op->op == DRM_GPUVA_OP_REMAP) { >>> +           if (op->remap.prev) >>> +               kfree(op->remap.prev); >>> + >>> +           if (op->remap.next) >>> +               kfree(op->remap.next); >>> + >>> +           kfree(op->remap.unmap); >>> +       } >>> +       kfree(op); >>> +   } >>> + >>> +   kfree(ops); >>> +} >>> +EXPORT_SYMBOL(drm_gpuva_ops_free); >>> diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h >>> index d7c521e8860f..6feacd93aca6 100644 >>> --- a/include/drm/drm_drv.h >>> +++ b/include/drm/drm_drv.h >>> @@ -104,6 +104,12 @@ enum drm_driver_feature { >>>       * acceleration should be handled by two drivers that are >>> connected using auxiliary bus. >>>       */ >>>      DRIVER_COMPUTE_ACCEL           = BIT(7), >>> +   /** >>> +    * @DRIVER_GEM_GPUVA: >>> +    * >>> +    * Driver supports user defined GPU VA bindings for GEM objects. >>> +    */ >>> +   DRIVER_GEM_GPUVA       = BIT(8), >>>      /* IMPORTANT: Below are all the legacy flags, add new ones >>> above. */ >>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h >>> index 772a4adf5287..4a3679034966 100644 >>> --- a/include/drm/drm_gem.h >>> +++ b/include/drm/drm_gem.h >>> @@ -36,6 +36,8 @@ >>>  #include <linux/kref.h> >>>  #include <linux/dma-resv.h> >>> +#include <linux/list.h> >>> +#include <linux/mutex.h> >>>  #include <drm/drm_vma_manager.h> >>> @@ -337,6 +339,17 @@ struct drm_gem_object { >>>       */ >>>      struct dma_resv _resv; >>> +   /** >>> +    * @gpuva: >>> +    * >>> +    * Provides the list and list mutex of GPU VAs attached to this >>> +    * GEM object. >>> +    */ >>> +   struct { >>> +       struct list_head list; >>> +       struct mutex mutex; >>> +   } gpuva; >>> + >>>      /** >>>       * @funcs: >>>       * >>> @@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru >>> *lru, struct drm_gem_object *obj); >>>  unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned >>> nr_to_scan, >>>                     bool (*shrink)(struct drm_gem_object *obj)); >>> +/** >>> + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object >>> + * @obj: the &drm_gem_object >>> + * >>> + * This initializes the &drm_gem_object's &drm_gpuva list and the mutex >>> + * protecting it. >>> + * >>> + * Calling this function is only necessary for drivers intending to >>> support the >>> + * &drm_driver_feature DRIVER_GEM_GPUVA. >>> + */ >>> +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) >>> +{ >>> +   INIT_LIST_HEAD(&obj->gpuva.list); >>> +   mutex_init(&obj->gpuva.mutex); >>> +} >>> + >>> +/** >>> + * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex >>> + * @obj: the &drm_gem_object >>> + * >>> + * This unlocks the mutex protecting the &drm_gem_object's >>> &drm_gpuva list. >>> + */ >>> +static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) >>> +{ >>> +   mutex_lock(&obj->gpuva.mutex); >>> +} >>> + >>> +/** >>> + * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex >>> + * @obj: the &drm_gem_object >>> + * >>> + * This unlocks the mutex protecting the &drm_gem_object's >>> &drm_gpuva list. >>> + */ >>> +static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) >>> +{ >>> +   mutex_unlock(&obj->gpuva.mutex); >>> +} >>> + >>> +/** >>> + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas >>> + * @entry: &drm_gpuva structure to assign to in each iteration step >>> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated >>> with >>> + * >>> + * This iterator walks over all &drm_gpuva structures associated >>> with the >>> + * &drm_gpuva_manager. >>> + */ >>> +#define drm_gem_for_each_gpuva(entry, obj) \ >>> +   list_for_each_entry(entry, &obj->gpuva.list, head) >>> + >>> +/** >>> + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a >>> list of gpuvas >>> + * @entry: &drm_gpuva structure to assign to in each iteration step >>> + * @next: &next &drm_gpuva to store the next step >>> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated >>> with >>> + * >>> + * This iterator walks over all &drm_gpuva structures associated >>> with the >>> + * &drm_gem_object. It is implemented with >>> list_for_each_entry_safe(), hence >>> + * it is save against removal of elements. >>> + */ >>> +#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ >>> +   list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) >>> + >>>  #endif /* __DRM_GEM_H__ */ >>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h >>> new file mode 100644 >>> index 000000000000..adeb0c916e91 >>> --- /dev/null >>> +++ b/include/drm/drm_gpuva_mgr.h >>> @@ -0,0 +1,527 @@ >>> +// SPDX-License-Identifier: GPL-2.0 >>> + >>> +#ifndef __DRM_GPUVA_MGR_H__ >>> +#define __DRM_GPUVA_MGR_H__ >>> + >>> +/* >>> + * Copyright (c) 2022 Red Hat. >>> + * >>> + * Permission is hereby granted, free of charge, to any person >>> obtaining a >>> + * copy of this software and associated documentation files (the >>> "Software"), >>> + * to deal in the Software without restriction, including without >>> limitation >>> + * the rights to use, copy, modify, merge, publish, distribute, >>> sublicense, >>> + * and/or sell copies of the Software, and to permit persons to whom >>> the >>> + * Software is furnished to do so, subject to the following conditions: >>> + * >>> + * The above copyright notice and this permission notice shall be >>> included in >>> + * all copies or substantial portions of the Software. >>> + * >>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, >>> EXPRESS OR >>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF >>> MERCHANTABILITY, >>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO >>> EVENT SHALL >>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, >>> DAMAGES OR >>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR >>> OTHERWISE, >>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE >>> USE OR >>> + * OTHER DEALINGS IN THE SOFTWARE. >>> + */ >>> + >>> +#include <drm/drm_mm.h> >>> +#include <linux/mm.h> >>> +#include <linux/rbtree.h> >>> +#include <linux/spinlock.h> >>> +#include <linux/types.h> >>> + >>> +struct drm_gpuva_region; >>> +struct drm_gpuva; >>> +struct drm_gpuva_ops; >>> + >>> +/** >>> + * struct drm_gpuva_manager - DRM GPU VA Manager >>> + * >>> + * The DRM GPU VA Manager keeps track of a GPU's virtual address >>> space by using >>> + * the &drm_mm range allocator. Typically, this structure is >>> embedded in bigger >>> + * driver structures. >>> + * >>> + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. >>> bytes or >>> + * pages. >>> + * >>> + * There should be one manager instance per GPU virtual address space. >>> + */ >>> +struct drm_gpuva_manager { >>> +   /** >>> +    * @name: the name of the DRM GPU VA space >>> +    */ >>> +   const char *name; >>> + >>> +   /** >>> +    * @mm_start: start of the VA space >>> +    */ >>> +   u64 mm_start; >>> + >>> +   /** >>> +    * @mm_range: length of the VA space >>> +    */ >>> +   u64 mm_range; >>> + >>> +   /** >>> +    * @region_mm: the &drm_mm range allocator to track GPU VA regions >>> +    */ >>> +   struct drm_mm region_mm; >>> + >>> +   /** >>> +    * @va_mm: the &drm_mm range allocator to track GPU VA mappings >>> +    */ >>> +   struct drm_mm va_mm; >>> + >>> +   /** >>> +    * @kernel_alloc_node: >>> +    * >>> +    * &drm_mm_node representing the address space cutout reserved for >>> +    * the kernel >>> +    */ >>> +   struct drm_mm_node kernel_alloc_node; >>> +}; >>> + >>> +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >>> +               const char *name, >>> +               u64 start_offset, u64 range, >>> +               u64 reserve_offset, u64 reserve_range); >>> +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); >>> + >>> +/** >>> + * struct drm_gpuva_region - structure to track a portion of GPU VA >>> space >>> + * >>> + * This structure represents a portion of a GPUs VA space and is >>> associated >>> + * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >>> + * >>> + * GPU VA mappings, represented by &drm_gpuva objects, are >>> restricted to be >>> + * placed within a &drm_gpuva_region. >>> + */ >>> +struct drm_gpuva_region { >>> +   /** >>> +    * @node: the &drm_mm_node to track the GPU VA region >>> +    */ >>> +   struct drm_mm_node node; >>> + >>> +   /** >>> +    * @mgr: the &drm_gpuva_manager this object is associated with >>> +    */ >>> +   struct drm_gpuva_manager *mgr; >>> + >>> +   /** >>> +    * @sparse: indicates whether this region is sparse >>> +    */ >>> +   bool sparse; >>> +}; >>> + >>> +struct drm_gpuva_region * >>> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >>> +             u64 addr, u64 range); >>> +int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >>> +               struct drm_gpuva_region *reg, >>> +               u64 addr, u64 range); >>> +void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >>> +                 struct drm_gpuva_region *reg); >>> + >>> +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, >>> +            struct drm_gpuva *va, >>> +            u64 addr, u64 range); >>> +/** >>> + * drm_gpuva_for_each_region_in_range - iternator to walk over a >>> range of nodes >>> + * @node__: &drm_gpuva_region structure to assign to in each >>> iteration step >>> + * @gpuva__: &drm_gpuva_manager structure to walk >>> + * @start__: starting offset, the first node will overlap this >>> + * @end__: ending offset, the last node will start before this (but >>> may overlap) >>> + * >>> + * This iterator walks over all nodes in the range allocator that lie >>> + * between @start and @end. It is implemented similarly to >>> list_for_each(), >>> + * but is using &drm_mm's internal interval tree to accelerate the >>> search for >>> + * the starting node, and hence isn't safe against removal of >>> elements. It >>> + * assumes that @end is within (or is the upper limit of) the >>> &drm_gpuva_manager. >>> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, >>> the >>> + * iterator may walk over the special _unallocated_ >>> &drm_mm.head_node of the >>> + * backing &drm_mm, and may even continue indefinitely. >>> + */ >>> +#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, >>> end__) \ >>> +   for (node__ = (struct drm_gpuva_region >>> *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ >>> +                                    (start__), (end__)-1); \ >>> +        node__->node.start < (end__); \ >>> +        node__ = (struct drm_gpuva_region >>> *)list_next_entry(&node__->node, node_list)) >>> + >>> +/** >>> + * drm_gpuva_for_each_region - iternator to walk over a range of nodes >>> + * @entry: &drm_gpuva_region structure to assign to in each >>> iteration step >>> + * @gpuva: &drm_gpuva_manager structure to walk >>> + * >>> + * This iterator walks over all &drm_gpuva_region structures >>> associated with the >>> + * &drm_gpuva_manager. >>> + */ >>> +#define drm_gpuva_for_each_region(entry, gpuva) \ >>> +   list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), >>> node.node_list) >>> + >>> +/** >>> + * drm_gpuva_for_each_region_safe - iternator to safely walk over a >>> range of >>> + * nodes >>> + * @entry: &drm_gpuva_region structure to assign to in each >>> iteration step >>> + * @next: &next &drm_gpuva_region to store the next step >>> + * @gpuva: &drm_gpuva_manager structure to walk >>> + * >>> + * This iterator walks over all &drm_gpuva_region structures >>> associated with the >>> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), >>> so save >>> + * against removal of elements. >>> + */ >>> +#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ >>> +   list_for_each_entry_safe(entry, next, >>> drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >>> + >>> + >>> +/** >>> + * enum drm_gpuva_flags - flags for struct drm_gpuva >>> + */ >>> +enum drm_gpuva_flags { >>> +   /** >>> +    * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is >>> swapped >>> +    */ >>> +   DRM_GPUVA_SWAPPED = (1 << 0), >>> +}; >>> + >>> +/** >>> + * struct drm_gpuva - structure to track a GPU VA mapping >>> + * >>> + * This structure represents a GPU VA mapping and is associated with a >>> + * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >>> + * >>> + * Typically, this structure is embedded in bigger driver structures. >>> + */ >>> +struct drm_gpuva { >>> +   /** >>> +    * @node: the &drm_mm_node to track the GPU VA mapping >>> +    */ >>> +   struct drm_mm_node node; >>> + >>> +   /** >>> +    * @mgr: the &drm_gpuva_manager this object is associated with >>> +    */ >>> +   struct drm_gpuva_manager *mgr; >>> + >>> +   /** >>> +    * @region: the &drm_gpuva_region the &drm_gpuva is mapped in >>> +    */ >>> +   struct drm_gpuva_region *region; >>> + >>> +   /** >>> +    * @head: the &list_head to attach this object to a &drm_gem_object >>> +    */ >>> +   struct list_head head; >>> + >>> +   /** >>> +    * @flags: the &drm_gpuva_flags for this mapping >>> +    */ >>> +   enum drm_gpuva_flags flags; >>> + >>> +   /** >>> +    * @gem: structure containing the &drm_gem_object and it's offset >>> +    */ >>> +   struct { >>> +       /** >>> +        * @offset: the offset within the &drm_gem_object >>> +        */ >>> +       u64 offset; >>> + >>> +       /** >>> +        * @obj: the mapped &drm_gem_object >>> +        */ >>> +       struct drm_gem_object *obj; >>> +   } gem; >>> +}; >>> + >>> +void drm_gpuva_link_locked(struct drm_gpuva *va); >>> +void drm_gpuva_link_unlocked(struct drm_gpuva *va); >>> +void drm_gpuva_unlink_locked(struct drm_gpuva *va); >>> +void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); >>> + >>> +void drm_gpuva_destroy_locked(struct drm_gpuva *va); >>> +void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); >>> + >>> +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, >>> +                u64 addr, u64 range); >>> +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, >>> u64 start); >>> +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, >>> u64 end); >>> + >>> +/** >>> + * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva >>> is swapped >>> + * @va: the &drm_gpuva to set the swap flag of >>> + * @swap: indicates whether the &drm_gpuva is swapped >>> + */ >>> +static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) >>> +{ >>> +   if (swap) >>> +       va->flags |= DRM_GPUVA_SWAPPED; >>> +   else >>> +       va->flags &= ~DRM_GPUVA_SWAPPED; >>> +} >>> + >>> +/** >>> + * drm_gpuva_swapped - indicates whether the backing BO of this >>> &drm_gpuva >>> + * is swapped >>> + * @va: the &drm_gpuva to check >>> + */ >>> +static inline bool drm_gpuva_swapped(struct drm_gpuva *va) >>> +{ >>> +   return va->flags & DRM_GPUVA_SWAPPED; >>> +} >>> + >>> +/** >>> + * drm_gpuva_for_each_va_in_range - iternator to walk over a range >>> of nodes >>> + * @node__: &drm_gpuva structure to assign to in each iteration step >>> + * @gpuva__: &drm_gpuva_manager structure to walk >>> + * @start__: starting offset, the first node will overlap this >>> + * @end__: ending offset, the last node will start before this (but >>> may overlap) >>> + * >>> + * This iterator walks over all nodes in the range allocator that lie >>> + * between @start and @end. It is implemented similarly to >>> list_for_each(), >>> + * but is using &drm_mm's internal interval tree to accelerate the >>> search for >>> + * the starting node, and hence isn't safe against removal of >>> elements. It >>> + * assumes that @end is within (or is the upper limit of) the >>> &drm_gpuva_manager. >>> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, >>> the >>> + * iterator may walk over the special _unallocated_ >>> &drm_mm.head_node of the >>> + * backing &drm_mm, and may even continue indefinitely. >>> + */ >>> +#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, >>> end__) \ >>> +   for (node__ = (struct drm_gpuva >>> *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ >>> +                                 (start__), (end__)-1); \ >>> +        node__->node.start < (end__); \ >>> +        node__ = (struct drm_gpuva *)list_next_entry(&node__->node, >>> node_list)) >>> + >>> +/** >>> + * drm_gpuva_for_each_va - iternator to walk over a range of nodes >>> + * @entry: &drm_gpuva structure to assign to in each iteration step >>> + * @gpuva: &drm_gpuva_manager structure to walk >>> + * >>> + * This iterator walks over all &drm_gpuva structures associated >>> with the >>> + * &drm_gpuva_manager. >>> + */ >>> +#define drm_gpuva_for_each_va(entry, gpuva) \ >>> +   list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), >>> node.node_list) >>> + >>> +/** >>> + * drm_gpuva_for_each_va_safe - iternator to safely walk over a >>> range of >>> + * nodes >>> + * @entry: &drm_gpuva structure to assign to in each iteration step >>> + * @next: &next &drm_gpuva to store the next step >>> + * @gpuva: &drm_gpuva_manager structure to walk >>> + * >>> + * This iterator walks over all &drm_gpuva structures associated >>> with the >>> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), >>> so save >>> + * against removal of elements. >>> + */ >>> +#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ >>> +   list_for_each_entry_safe(entry, next, >>> drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >>> + >>> +/** >>> + * enum drm_gpuva_op_type - GPU VA operation type >>> + * >>> + * Operations to alter the GPU VA mappings tracked by the >>> &drm_gpuva_manager >>> + * can be map, remap or unmap operations. >>> + */ >>> +enum drm_gpuva_op_type { >>> +   /** >>> +    * @DRM_GPUVA_OP_MAP: the map op type >>> +    */ >>> +   DRM_GPUVA_OP_MAP, >>> + >>> +   /** >>> +    * @DRM_GPUVA_OP_REMAP: the remap op type >>> +    */ >>> +   DRM_GPUVA_OP_REMAP, >>> + >>> +   /** >>> +    * @DRM_GPUVA_OP_UNMAP: the unmap op type >>> +    */ >>> +   DRM_GPUVA_OP_UNMAP, >>> +}; >>> + >>> +/** >>> + * struct drm_gpuva_op_map - GPU VA map operation >>> + * >>> + * This structure represents a single map operation generated by the >>> + * DRM GPU VA manager. >>> + */ >>> +struct drm_gpuva_op_map { >>> +   /** >>> +    * @va: structure containing address and range of a map >>> +    * operation >>> +    */ >>> +   struct { >>> +       /** >>> +        * @addr: the base address of the new mapping >>> +        */ >>> +       u64 addr; >>> + >>> +       /** >>> +        * @range: the range of the new mapping >>> +        */ >>> +       u64 range; >>> +   } va; >>> + >>> +   /** >>> +    * @gem: structure containing the &drm_gem_object and it's offset >>> +    */ >>> +   struct { >>> +       /** >>> +        * @offset: the offset within the &drm_gem_object >>> +        */ >>> +       u64 offset; >>> + >>> +       /** >>> +        * @obj: the &drm_gem_object to map >>> +        */ >>> +       struct drm_gem_object *obj; >>> +   } gem; >>> +}; >>> + >>> +/** >>> + * struct drm_gpuva_op_unmap - GPU VA unmap operation >>> + * >>> + * This structure represents a single unmap operation generated by the >>> + * DRM GPU VA manager. >>> + */ >>> +struct drm_gpuva_op_unmap { >>> +   /** >>> +    * @va: the &drm_gpuva to unmap >>> +    */ >>> +   struct drm_gpuva *va; >>> + >>> +   /** >>> +    * @keep: >>> +    * >>> +    * Indicates whether this &drm_gpuva is physically contiguous >>> with the >>> +    * original mapping request. >>> +    * >>> +    * Optionally, if &keep is set, drivers may keep the actual page >>> table >>> +    * mappings for this &drm_gpuva, adding the missing page table >>> entries >>> +    * only and update the &drm_gpuva_manager accordingly. >>> +    */ >>> +   bool keep; >>> +}; >>> + >>> +/** >>> + * struct drm_gpuva_op_remap - GPU VA remap operation >>> + * >>> + * This represents a single remap operation generated by the DRM GPU >>> VA manager. >>> + * >>> + * A remap operation is generated when an existing GPU VA mmapping >>> is split up >>> + * by inserting a new GPU VA mapping or by partially unmapping existent >>> + * mapping(s), hence it consists of a maximum of two map and one unmap >>> + * operation. >>> + * >>> + * The @unmap operation takes care of removing the original existing >>> mapping. >>> + * @prev is used to remap the preceding part, @next the subsequent >>> part. >>> + * >>> + * If either a new mapping's start address is aligned with the start >>> address >>> + * of the old mapping or the new mapping's end address is aligned >>> with the >>> + * end address of the old mapping, either @prev or @next is NULL. >>> + * >>> + * Note, the reason for a dedicated remap operation, rather than >>> arbitrary >>> + * unmap and map operations, is to give drivers the chance of >>> extracting driver >>> + * specific data for creating the new mappings from the unmap >>> operations's >>> + * &drm_gpuva structure which typically is embedded in larger driver >>> specific >>> + * structures. >>> + */ >>> +struct drm_gpuva_op_remap { >>> +   /** >>> +    * @prev: the preceding part of a split mapping >>> +    */ >>> +   struct drm_gpuva_op_map *prev; >>> + >>> +   /** >>> +    * @next: the subsequent part of a split mapping >>> +    */ >>> +   struct drm_gpuva_op_map *next; >>> + >>> +   /** >>> +    * @unmap: the unmap operation for the original existing mapping >>> +    */ >>> +   struct drm_gpuva_op_unmap *unmap; >>> +}; >>> + >>> +/** >>> + * struct drm_gpuva_op - GPU VA operation >>> + * >>> + * This structure represents a single generic operation, which can >>> be either >>> + * map, unmap or remap. >>> + * >>> + * The particular type of the operation is defined by @op. >>> + */ >>> +struct drm_gpuva_op { >>> +   /** >>> +    * @entry: >>> +    * >>> +    * The &list_head used to distribute instances of this struct >>> within >>> +    * &drm_gpuva_ops. >>> +    */ >>> +   struct list_head entry; >>> + >>> +   /** >>> +    * @op: the type of the operation >>> +    */ >>> +   enum drm_gpuva_op_type op; >>> + >>> +   union { >>> +       /** >>> +        * @map: the map operation >>> +        */ >>> +       struct drm_gpuva_op_map map; >>> + >>> +       /** >>> +        * @unmap: the unmap operation >>> +        */ >>> +       struct drm_gpuva_op_unmap unmap; >>> + >>> +       /** >>> +        * @remap: the remap operation >>> +        */ >>> +       struct drm_gpuva_op_remap remap; >>> +   }; >>> +}; >>> + >>> +/** >>> + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op >>> + */ >>> +struct drm_gpuva_ops { >>> +   /** >>> +    * @list: the &list_head >>> +    */ >>> +   struct list_head list; >>> +}; >>> + >>> +/** >>> + * drm_gpuva_for_each_op - iterator to walk over all ops >>> + * @op: &drm_gpuva_op to assign in each iteration step >>> + * @ops: &drm_gpuva_ops to walk >>> + * >>> + * This iterator walks over all ops within a given list of operations. >>> + */ >>> +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, >>> &(ops)->list, entry) >>> + >>> +/** >>> + * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops >>> + * @op: &drm_gpuva_op to assign in each iteration step >>> + * @next: &next &drm_gpuva_op to store the next step >>> + * @ops: &drm_gpuva_ops to walk >>> + * >>> + * This iterator walks over all ops within a given list of >>> operations. It is >>> + * implemented with list_for_each_safe(), so save against removal of >>> elements. >>> + */ >>> +#define drm_gpuva_for_each_op_safe(op, next, ops) \ >>> +   list_for_each_entry_safe(op, next, &(ops)->list, entry) >>> + >>> +struct drm_gpuva_ops * >>> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >>> +               u64 addr, u64 range, >>> +               struct drm_gem_object *obj, u64 offset); >>> +struct drm_gpuva_ops * >>> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >>> +                 u64 addr, u64 range); >>> +void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); >>> + >>> +#endif /* __DRM_GPUVA_MGR_H__ */ >>> -- >>> 2.39.0 >>> >
On 2/3/23 18:37, Matthew Brost wrote: > On Wed, Jan 18, 2023 at 07:12:45AM +0100, Danilo Krummrich wrote: >> This adds the infrastructure for a manager implementation to keep track >> of GPU virtual address (VA) mappings. >> >> New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers >> start implementing, allow userspace applications to request multiple and >> arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is >> intended to serve the following purposes in this context. >> >> 1) Provide a dedicated range allocator to track GPU VA allocations and >> mappings, making use of the drm_mm range allocator. >> >> 2) Generically connect GPU VA mappings to their backing buffers, in >> particular DRM GEM objects. >> >> 3) Provide a common implementation to perform more complex mapping >> operations on the GPU VA space. In particular splitting and merging >> of GPU VA mappings, e.g. for intersecting mapping requests or partial >> unmap requests. >> > > Over the past week I've hacked together a PoC port of Xe to GPUVA [1], so > far it seems really promising. 95% of the way to being feature > equivalent of the current Xe VM bind implementation and have line of > sight to getting sparse bindings implemented on top of GPUVA too. IMO > this has basically everything we need for Xe with a few tweaks. > > I am out until 2/14 but wanted to get my thoughts / suggestions out on > the list before I leave. Thanks for your work on that! > 1. GPUVA post didn't support the way Xe does userptrs - a NULL GEM. I > believe with [2], [3], and [4] GPUVA will support NULL GEMs. Also my > thinking sparse binds will also have NULL GEMs, more on sparse bindings > below. > > 2. I agree with Christian that drm_mm probably isn't what we want to > base the GPUVA implementation on, rather a RB tree or Maple tree has > been discussed. The implementation should be fairly easy to tune once we > have benchmarks running so not to concerned here as we can figure this > out down the line. > > 3. In Xe we want create xe_vm_op list which inherits from drm_gpuva_op > I've done this with a hack [5], I believe when we rebase we can do this > with a custom callback to allocate a large op size. > > 4. I'd like add user bits to drm_gpuva_flags like I do in [6]. This is > similar to DMA_FENCE_FLAG_USER_BITS. > > 5. In Xe we have VM prefetch operation which is needed for our compute > UMD with page faults. I'd like add prefetch type of operation like we do > in [7]. > > 6. In Xe we have VM unbind all mappings for a GEM IOCTL, I'd like to add > support to generate this operation list to GPUVA like we do in [8]. > > 7. I've thought about how Xe will implement sparse mappings (read 0, > writes dropped). My current thinking is a sparse mapping will be > represented as a drm_gpuva rather than region like in Nouveau. Making > regions optional to me seems likes good idea rather than forcing the > user of GPUVA code to create 1 large region for the manager as I > currently do in the Xe PoC. > > 8. Personally I'd like the caller to own the locking for GEM drm_gpuva > list (drm_gpuva_link_*, drm_gpuva_unlink_* functions). In Xe we almost > certainly will have the GEM dma-resv lock when we touch this list so an > extra lock here is redundant. Also it kinda goofy that caller owns the > for drm_gpuva insertion / removal but not the locking for this list. You really mean having the dma-resv lock aquired or have a fence on the dma_resv obj? In Nouveau I map/unmap gpuvas in the ttm_device_funcs' move() callback. I only validate() and add a dma_resv fence to GEMs of new mappings. For unmap / remap operations I just take the GEMs gpuva list lock and check whether the GEM is evicted currently. If it is evicted currently (and hence unmapped) and I'm on a remap operation I can just do the update on the GPUVA space, since latest on the next EXEC ioctl() the corresponding GEM is validated and hence re-mapped correctly. If it's an unmap operation I just need to remove the GPUVA, since the corresponding mapping is already unmapped when it's GEM is evicted. If it's not evicted I proceed as usual. Anyway, drm_gpuva_insert() and drm_gpuva_remove() (was drm_gpuva_destroy() before) do *not* implicitly add the gpuva to the GEM's gpuva list anymore. Instead there is only drm_gpuva_link() and drm_gpuva_unlink(), not doing any lockdep checks, but clearly documenting that the caller is resposible to take care of mutual exclusion. > > WRT to Christian thoughts on a common uAPI rules for VM binds, I kinda > like that idea but I don't think that is necessary. All of pur uAPI > should be close but also the GPUVA implementation should be flexible > enough to fit all of our needs and I think for the most part it is. > > Let me know what everything thinks about this. It would be great if when > I'm back on 2/14 I can rebase the Xe port to GPUVA on another version of > the GPUVA code and get sparse binding support implementation. Also I'd > like to get GPUVA merged in the Xe repo ASAP as our VM bind code badly > needed to be cleaned and this was the push we needed to make this > happen. All those are great improvements and some of I will pick up for Nouveau as well. Except for switching from drm_mm to maple_tree I implemented all other suggestions. You can find them here: https://gitlab.freedesktop.org/nouvelles/kernel/-/tree/new-uapi-drm-next-fixes I'm just about to start moving the GPUVA Manager to use maple_tree instead and plan to send a V2 by end of this week. - Danilo > > Matt > > [1] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314 > [2] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=2ae21d7a3f52e5eb2c105ed8ae231471274bdc36 > [3] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=49fca9f5d96201f5cbd1b19c7ff17eedfac65cdc > [4] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=61fa6b1e1f10e791ae82358fa971b04421d53024 > [5] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=87fc08dcf0840e794b38269fe4c6a95d088d79ec > [6] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=a4826c22f6788bc29906ffa263c1cd3c4661fa77 > [7] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=f008bbb55b213868e52c7b9cda4c1bfb95af6aee > [8] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/314/diffs?commit_id=41f4f71c05d04d2b17d988dd95369b5df2d7f681 > >> Idea-suggested-by: Dave Airlie <airlied@redhat.com> >> Signed-off-by: Danilo Krummrich <dakr@redhat.com> >> --- >> Documentation/gpu/drm-mm.rst | 31 + >> drivers/gpu/drm/Makefile | 1 + >> drivers/gpu/drm/drm_gem.c | 3 + >> drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ >> include/drm/drm_drv.h | 6 + >> include/drm/drm_gem.h | 75 ++ >> include/drm/drm_gpuva_mgr.h | 527 ++++++++++++ >> 7 files changed, 1966 insertions(+) >> create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c >> create mode 100644 include/drm/drm_gpuva_mgr.h >> >> diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst >> index a52e6f4117d6..c9f120cfe730 100644 >> --- a/Documentation/gpu/drm-mm.rst >> +++ b/Documentation/gpu/drm-mm.rst >> @@ -466,6 +466,37 @@ DRM MM Range Allocator Function References >> .. kernel-doc:: drivers/gpu/drm/drm_mm.c >> :export: >> >> +DRM GPU VA Manager >> +================== >> + >> +Overview >> +-------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :doc: Overview >> + >> +Split and Merge >> +--------------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :doc: Split and Merge >> + >> +Locking >> +------- >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :doc: Locking >> + >> + >> +DRM GPU VA Manager Function References >> +-------------------------------------- >> + >> +.. kernel-doc:: include/drm/drm_gpuva_mgr.h >> + :internal: >> + >> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c >> + :export: >> + >> DRM Buddy Allocator >> =================== >> >> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile >> index 4fe190aee584..de2ffca3b6e4 100644 >> --- a/drivers/gpu/drm/Makefile >> +++ b/drivers/gpu/drm/Makefile >> @@ -45,6 +45,7 @@ drm-y := \ >> drm_vblank.o \ >> drm_vblank_work.o \ >> drm_vma_manager.o \ >> + drm_gpuva_mgr.o \ >> drm_writeback.o >> drm-$(CONFIG_DRM_LEGACY) += \ >> drm_agpsupport.o \ >> diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c >> index 59a0bb5ebd85..65115fe88627 100644 >> --- a/drivers/gpu/drm/drm_gem.c >> +++ b/drivers/gpu/drm/drm_gem.c >> @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device *dev, >> if (!obj->resv) >> obj->resv = &obj->_resv; >> >> + if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) >> + drm_gem_gpuva_init(obj); >> + >> drm_vma_node_reset(&obj->vma_node); >> INIT_LIST_HEAD(&obj->lru_node); >> } >> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c >> new file mode 100644 >> index 000000000000..e665f642689d >> --- /dev/null >> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c >> @@ -0,0 +1,1323 @@ >> +// SPDX-License-Identifier: GPL-2.0 >> +/* >> + * Copyright (c) 2022 Red Hat. >> + * >> + * Permission is hereby granted, free of charge, to any person obtaining a >> + * copy of this software and associated documentation files (the "Software"), >> + * to deal in the Software without restriction, including without limitation >> + * the rights to use, copy, modify, merge, publish, distribute, sublicense, >> + * and/or sell copies of the Software, and to permit persons to whom the >> + * Software is furnished to do so, subject to the following conditions: >> + * >> + * The above copyright notice and this permission notice shall be included in >> + * all copies or substantial portions of the Software. >> + * >> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR >> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, >> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL >> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR >> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >> + * OTHER DEALINGS IN THE SOFTWARE. >> + * >> + * Authors: >> + * Danilo Krummrich <dakr@redhat.com> >> + * >> + */ >> + >> +#include <drm/drm_gem.h> >> +#include <drm/drm_gpuva_mgr.h> >> + >> +/** >> + * DOC: Overview >> + * >> + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager keeps track >> + * of a GPU's virtual address (VA) space and manages the corresponding virtual >> + * mappings represented by &drm_gpuva objects. It also keeps track of the >> + * mapping's backing &drm_gem_object buffers. >> + * >> + * &drm_gem_object buffers maintain a list (and a corresponding list lock) of >> + * &drm_gpuva objects representing all existent GPU VA mappings using this >> + * &drm_gem_object as backing buffer. >> + * >> + * A GPU VA mapping can only be created within a previously allocated >> + * &drm_gpuva_region, which represents a reserved portion of the GPU VA space. >> + * GPU VA mappings are not allowed to span over a &drm_gpuva_region's boundary. >> + * >> + * GPU VA regions can also be flagged as sparse, which allows drivers to create >> + * sparse mappings for a whole GPU VA region in order to support Vulkan >> + * 'Sparse Resources'. >> + * >> + * The GPU VA manager internally uses the &drm_mm range allocator to manage the >> + * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's virtual address >> + * space. >> + * >> + * Besides the GPU VA space regions (&drm_gpuva_region) allocated by a driver >> + * the &drm_gpuva_manager contains a special region representing the portion of >> + * VA space reserved by the kernel. This node is initialized together with the >> + * GPU VA manager instance and removed when the GPU VA manager is destroyed. >> + * >> + * In a typical application drivers would embed struct drm_gpuva_manager, >> + * struct drm_gpuva_region and struct drm_gpuva within their own driver >> + * specific structures, there won't be any memory allocations of it's own nor >> + * memory allocations of &drm_gpuva or &drm_gpuva_region entries. >> + */ >> + >> +/** >> + * DOC: Split and Merge >> + * >> + * The DRM GPU VA manager also provides an algorithm implementing splitting and >> + * merging of existent GPU VA mappings with the ones that are requested to be >> + * mapped or unmapped. This feature is required by the Vulkan API to implement >> + * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to this as >> + * VM BIND. >> + * >> + * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a list of map, unmap >> + * and remap operations for a given newly requested mapping. This list >> + * represents the set of operations to execute in order to integrate the new >> + * mapping cleanly into the current state of the GPU VA space. >> + * >> + * Depending on how the new GPU VA mapping intersects with the existent mappings >> + * of the GPU VA space the &drm_gpuva_ops contain an arbitrary amount of unmap >> + * operations, a maximum of two remap operations and a single map operation. >> + * The set of operations can also be empty if no operation is required, e.g. if >> + * the requested mapping already exists in the exact same way. >> + * >> + * The single map operation, if existent, represents the original map operation >> + * requested by the caller. Please note that this operation might be altered >> + * comparing it with the original map operation, e.g. because it was merged with >> + * an already existent mapping. Hence, drivers must execute this map operation >> + * instead of the original one they passed to drm_gpuva_sm_map_ops_create(). >> + * >> + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates whether the >> + * &drm_gpuva to unmap is physically contiguous with the original mapping >> + * request. Optionally, if 'keep' is set, drivers may keep the actual page table >> + * entries for this &drm_gpuva, adding the missing page table entries only and >> + * update the &drm_gpuva_manager's view of things accordingly. >> + * >> + * Drivers may do the same optimization, namely delta page table updates, also >> + * for remap operations. This is possible since &drm_gpuva_op_remap consists of >> + * one unmap operation and one or two map operations, such that drivers can >> + * derive the page table update delta accordingly. >> + * >> + * Note that there can't be more than two existent mappings to split up, one at >> + * the beginning and one at the end of the new mapping, hence there is a >> + * maximum of two remap operations. >> + * >> + * Generally, the DRM GPU VA manager never merges mappings across the >> + * boundaries of &drm_gpuva_regions. This is the case since merging between >> + * GPU VA regions would result into unmap and map operations to be issued for >> + * both regions involved although the original mapping request was referred to >> + * one specific GPU VA region only. Since the other GPU VA region, the one not >> + * explicitly requested to be altered, might be in use by the GPU, we are not >> + * allowed to issue any map/unmap operations for this region. >> + * >> + * Note that before calling drm_gpuva_sm_map_ops_create() again with another >> + * mapping request it is necessary to update the &drm_gpuva_manager's view of >> + * the GPU VA space. The previously obtained operations must be either fully >> + * processed or completely abandoned. >> + * >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * Analogue to drm_gpuva_sm_map_ops_create() drm_gpuva_sm_unmap_ops_create() >> + * provides drivers a the list of operations to be executed in order to unmap >> + * a range of GPU VA space. The logic behind this functions is way simpler >> + * though: For all existent mappings enclosed by the given range unmap >> + * operations are created. For mappings which are only partically located within >> + * the given range, remap operations are created such that those mappings are >> + * split up and re-mapped partically. >> + * >> + * The following paragraph depicts the basic constellations of existent GPU VA >> + * mappings, a newly requested mapping and the resulting mappings as implemented >> + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations >> + * of those constellations. >> + * >> + * :: >> + * >> + * 1) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 2) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 a 1 >> + * new: |-----------| (bo_offset=m) >> + * >> + * >> + * 3) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 4) Existent mapping is replaced. >> + * -------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or bo_offset. >> + * >> + * >> + * 5) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 1 >> + * req: |-----| (bo_offset=n) >> + * >> + * 0 b 1 a' 2 >> + * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 6) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 a 1 >> + * req: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 7) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 b 2 >> + * req: |-----| (bo_offset=m) >> + * >> + * 0 a 1 b 2 >> + * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + * 8) Existent mapping is kept. >> + * ---------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 a 2 >> + * req: |-----| (bo_offset=n+1) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * >> + * 9) Existent mapping is split. >> + * ----------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 b 3 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 a 1 b 3 >> + * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) >> + * >> + * >> + * 10) Existent mapping is merged. >> + * ------------------------------- >> + * >> + * 0 a 2 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 1 a 3 >> + * req: |-----------| (bo_offset=n+1) >> + * >> + * 0 a 3 >> + * new: |-----------------| (bo_offset=n) >> + * >> + * >> + * 11) Existent mapping is split. >> + * ------------------------------ >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * 1 b 2 >> + * req: |-----| (bo_offset=m) >> + * >> + * 0 a 1 b 2 a' 3 >> + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) >> + * >> + * >> + * 12) Existent mapping is kept. >> + * ----------------------------- >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * 1 a 2 >> + * req: |-----| (bo_offset=n+1) >> + * >> + * 0 a 3 >> + * old: |-----------------| (bo_offset=n) >> + * >> + * >> + * 13) Existent mapping is replaced. >> + * --------------------------------- >> + * >> + * 1 a 2 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 2 >> + * req: |-----------| (bo_offset=n) >> + * >> + * 0 a 2 >> + * new: |-----------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 14) Existent mapping is replaced. >> + * --------------------------------- >> + * >> + * 1 a 2 >> + * old: |-----| (bo_offset=n) >> + * >> + * 0 a 3 >> + * req: |----------------| (bo_offset=n) >> + * >> + * 0 a 3 >> + * new: |----------------| (bo_offset=n) >> + * >> + * Note: We expect to see the same result for a request with a different bo >> + * and/or non-contiguous bo_offset. >> + * >> + * >> + * 15) Existent mapping is split. >> + * ------------------------------ >> + * >> + * 1 a 3 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 0 b 2 >> + * req: |-----------| (bo_offset=m) >> + * >> + * 0 b 2 a' 3 >> + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) >> + * >> + * >> + * 16) Existent mappings are merged. >> + * --------------------------------- >> + * >> + * 0 a 1 >> + * old: |-----------| (bo_offset=n) >> + * >> + * 2 a 3 >> + * old': |-----------| (bo_offset=n+2) >> + * >> + * 1 a 2 >> + * req: |-----------| (bo_offset=n+1) >> + * >> + * a >> + * new: |----------------------------------| (bo_offset=n) >> + */ >> + >> +/** >> + * DOC: Locking >> + * >> + * Generally, the GPU VA manager does not take care of locking itself, it is >> + * the drivers responsibility to take care about locking. Drivers might want to >> + * protect the following operations: inserting, destroying and iterating >> + * &drm_gpuva and &drm_gpuva_region objects as well as generating split and merge >> + * operations. >> + * >> + * The GPU VA manager does take care of the locking of the backing >> + * &drm_gem_object buffers GPU VA lists though, unless the provided functions >> + * documentation claims otherwise. >> + */ >> + >> +/** >> + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager >> + * @mgr: pointer to the &drm_gpuva_manager to initialize >> + * @name: the name of the GPU VA space >> + * @start_offset: the start offset of the GPU VA space >> + * @range: the size of the GPU VA space >> + * @reserve_offset: the start of the kernel reserved GPU VA area >> + * @reserve_range: the size of the kernel reserved GPU VA area >> + * >> + * The &drm_gpuva_manager must be initialized with this function before use. >> + * >> + * Note that @mgr must be cleared to 0 before calling this function. The given >> + * &name is expected to be managed by the surrounding driver structures. >> + */ >> +void >> +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >> + const char *name, >> + u64 start_offset, u64 range, >> + u64 reserve_offset, u64 reserve_range) >> +{ >> + drm_mm_init(&mgr->va_mm, start_offset, range); >> + drm_mm_init(&mgr->region_mm, start_offset, range); >> + >> + mgr->mm_start = start_offset; >> + mgr->mm_range = range; >> + >> + mgr->name = name ? name : "unknown"; >> + >> + memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); >> + mgr->kernel_alloc_node.start = reserve_offset; >> + mgr->kernel_alloc_node.size = reserve_range; >> + drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_manager_init); >> + >> +/** >> + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager >> + * @mgr: pointer to the &drm_gpuva_manager to clean up >> + * >> + * Note that it is a bug to call this function on a manager that still >> + * holds GPU VA mappings. >> + */ >> +void >> +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) >> +{ >> + mgr->name = NULL; >> + drm_mm_remove_node(&mgr->kernel_alloc_node); >> + drm_mm_takedown(&mgr->va_mm); >> + drm_mm_takedown(&mgr->region_mm); >> +} >> +EXPORT_SYMBOL(drm_gpuva_manager_destroy); >> + >> +static struct drm_gpuva_region * >> +drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >> +{ >> + struct drm_gpuva_region *reg; >> + >> + /* Find the VA region the requested range is strictly enclosed by. */ >> + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { >> + if (reg->node.start <= addr && >> + reg->node.start + reg->node.size >= addr + range && >> + ®->node != &mgr->kernel_alloc_node) >> + return reg; >> + } >> + >> + return NULL; >> +} >> + >> +static bool >> +drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) >> +{ >> + return !!drm_gpuva_in_region(mgr, addr, range); >> +} >> + >> +/** >> + * drm_gpuva_insert - insert a &drm_gpuva >> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >> + * @va: the &drm_gpuva to insert >> + * @addr: the start address of the GPU VA >> + * @range: the range of the GPU VA >> + * >> + * Insert a &drm_gpuva with a given address and range into a >> + * &drm_gpuva_manager. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + * >> + * Returns: 0 on success, negative error code on failure. >> + */ >> +int >> +drm_gpuva_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva *va, >> + u64 addr, u64 range) >> +{ >> + struct drm_gpuva_region *reg; >> + int ret; >> + >> + if (!va->gem.obj) >> + return -EINVAL; >> + >> + reg = drm_gpuva_in_region(mgr, addr, range); >> + if (!reg) >> + return -EINVAL; >> + >> + ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, >> + range, 0, >> + 0, addr, >> + addr + range, >> + DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); >> + if (ret) >> + return ret; >> + >> + va->mgr = mgr; >> + va->region = reg; >> + >> + return 0; >> +} >> +EXPORT_SYMBOL(drm_gpuva_insert); >> + >> +/** >> + * drm_gpuva_link_locked - link a &drm_gpuva >> + * @va: the &drm_gpuva to link >> + * >> + * This adds the given &va to the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_link_locked(struct drm_gpuva *va) >> +{ >> + lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> + list_add_tail(&va->head, &va->gem.obj->gpuva.list); >> +} >> +EXPORT_SYMBOL(drm_gpuva_link_locked); >> + >> +/** >> + * drm_gpuva_link_unlocked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This adds the given &va to the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_link_unlocked(struct drm_gpuva *va) >> +{ >> + drm_gem_gpuva_lock(va->gem.obj); >> + drm_gpuva_link_locked(va); >> + drm_gem_gpuva_unlock(va->gem.obj); >> +} >> +EXPORT_SYMBOL(drm_gpuva_link_unlocked); >> + >> +/** >> + * drm_gpuva_unlink_locked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This removes the given &va from the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_unlink_locked(struct drm_gpuva *va) >> +{ >> + lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> + list_del_init(&va->head); >> +} >> +EXPORT_SYMBOL(drm_gpuva_unlink_locked); >> + >> +/** >> + * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva >> + * @va: the &drm_gpuva to unlink >> + * >> + * This removes the given &va from the GPU VA list of the &drm_gem_object it is >> + * associated with. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_unlink_unlocked(struct drm_gpuva *va) >> +{ >> + drm_gem_gpuva_lock(va->gem.obj); >> + drm_gpuva_unlink_locked(va); >> + drm_gem_gpuva_unlock(va->gem.obj); >> +} >> +EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); >> + >> +/** >> + * drm_gpuva_destroy_locked - destroy a &drm_gpuva >> + * @va: the &drm_gpuva to destroy >> + * >> + * This removes the given &va from GPU VA list of the &drm_gem_object it is >> + * associated with and removes it from the underlaying range allocator. >> + * >> + * The function assumes the caller already holds the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_destroy_locked(struct drm_gpuva *va) >> +{ >> + lockdep_assert_held(&va->gem.obj->gpuva.mutex); >> + >> + list_del(&va->head); >> + drm_mm_remove_node(&va->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_destroy_locked); >> + >> +/** >> + * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva >> + * @va: the &drm_gpuva to destroy >> + * >> + * This removes the given &va from GPU VA list of the &drm_gem_object it is >> + * associated with and removes it from the underlaying range allocator. >> + * >> + * The function assumes the caller does not hold the &drm_gem_object's >> + * GPU VA list mutex. >> + */ >> +void >> +drm_gpuva_destroy_unlocked(struct drm_gpuva *va) >> +{ >> + drm_gem_gpuva_lock(va->gem.obj); >> + list_del(&va->head); >> + drm_gem_gpuva_unlock(va->gem.obj); >> + >> + drm_mm_remove_node(&va->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); >> + >> +/** >> + * drm_gpuva_find - find a &drm_gpuva >> + * @mgr: the &drm_gpuva_manager to search in >> + * @addr: the &drm_gpuvas address >> + * @range: the &drm_gpuvas range >> + * >> + * Returns: the &drm_gpuva at a given &addr and with a given &range >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range) >> +{ >> + struct drm_gpuva *va; >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { >> + if (va->node.start == addr && >> + va->node.size == range) >> + return va; >> + } >> + >> + return NULL; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find); >> + >> +/** >> + * drm_gpuva_find_prev - find the &drm_gpuva before the given address >> + * @mgr: the &drm_gpuva_manager to search in >> + * @start: the given GPU VA's start address >> + * >> + * Find the adjacent &drm_gpuva before the GPU VA with given &start address. >> + * >> + * Note that if there is any free space between the GPU VA mappings no mapping >> + * is returned. >> + * >> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) >> +{ >> + struct drm_mm_node *node; >> + >> + if (start <= mgr->mm_start || >> + start > (mgr->mm_start + mgr->mm_range)) >> + return NULL; >> + >> + node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); >> + if (node == &mgr->va_mm.head_node) >> + return NULL; >> + >> + return (struct drm_gpuva *)node; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find_prev); >> + >> +/** >> + * drm_gpuva_find_next - find the &drm_gpuva after the given address >> + * @mgr: the &drm_gpuva_manager to search in >> + * @end: the given GPU VA's end address >> + * >> + * Find the adjacent &drm_gpuva after the GPU VA with given &end address. >> + * >> + * Note that if there is any free space between the GPU VA mappings no mapping >> + * is returned. >> + * >> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found >> + */ >> +struct drm_gpuva * >> +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) >> +{ >> + struct drm_mm_node *node; >> + >> + if (end < mgr->mm_start || >> + end >= (mgr->mm_start + mgr->mm_range)) >> + return NULL; >> + >> + node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); >> + if (node == &mgr->va_mm.head_node) >> + return NULL; >> + >> + return (struct drm_gpuva *)node; >> +} >> +EXPORT_SYMBOL(drm_gpuva_find_next); >> + >> +/** >> + * drm_gpuva_region_insert - insert a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in >> + * @reg: the &drm_gpuva_region to insert >> + * @addr: the start address of the GPU VA >> + * @range: the range of the GPU VA >> + * >> + * Insert a &drm_gpuva_region with a given address and range into a >> + * &drm_gpuva_manager. >> + * >> + * Returns: 0 on success, negative error code on failure. >> + */ >> +int >> +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg, >> + u64 addr, u64 range) >> +{ >> + int ret; >> + >> + ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, >> + range, 0, >> + 0, addr, >> + addr + range, >> + DRM_MM_INSERT_LOW| >> + DRM_MM_INSERT_ONCE); >> + if (ret) >> + return ret; >> + >> + reg->mgr = mgr; >> + >> + return 0; >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_insert); >> + >> +/** >> + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager holding the region >> + * @reg: the &drm_gpuva to destroy >> + * >> + * This removes the given ® from the underlaying range allocator. >> + */ >> +void >> +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg) >> +{ >> + struct drm_gpuva *va; >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, >> + reg->node.start, >> + reg->node.size) { >> + WARN(1, "GPU VA region must be empty on destroy.\n"); >> + return; >> + } >> + >> + if (®->node == &mgr->kernel_alloc_node) { >> + WARN(1, "Can't destroy kernel reserved region.\n"); >> + return; >> + } >> + >> + drm_mm_remove_node(®->node); >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_destroy); >> + >> +/** >> + * drm_gpuva_region_find - find a &drm_gpuva_region >> + * @mgr: the &drm_gpuva_manager to search in >> + * @addr: the &drm_gpuva_regions address >> + * @range: the &drm_gpuva_regions range >> + * >> + * Returns: the &drm_gpuva_region at a given &addr and with a given &range >> + */ >> +struct drm_gpuva_region * >> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range) >> +{ >> + struct drm_gpuva_region *reg; >> + >> + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) >> + if (reg->node.start == addr && >> + reg->node.size == range) >> + return reg; >> + >> + return NULL; >> +} >> +EXPORT_SYMBOL(drm_gpuva_region_find); >> + >> +static int >> +gpuva_op_map_new(struct drm_gpuva_op **pop, >> + u64 addr, u64 range, >> + struct drm_gem_object *obj, u64 offset) >> +{ >> + struct drm_gpuva_op *op; >> + >> + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) >> + return -ENOMEM; >> + >> + op->op = DRM_GPUVA_OP_MAP; >> + op->map.va.addr = addr; >> + op->map.va.range = range; >> + op->map.gem.obj = obj; >> + op->map.gem.offset = offset; >> + >> + return 0; >> +} >> + >> +static int >> +gpuva_op_remap_new(struct drm_gpuva_op **pop, >> + struct drm_gpuva_op_map *prev, >> + struct drm_gpuva_op_map *next, >> + struct drm_gpuva_op_unmap *unmap) >> +{ >> + struct drm_gpuva_op *op; >> + struct drm_gpuva_op_remap *r; >> + >> + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) >> + return -ENOMEM; >> + >> + op->op = DRM_GPUVA_OP_REMAP; >> + r = &op->remap; >> + >> + if (prev) { >> + r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); >> + if (!r->prev) >> + goto err_free_op; >> + } >> + >> + if (next) { >> + r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); >> + if (!r->next) >> + goto err_free_prev; >> + } >> + >> + r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); >> + if (!r->unmap) >> + goto err_free_next; >> + >> + return 0; >> + >> +err_free_next: >> + if (next) >> + kfree(r->next); >> +err_free_prev: >> + if (prev) >> + kfree(r->prev); >> +err_free_op: >> + kfree(op); >> + *pop = NULL; >> + >> + return -ENOMEM; >> +} >> + >> +static int >> +gpuva_op_unmap_new(struct drm_gpuva_op **pop, >> + struct drm_gpuva *va, bool merge) >> +{ >> + struct drm_gpuva_op *op; >> + >> + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) >> + return -ENOMEM; >> + >> + op->op = DRM_GPUVA_OP_UNMAP; >> + op->unmap.va = va; >> + op->unmap.keep = merge; >> + >> + return 0; >> +} >> + >> +#define op_map_new_to_list(_ops, _addr, _range, \ >> + _obj, _offset) \ >> +do { \ >> + struct drm_gpuva_op *op; \ >> + \ >> + ret = gpuva_op_map_new(&op, _addr, _range, \ >> + _obj, _offset); \ >> + if (ret) \ >> + goto err_free_ops; \ >> + \ >> + list_add_tail(&op->entry, _ops); \ >> +} while (0) >> + >> +#define op_remap_new_to_list(_ops, _prev, _next, \ >> + _unmap) \ >> +do { \ >> + struct drm_gpuva_op *op; \ >> + \ >> + ret = gpuva_op_remap_new(&op, _prev, _next, \ >> + _unmap); \ >> + if (ret) \ >> + goto err_free_ops; \ >> + \ >> + list_add_tail(&op->entry, _ops); \ >> +} while (0) >> + >> +#define op_unmap_new_to_list(_ops, _gpuva, _merge) \ >> +do { \ >> + struct drm_gpuva_op *op; \ >> + \ >> + ret = gpuva_op_unmap_new(&op, _gpuva, _merge); \ >> + if (ret) \ >> + goto err_free_ops; \ >> + \ >> + list_add_tail(&op->entry, _ops); \ >> +} while (0) >> + >> +/** >> + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split and merge >> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >> + * @req_addr: the start address of the new mapping >> + * @req_range: the range of the new mapping >> + * @req_obj: the &drm_gem_object to map >> + * @req_offset: the offset within the &drm_gem_object >> + * >> + * This function creates a list of operations to perform splitting and merging >> + * of existent mapping(s) with the newly requested one. >> + * >> + * The list can be iterated with &drm_gpuva_for_each_op and must be processed >> + * in the given order. It can contain map, unmap and remap operations, but it >> + * also can be empty if no operation is required, e.g. if the requested mapping >> + * already exists is the exact same way. >> + * >> + * There can be an arbitrary amount of unmap operations, a maximum of two remap >> + * operations and a single map operation. The latter one, if existent, >> + * represents the original map operation requested by the caller. Please note >> + * that the map operation might has been modified, e.g. if it was >> + * merged with an existent mapping. >> + * >> + * Note that before calling this function again with another mapping request it >> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. >> + * The previously obtained operations must be either processed or abandoned. >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * After the caller finished processing the returned &drm_gpuva_ops, they must >> + * be freed with &drm_gpuva_ops_free. >> + * >> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure >> + */ >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >> + u64 req_addr, u64 req_range, >> + struct drm_gem_object *req_obj, u64 req_offset) >> +{ >> + struct drm_gpuva_ops *ops; >> + struct drm_gpuva *va, *prev = NULL; >> + u64 req_end = req_addr + req_range; >> + bool skip_pmerge = false, skip_nmerge = false; >> + int ret; >> + >> + if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) >> + return ERR_PTR(-EINVAL); >> + >> + ops = kzalloc(sizeof(*ops), GFP_KERNEL); >> + if (!ops) >> + return ERR_PTR(-ENOMEM); >> + >> + INIT_LIST_HEAD(&ops->list); >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 end = addr + range; >> + >> + /* Generally, we want to skip merging with potential mappings >> + * left and right of the requested one when we found a >> + * collision, since merging happens in this loop already. >> + * >> + * However, there is one exception when the requested mapping >> + * spans into a free VM area. If this is the case we might >> + * still hit the boundary of another mapping before and/or >> + * after the free VM area. >> + */ >> + skip_pmerge = true; >> + skip_nmerge = true; >> + >> + if (addr == req_addr) { >> + bool merge = obj == req_obj && >> + offset == req_offset; >> + if (end == req_end) { >> + if (merge) >> + goto done; >> + >> + op_unmap_new_to_list(&ops->list, va, false); >> + break; >> + } >> + >> + if (end < req_end) { >> + skip_nmerge = false; >> + op_unmap_new_to_list(&ops->list, va, merge); >> + goto next; >> + } >> + >> + if (end > req_end) { >> + struct drm_gpuva_op_map n = { >> + .va.addr = req_end, >> + .va.range = range - req_range, >> + .gem.obj = obj, >> + .gem.offset = offset + req_range, >> + }; >> + struct drm_gpuva_op_unmap u = { .va = va }; >> + >> + if (merge) >> + goto done; >> + >> + op_remap_new_to_list(&ops->list, NULL, &n, &u); >> + break; >> + } >> + } else if (addr < req_addr) { >> + u64 ls_range = req_addr - addr; >> + struct drm_gpuva_op_map p = { >> + .va.addr = addr, >> + .va.range = ls_range, >> + .gem.obj = obj, >> + .gem.offset = offset, >> + }; >> + struct drm_gpuva_op_unmap u = { .va = va }; >> + bool merge = obj == req_obj && >> + offset + ls_range == req_offset; >> + >> + if (end == req_end) { >> + if (merge) >> + goto done; >> + >> + op_remap_new_to_list(&ops->list, &p, NULL, &u); >> + break; >> + } >> + >> + if (end < req_end) { >> + u64 new_addr = addr; >> + u64 new_range = req_range + ls_range; >> + u64 new_offset = offset; >> + >> + /* We validated that the requested mapping is >> + * within a single VA region already. >> + * Since it overlaps the current mapping (which >> + * can't cross a VA region boundary) we can be >> + * sure that we're still within the boundaries >> + * of the same VA region after merging. >> + */ >> + if (merge) { >> + req_offset = new_offset; >> + req_addr = new_addr; >> + req_range = new_range; >> + op_unmap_new_to_list(&ops->list, va, true); >> + goto next; >> + } >> + >> + op_remap_new_to_list(&ops->list, &p, NULL, &u); >> + goto next; >> + } >> + >> + if (end > req_end) { >> + struct drm_gpuva_op_map n = { >> + .va.addr = req_end, >> + .va.range = end - req_end, >> + .gem.obj = obj, >> + .gem.offset = offset + ls_range + >> + req_range, >> + }; >> + >> + if (merge) >> + goto done; >> + >> + op_remap_new_to_list(&ops->list, &p, &n, &u); >> + break; >> + } >> + } else if (addr > req_addr) { >> + bool merge = obj == req_obj && >> + offset == req_offset + >> + (addr - req_addr); >> + if (!prev) >> + skip_pmerge = false; >> + >> + if (end == req_end) { >> + op_unmap_new_to_list(&ops->list, va, merge); >> + break; >> + } >> + >> + if (end < req_end) { >> + skip_nmerge = false; >> + op_unmap_new_to_list(&ops->list, va, merge); >> + goto next; >> + } >> + >> + if (end > req_end) { >> + struct drm_gpuva_op_map n = { >> + .va.addr = req_end, >> + .va.range = end - req_end, >> + .gem.obj = obj, >> + .gem.offset = offset + req_end - addr, >> + }; >> + struct drm_gpuva_op_unmap u = { .va = va }; >> + u64 new_end = end; >> + u64 new_range = new_end - req_addr; >> + >> + /* We validated that the requested mapping is >> + * within a single VA region already. >> + * Since it overlaps the current mapping (which >> + * can't cross a VA region boundary) we can be >> + * sure that we're still within the boundaries >> + * of the same VA region after merging. >> + */ >> + if (merge) { >> + req_end = new_end; >> + req_range = new_range; >> + op_unmap_new_to_list(&ops->list, va, true); >> + break; >> + } >> + >> + op_remap_new_to_list(&ops->list, NULL, &n, &u); >> + break; >> + } >> + } >> +next: >> + prev = va; >> + } >> + >> + va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); >> + if (va) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 new_offset = offset; >> + u64 new_addr = addr; >> + u64 new_range = req_range + range; >> + bool merge = obj == req_obj && >> + offset + range == req_offset; >> + >> + /* Don't merge over VA region boundaries. */ >> + merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); >> + if (merge) { >> + op_unmap_new_to_list(&ops->list, va, true); >> + >> + req_offset = new_offset; >> + req_addr = new_addr; >> + req_range = new_range; >> + } >> + } >> + >> + va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); >> + if (va) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 end = addr + range; >> + u64 new_range = req_range + range; >> + u64 new_end = end; >> + bool merge = obj == req_obj && >> + offset == req_offset + req_range; >> + >> + /* Don't merge over VA region boundaries. */ >> + merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); >> + if (merge) { >> + op_unmap_new_to_list(&ops->list, va, true); >> + >> + req_range = new_range; >> + req_end = new_end; >> + } >> + } >> + >> + op_map_new_to_list(&ops->list, >> + req_addr, req_range, >> + req_obj, req_offset); >> + >> +done: >> + return ops; >> + >> +err_free_ops: >> + drm_gpuva_ops_free(ops); >> + return ERR_PTR(ret); >> +} >> +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); >> + >> +#undef op_map_new_to_list >> +#undef op_remap_new_to_list >> +#undef op_unmap_new_to_list >> + >> +/** >> + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to split on unmap >> + * @mgr: the &drm_gpuva_manager representing the GPU VA space >> + * @req_addr: the start address of the range to unmap >> + * @req_range: the range of the mappings to unmap >> + * >> + * This function creates a list of operations to perform unmapping and, if >> + * required, splitting of the mappings overlapping the unmap range. >> + * >> + * The list can be iterated with &drm_gpuva_for_each_op and must be processed >> + * in the given order. It can contain unmap and remap operations, depending on >> + * whether there are actual overlapping mappings to split. >> + * >> + * There can be an arbitrary amount of unmap operations and a maximum of two >> + * remap operations. >> + * >> + * Note that before calling this function again with another range to unmap it >> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. >> + * The previously obtained operations must be processed or abandoned. >> + * To update the &drm_gpuva_manager's view of the GPU VA space >> + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or >> + * drm_gpuva_destroy_unlocked() should be used. >> + * >> + * After the caller finished processing the returned &drm_gpuva_ops, they must >> + * be freed with &drm_gpuva_ops_free. >> + * >> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure >> + */ >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >> + u64 req_addr, u64 req_range) >> +{ >> + struct drm_gpuva_ops *ops; >> + struct drm_gpuva_op *op; >> + struct drm_gpuva_op_remap *r; >> + struct drm_gpuva *va; >> + u64 req_end = req_addr + req_range; >> + int ret; >> + >> + ops = kzalloc(sizeof(*ops), GFP_KERNEL); >> + if (!ops) >> + return ERR_PTR(-ENOMEM); >> + >> + INIT_LIST_HEAD(&ops->list); >> + >> + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { >> + struct drm_gem_object *obj = va->gem.obj; >> + u64 offset = va->gem.offset; >> + u64 addr = va->node.start; >> + u64 range = va->node.size; >> + u64 end = addr + range; >> + >> + op = kzalloc(sizeof(*op), GFP_KERNEL); >> + if (!op) { >> + ret = -ENOMEM; >> + goto err_free_ops; >> + } >> + >> + r = &op->remap; >> + >> + if (addr < req_addr) { >> + r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); >> + if (!r->prev) { >> + ret = -ENOMEM; >> + goto err_free_op; >> + } >> + >> + r->prev->va.addr = addr; >> + r->prev->va.range = req_addr - addr; >> + r->prev->gem.obj = obj; >> + r->prev->gem.offset = offset; >> + } >> + >> + if (end > req_end) { >> + r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); >> + if (!r->next) { >> + ret = -ENOMEM; >> + goto err_free_prev; >> + } >> + >> + r->next->va.addr = req_end; >> + r->next->va.range = end - req_end; >> + r->next->gem.obj = obj; >> + r->next->gem.offset = offset + (req_end - addr); >> + } >> + >> + if (op->remap.prev || op->remap.next) { >> + op->op = DRM_GPUVA_OP_REMAP; >> + r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); >> + if (!r->unmap) { >> + ret = -ENOMEM; >> + goto err_free_next; >> + } >> + >> + r->unmap->va = va; >> + } else { >> + op->op = DRM_GPUVA_OP_UNMAP; >> + op->unmap.va = va; >> + } >> + >> + list_add_tail(&op->entry, &ops->list); >> + } >> + >> + return ops; >> + >> +err_free_next: >> + if (r->next) >> + kfree(r->next); >> +err_free_prev: >> + if (r->prev) >> + kfree(r->prev); >> +err_free_op: >> + kfree(op); >> +err_free_ops: >> + drm_gpuva_ops_free(ops); >> + return ERR_PTR(ret); >> +} >> +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); >> + >> +/** >> + * drm_gpuva_ops_free - free the given &drm_gpuva_ops >> + * @ops: the &drm_gpuva_ops to free >> + * >> + * Frees the given &drm_gpuva_ops structure including all the ops associated >> + * with it. >> + */ >> +void >> +drm_gpuva_ops_free(struct drm_gpuva_ops *ops) >> +{ >> + struct drm_gpuva_op *op, *next; >> + >> + drm_gpuva_for_each_op_safe(op, next, ops) { >> + list_del(&op->entry); >> + if (op->op == DRM_GPUVA_OP_REMAP) { >> + if (op->remap.prev) >> + kfree(op->remap.prev); >> + >> + if (op->remap.next) >> + kfree(op->remap.next); >> + >> + kfree(op->remap.unmap); >> + } >> + kfree(op); >> + } >> + >> + kfree(ops); >> +} >> +EXPORT_SYMBOL(drm_gpuva_ops_free); >> diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h >> index d7c521e8860f..6feacd93aca6 100644 >> --- a/include/drm/drm_drv.h >> +++ b/include/drm/drm_drv.h >> @@ -104,6 +104,12 @@ enum drm_driver_feature { >> * acceleration should be handled by two drivers that are connected using auxiliary bus. >> */ >> DRIVER_COMPUTE_ACCEL = BIT(7), >> + /** >> + * @DRIVER_GEM_GPUVA: >> + * >> + * Driver supports user defined GPU VA bindings for GEM objects. >> + */ >> + DRIVER_GEM_GPUVA = BIT(8), >> >> /* IMPORTANT: Below are all the legacy flags, add new ones above. */ >> >> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h >> index 772a4adf5287..4a3679034966 100644 >> --- a/include/drm/drm_gem.h >> +++ b/include/drm/drm_gem.h >> @@ -36,6 +36,8 @@ >> >> #include <linux/kref.h> >> #include <linux/dma-resv.h> >> +#include <linux/list.h> >> +#include <linux/mutex.h> >> >> #include <drm/drm_vma_manager.h> >> >> @@ -337,6 +339,17 @@ struct drm_gem_object { >> */ >> struct dma_resv _resv; >> >> + /** >> + * @gpuva: >> + * >> + * Provides the list and list mutex of GPU VAs attached to this >> + * GEM object. >> + */ >> + struct { >> + struct list_head list; >> + struct mutex mutex; >> + } gpuva; >> + >> /** >> * @funcs: >> * >> @@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru *lru, struct drm_gem_object *obj); >> unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned nr_to_scan, >> bool (*shrink)(struct drm_gem_object *obj)); >> >> +/** >> + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object >> + * @obj: the &drm_gem_object >> + * >> + * This initializes the &drm_gem_object's &drm_gpuva list and the mutex >> + * protecting it. >> + * >> + * Calling this function is only necessary for drivers intending to support the >> + * &drm_driver_feature DRIVER_GEM_GPUVA. >> + */ >> +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) >> +{ >> + INIT_LIST_HEAD(&obj->gpuva.list); >> + mutex_init(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex >> + * @obj: the &drm_gem_object >> + * >> + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. >> + */ >> +static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) >> +{ >> + mutex_lock(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex >> + * @obj: the &drm_gem_object >> + * >> + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. >> + */ >> +static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) >> +{ >> + mutex_unlock(&obj->gpuva.mutex); >> +} >> + >> +/** >> + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gem_for_each_gpuva(entry, obj) \ >> + list_for_each_entry(entry, &obj->gpuva.list, head) >> + >> +/** >> + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list of gpuvas >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @next: &next &drm_gpuva to store the next step >> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence >> + * it is save against removal of elements. >> + */ >> +#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ >> + list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) >> + >> #endif /* __DRM_GEM_H__ */ >> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h >> new file mode 100644 >> index 000000000000..adeb0c916e91 >> --- /dev/null >> +++ b/include/drm/drm_gpuva_mgr.h >> @@ -0,0 +1,527 @@ >> +// SPDX-License-Identifier: GPL-2.0 >> + >> +#ifndef __DRM_GPUVA_MGR_H__ >> +#define __DRM_GPUVA_MGR_H__ >> + >> +/* >> + * Copyright (c) 2022 Red Hat. >> + * >> + * Permission is hereby granted, free of charge, to any person obtaining a >> + * copy of this software and associated documentation files (the "Software"), >> + * to deal in the Software without restriction, including without limitation >> + * the rights to use, copy, modify, merge, publish, distribute, sublicense, >> + * and/or sell copies of the Software, and to permit persons to whom the >> + * Software is furnished to do so, subject to the following conditions: >> + * >> + * The above copyright notice and this permission notice shall be included in >> + * all copies or substantial portions of the Software. >> + * >> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR >> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, >> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL >> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR >> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, >> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR >> + * OTHER DEALINGS IN THE SOFTWARE. >> + */ >> + >> +#include <drm/drm_mm.h> >> +#include <linux/mm.h> >> +#include <linux/rbtree.h> >> +#include <linux/spinlock.h> >> +#include <linux/types.h> >> + >> +struct drm_gpuva_region; >> +struct drm_gpuva; >> +struct drm_gpuva_ops; >> + >> +/** >> + * struct drm_gpuva_manager - DRM GPU VA Manager >> + * >> + * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using >> + * the &drm_mm range allocator. Typically, this structure is embedded in bigger >> + * driver structures. >> + * >> + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or >> + * pages. >> + * >> + * There should be one manager instance per GPU virtual address space. >> + */ >> +struct drm_gpuva_manager { >> + /** >> + * @name: the name of the DRM GPU VA space >> + */ >> + const char *name; >> + >> + /** >> + * @mm_start: start of the VA space >> + */ >> + u64 mm_start; >> + >> + /** >> + * @mm_range: length of the VA space >> + */ >> + u64 mm_range; >> + >> + /** >> + * @region_mm: the &drm_mm range allocator to track GPU VA regions >> + */ >> + struct drm_mm region_mm; >> + >> + /** >> + * @va_mm: the &drm_mm range allocator to track GPU VA mappings >> + */ >> + struct drm_mm va_mm; >> + >> + /** >> + * @kernel_alloc_node: >> + * >> + * &drm_mm_node representing the address space cutout reserved for >> + * the kernel >> + */ >> + struct drm_mm_node kernel_alloc_node; >> +}; >> + >> +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, >> + const char *name, >> + u64 start_offset, u64 range, >> + u64 reserve_offset, u64 reserve_range); >> +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); >> + >> +/** >> + * struct drm_gpuva_region - structure to track a portion of GPU VA space >> + * >> + * This structure represents a portion of a GPUs VA space and is associated >> + * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >> + * >> + * GPU VA mappings, represented by &drm_gpuva objects, are restricted to be >> + * placed within a &drm_gpuva_region. >> + */ >> +struct drm_gpuva_region { >> + /** >> + * @node: the &drm_mm_node to track the GPU VA region >> + */ >> + struct drm_mm_node node; >> + >> + /** >> + * @mgr: the &drm_gpuva_manager this object is associated with >> + */ >> + struct drm_gpuva_manager *mgr; >> + >> + /** >> + * @sparse: indicates whether this region is sparse >> + */ >> + bool sparse; >> +}; >> + >> +struct drm_gpuva_region * >> +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range); >> +int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg, >> + u64 addr, u64 range); >> +void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva_region *reg); >> + >> +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, >> + struct drm_gpuva *va, >> + u64 addr, u64 range); >> +/** >> + * drm_gpuva_for_each_region_in_range - iternator to walk over a range of nodes >> + * @node__: &drm_gpuva_region structure to assign to in each iteration step >> + * @gpuva__: &drm_gpuva_manager structure to walk >> + * @start__: starting offset, the first node will overlap this >> + * @end__: ending offset, the last node will start before this (but may overlap) >> + * >> + * This iterator walks over all nodes in the range allocator that lie >> + * between @start and @end. It is implemented similarly to list_for_each(), >> + * but is using &drm_mm's internal interval tree to accelerate the search for >> + * the starting node, and hence isn't safe against removal of elements. It >> + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. >> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >> + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the >> + * backing &drm_mm, and may even continue indefinitely. >> + */ >> +#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, end__) \ >> + for (node__ = (struct drm_gpuva_region *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ >> + (start__), (end__)-1); \ >> + node__->node.start < (end__); \ >> + node__ = (struct drm_gpuva_region *)list_next_entry(&node__->node, node_list)) >> + >> +/** >> + * drm_gpuva_for_each_region - iternator to walk over a range of nodes >> + * @entry: &drm_gpuva_region structure to assign to in each iteration step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva_region structures associated with the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gpuva_for_each_region(entry, gpuva) \ >> + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >> + >> +/** >> + * drm_gpuva_for_each_region_safe - iternator to safely walk over a range of >> + * nodes >> + * @entry: &drm_gpuva_region structure to assign to in each iteration step >> + * @next: &next &drm_gpuva_region to store the next step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva_region structures associated with the >> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save >> + * against removal of elements. >> + */ >> +#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ >> + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) >> + >> + >> +/** >> + * enum drm_gpuva_flags - flags for struct drm_gpuva >> + */ >> +enum drm_gpuva_flags { >> + /** >> + * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is swapped >> + */ >> + DRM_GPUVA_SWAPPED = (1 << 0), >> +}; >> + >> +/** >> + * struct drm_gpuva - structure to track a GPU VA mapping >> + * >> + * This structure represents a GPU VA mapping and is associated with a >> + * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. >> + * >> + * Typically, this structure is embedded in bigger driver structures. >> + */ >> +struct drm_gpuva { >> + /** >> + * @node: the &drm_mm_node to track the GPU VA mapping >> + */ >> + struct drm_mm_node node; >> + >> + /** >> + * @mgr: the &drm_gpuva_manager this object is associated with >> + */ >> + struct drm_gpuva_manager *mgr; >> + >> + /** >> + * @region: the &drm_gpuva_region the &drm_gpuva is mapped in >> + */ >> + struct drm_gpuva_region *region; >> + >> + /** >> + * @head: the &list_head to attach this object to a &drm_gem_object >> + */ >> + struct list_head head; >> + >> + /** >> + * @flags: the &drm_gpuva_flags for this mapping >> + */ >> + enum drm_gpuva_flags flags; >> + >> + /** >> + * @gem: structure containing the &drm_gem_object and it's offset >> + */ >> + struct { >> + /** >> + * @offset: the offset within the &drm_gem_object >> + */ >> + u64 offset; >> + >> + /** >> + * @obj: the mapped &drm_gem_object >> + */ >> + struct drm_gem_object *obj; >> + } gem; >> +}; >> + >> +void drm_gpuva_link_locked(struct drm_gpuva *va); >> +void drm_gpuva_link_unlocked(struct drm_gpuva *va); >> +void drm_gpuva_unlink_locked(struct drm_gpuva *va); >> +void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); >> + >> +void drm_gpuva_destroy_locked(struct drm_gpuva *va); >> +void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); >> + >> +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range); >> +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start); >> +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end); >> + >> +/** >> + * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva is swapped >> + * @va: the &drm_gpuva to set the swap flag of >> + * @swap: indicates whether the &drm_gpuva is swapped >> + */ >> +static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) >> +{ >> + if (swap) >> + va->flags |= DRM_GPUVA_SWAPPED; >> + else >> + va->flags &= ~DRM_GPUVA_SWAPPED; >> +} >> + >> +/** >> + * drm_gpuva_swapped - indicates whether the backing BO of this &drm_gpuva >> + * is swapped >> + * @va: the &drm_gpuva to check >> + */ >> +static inline bool drm_gpuva_swapped(struct drm_gpuva *va) >> +{ >> + return va->flags & DRM_GPUVA_SWAPPED; >> +} >> + >> +/** >> + * drm_gpuva_for_each_va_in_range - iternator to walk over a range of nodes >> + * @node__: &drm_gpuva structure to assign to in each iteration step >> + * @gpuva__: &drm_gpuva_manager structure to walk >> + * @start__: starting offset, the first node will overlap this >> + * @end__: ending offset, the last node will start before this (but may overlap) >> + * >> + * This iterator walks over all nodes in the range allocator that lie >> + * between @start and @end. It is implemented similarly to list_for_each(), >> + * but is using &drm_mm's internal interval tree to accelerate the search for >> + * the starting node, and hence isn't safe against removal of elements. It >> + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. >> + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the >> + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the >> + * backing &drm_mm, and may even continue indefinitely. >> + */ >> +#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, end__) \ >> + for (node__ = (struct drm_gpuva *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ >> + (start__), (end__)-1); \ >> + node__->node.start < (end__); \ >> + node__ = (struct drm_gpuva *)list_next_entry(&node__->node, node_list)) >> + >> +/** >> + * drm_gpuva_for_each_va - iternator to walk over a range of nodes >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gpuva_manager. >> + */ >> +#define drm_gpuva_for_each_va(entry, gpuva) \ >> + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >> + >> +/** >> + * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of >> + * nodes >> + * @entry: &drm_gpuva structure to assign to in each iteration step >> + * @next: &next &drm_gpuva to store the next step >> + * @gpuva: &drm_gpuva_manager structure to walk >> + * >> + * This iterator walks over all &drm_gpuva structures associated with the >> + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save >> + * against removal of elements. >> + */ >> +#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ >> + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) >> + >> +/** >> + * enum drm_gpuva_op_type - GPU VA operation type >> + * >> + * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager >> + * can be map, remap or unmap operations. >> + */ >> +enum drm_gpuva_op_type { >> + /** >> + * @DRM_GPUVA_OP_MAP: the map op type >> + */ >> + DRM_GPUVA_OP_MAP, >> + >> + /** >> + * @DRM_GPUVA_OP_REMAP: the remap op type >> + */ >> + DRM_GPUVA_OP_REMAP, >> + >> + /** >> + * @DRM_GPUVA_OP_UNMAP: the unmap op type >> + */ >> + DRM_GPUVA_OP_UNMAP, >> +}; >> + >> +/** >> + * struct drm_gpuva_op_map - GPU VA map operation >> + * >> + * This structure represents a single map operation generated by the >> + * DRM GPU VA manager. >> + */ >> +struct drm_gpuva_op_map { >> + /** >> + * @va: structure containing address and range of a map >> + * operation >> + */ >> + struct { >> + /** >> + * @addr: the base address of the new mapping >> + */ >> + u64 addr; >> + >> + /** >> + * @range: the range of the new mapping >> + */ >> + u64 range; >> + } va; >> + >> + /** >> + * @gem: structure containing the &drm_gem_object and it's offset >> + */ >> + struct { >> + /** >> + * @offset: the offset within the &drm_gem_object >> + */ >> + u64 offset; >> + >> + /** >> + * @obj: the &drm_gem_object to map >> + */ >> + struct drm_gem_object *obj; >> + } gem; >> +}; >> + >> +/** >> + * struct drm_gpuva_op_unmap - GPU VA unmap operation >> + * >> + * This structure represents a single unmap operation generated by the >> + * DRM GPU VA manager. >> + */ >> +struct drm_gpuva_op_unmap { >> + /** >> + * @va: the &drm_gpuva to unmap >> + */ >> + struct drm_gpuva *va; >> + >> + /** >> + * @keep: >> + * >> + * Indicates whether this &drm_gpuva is physically contiguous with the >> + * original mapping request. >> + * >> + * Optionally, if &keep is set, drivers may keep the actual page table >> + * mappings for this &drm_gpuva, adding the missing page table entries >> + * only and update the &drm_gpuva_manager accordingly. >> + */ >> + bool keep; >> +}; >> + >> +/** >> + * struct drm_gpuva_op_remap - GPU VA remap operation >> + * >> + * This represents a single remap operation generated by the DRM GPU VA manager. >> + * >> + * A remap operation is generated when an existing GPU VA mmapping is split up >> + * by inserting a new GPU VA mapping or by partially unmapping existent >> + * mapping(s), hence it consists of a maximum of two map and one unmap >> + * operation. >> + * >> + * The @unmap operation takes care of removing the original existing mapping. >> + * @prev is used to remap the preceding part, @next the subsequent part. >> + * >> + * If either a new mapping's start address is aligned with the start address >> + * of the old mapping or the new mapping's end address is aligned with the >> + * end address of the old mapping, either @prev or @next is NULL. >> + * >> + * Note, the reason for a dedicated remap operation, rather than arbitrary >> + * unmap and map operations, is to give drivers the chance of extracting driver >> + * specific data for creating the new mappings from the unmap operations's >> + * &drm_gpuva structure which typically is embedded in larger driver specific >> + * structures. >> + */ >> +struct drm_gpuva_op_remap { >> + /** >> + * @prev: the preceding part of a split mapping >> + */ >> + struct drm_gpuva_op_map *prev; >> + >> + /** >> + * @next: the subsequent part of a split mapping >> + */ >> + struct drm_gpuva_op_map *next; >> + >> + /** >> + * @unmap: the unmap operation for the original existing mapping >> + */ >> + struct drm_gpuva_op_unmap *unmap; >> +}; >> + >> +/** >> + * struct drm_gpuva_op - GPU VA operation >> + * >> + * This structure represents a single generic operation, which can be either >> + * map, unmap or remap. >> + * >> + * The particular type of the operation is defined by @op. >> + */ >> +struct drm_gpuva_op { >> + /** >> + * @entry: >> + * >> + * The &list_head used to distribute instances of this struct within >> + * &drm_gpuva_ops. >> + */ >> + struct list_head entry; >> + >> + /** >> + * @op: the type of the operation >> + */ >> + enum drm_gpuva_op_type op; >> + >> + union { >> + /** >> + * @map: the map operation >> + */ >> + struct drm_gpuva_op_map map; >> + >> + /** >> + * @unmap: the unmap operation >> + */ >> + struct drm_gpuva_op_unmap unmap; >> + >> + /** >> + * @remap: the remap operation >> + */ >> + struct drm_gpuva_op_remap remap; >> + }; >> +}; >> + >> +/** >> + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op >> + */ >> +struct drm_gpuva_ops { >> + /** >> + * @list: the &list_head >> + */ >> + struct list_head list; >> +}; >> + >> +/** >> + * drm_gpuva_for_each_op - iterator to walk over all ops >> + * @op: &drm_gpuva_op to assign in each iteration step >> + * @ops: &drm_gpuva_ops to walk >> + * >> + * This iterator walks over all ops within a given list of operations. >> + */ >> +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry) >> + >> +/** >> + * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops >> + * @op: &drm_gpuva_op to assign in each iteration step >> + * @next: &next &drm_gpuva_op to store the next step >> + * @ops: &drm_gpuva_ops to walk >> + * >> + * This iterator walks over all ops within a given list of operations. It is >> + * implemented with list_for_each_safe(), so save against removal of elements. >> + */ >> +#define drm_gpuva_for_each_op_safe(op, next, ops) \ >> + list_for_each_entry_safe(op, next, &(ops)->list, entry) >> + >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range, >> + struct drm_gem_object *obj, u64 offset); >> +struct drm_gpuva_ops * >> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, >> + u64 addr, u64 range); >> +void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); >> + >> +#endif /* __DRM_GPUVA_MGR_H__ */ >> -- >> 2.39.0 >> >
diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst index a52e6f4117d6..c9f120cfe730 100644 --- a/Documentation/gpu/drm-mm.rst +++ b/Documentation/gpu/drm-mm.rst @@ -466,6 +466,37 @@ DRM MM Range Allocator Function References .. kernel-doc:: drivers/gpu/drm/drm_mm.c :export: +DRM GPU VA Manager +================== + +Overview +-------- + +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c + :doc: Overview + +Split and Merge +--------------- + +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c + :doc: Split and Merge + +Locking +------- + +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c + :doc: Locking + + +DRM GPU VA Manager Function References +-------------------------------------- + +.. kernel-doc:: include/drm/drm_gpuva_mgr.h + :internal: + +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c + :export: + DRM Buddy Allocator =================== diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile index 4fe190aee584..de2ffca3b6e4 100644 --- a/drivers/gpu/drm/Makefile +++ b/drivers/gpu/drm/Makefile @@ -45,6 +45,7 @@ drm-y := \ drm_vblank.o \ drm_vblank_work.o \ drm_vma_manager.o \ + drm_gpuva_mgr.o \ drm_writeback.o drm-$(CONFIG_DRM_LEGACY) += \ drm_agpsupport.o \ diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c index 59a0bb5ebd85..65115fe88627 100644 --- a/drivers/gpu/drm/drm_gem.c +++ b/drivers/gpu/drm/drm_gem.c @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device *dev, if (!obj->resv) obj->resv = &obj->_resv; + if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA)) + drm_gem_gpuva_init(obj); + drm_vma_node_reset(&obj->vma_node); INIT_LIST_HEAD(&obj->lru_node); } diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c new file mode 100644 index 000000000000..e665f642689d --- /dev/null +++ b/drivers/gpu/drm/drm_gpuva_mgr.c @@ -0,0 +1,1323 @@ +// SPDX-License-Identifier: GPL-2.0 +/* + * Copyright (c) 2022 Red Hat. + * + * Permission is hereby granted, free of charge, to any person obtaining a + * copy of this software and associated documentation files (the "Software"), + * to deal in the Software without restriction, including without limitation + * the rights to use, copy, modify, merge, publish, distribute, sublicense, + * and/or sell copies of the Software, and to permit persons to whom the + * Software is furnished to do so, subject to the following conditions: + * + * The above copyright notice and this permission notice shall be included in + * all copies or substantial portions of the Software. + * + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR + * OTHER DEALINGS IN THE SOFTWARE. + * + * Authors: + * Danilo Krummrich <dakr@redhat.com> + * + */ + +#include <drm/drm_gem.h> +#include <drm/drm_gpuva_mgr.h> + +/** + * DOC: Overview + * + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager keeps track + * of a GPU's virtual address (VA) space and manages the corresponding virtual + * mappings represented by &drm_gpuva objects. It also keeps track of the + * mapping's backing &drm_gem_object buffers. + * + * &drm_gem_object buffers maintain a list (and a corresponding list lock) of + * &drm_gpuva objects representing all existent GPU VA mappings using this + * &drm_gem_object as backing buffer. + * + * A GPU VA mapping can only be created within a previously allocated + * &drm_gpuva_region, which represents a reserved portion of the GPU VA space. + * GPU VA mappings are not allowed to span over a &drm_gpuva_region's boundary. + * + * GPU VA regions can also be flagged as sparse, which allows drivers to create + * sparse mappings for a whole GPU VA region in order to support Vulkan + * 'Sparse Resources'. + * + * The GPU VA manager internally uses the &drm_mm range allocator to manage the + * &drm_gpuva mappings and the &drm_gpuva_regions within a GPU's virtual address + * space. + * + * Besides the GPU VA space regions (&drm_gpuva_region) allocated by a driver + * the &drm_gpuva_manager contains a special region representing the portion of + * VA space reserved by the kernel. This node is initialized together with the + * GPU VA manager instance and removed when the GPU VA manager is destroyed. + * + * In a typical application drivers would embed struct drm_gpuva_manager, + * struct drm_gpuva_region and struct drm_gpuva within their own driver + * specific structures, there won't be any memory allocations of it's own nor + * memory allocations of &drm_gpuva or &drm_gpuva_region entries. + */ + +/** + * DOC: Split and Merge + * + * The DRM GPU VA manager also provides an algorithm implementing splitting and + * merging of existent GPU VA mappings with the ones that are requested to be + * mapped or unmapped. This feature is required by the Vulkan API to implement + * Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to this as + * VM BIND. + * + * Drivers can call drm_gpuva_sm_map_ops_create() to obtain a list of map, unmap + * and remap operations for a given newly requested mapping. This list + * represents the set of operations to execute in order to integrate the new + * mapping cleanly into the current state of the GPU VA space. + * + * Depending on how the new GPU VA mapping intersects with the existent mappings + * of the GPU VA space the &drm_gpuva_ops contain an arbitrary amount of unmap + * operations, a maximum of two remap operations and a single map operation. + * The set of operations can also be empty if no operation is required, e.g. if + * the requested mapping already exists in the exact same way. + * + * The single map operation, if existent, represents the original map operation + * requested by the caller. Please note that this operation might be altered + * comparing it with the original map operation, e.g. because it was merged with + * an already existent mapping. Hence, drivers must execute this map operation + * instead of the original one they passed to drm_gpuva_sm_map_ops_create(). + * + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates whether the + * &drm_gpuva to unmap is physically contiguous with the original mapping + * request. Optionally, if 'keep' is set, drivers may keep the actual page table + * entries for this &drm_gpuva, adding the missing page table entries only and + * update the &drm_gpuva_manager's view of things accordingly. + * + * Drivers may do the same optimization, namely delta page table updates, also + * for remap operations. This is possible since &drm_gpuva_op_remap consists of + * one unmap operation and one or two map operations, such that drivers can + * derive the page table update delta accordingly. + * + * Note that there can't be more than two existent mappings to split up, one at + * the beginning and one at the end of the new mapping, hence there is a + * maximum of two remap operations. + * + * Generally, the DRM GPU VA manager never merges mappings across the + * boundaries of &drm_gpuva_regions. This is the case since merging between + * GPU VA regions would result into unmap and map operations to be issued for + * both regions involved although the original mapping request was referred to + * one specific GPU VA region only. Since the other GPU VA region, the one not + * explicitly requested to be altered, might be in use by the GPU, we are not + * allowed to issue any map/unmap operations for this region. + * + * Note that before calling drm_gpuva_sm_map_ops_create() again with another + * mapping request it is necessary to update the &drm_gpuva_manager's view of + * the GPU VA space. The previously obtained operations must be either fully + * processed or completely abandoned. + * + * To update the &drm_gpuva_manager's view of the GPU VA space + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or + * drm_gpuva_destroy_unlocked() should be used. + * + * Analogue to drm_gpuva_sm_map_ops_create() drm_gpuva_sm_unmap_ops_create() + * provides drivers a the list of operations to be executed in order to unmap + * a range of GPU VA space. The logic behind this functions is way simpler + * though: For all existent mappings enclosed by the given range unmap + * operations are created. For mappings which are only partically located within + * the given range, remap operations are created such that those mappings are + * split up and re-mapped partically. + * + * The following paragraph depicts the basic constellations of existent GPU VA + * mappings, a newly requested mapping and the resulting mappings as implemented + * by drm_gpuva_sm_map_ops_create() - it doesn't cover arbitrary combinations + * of those constellations. + * + * :: + * + * 1) Existent mapping is kept. + * ---------------------------- + * + * 0 a 1 + * old: |-----------| (bo_offset=n) + * + * 0 a 1 + * req: |-----------| (bo_offset=n) + * + * 0 a 1 + * new: |-----------| (bo_offset=n) + * + * + * 2) Existent mapping is replaced. + * -------------------------------- + * + * 0 a 1 + * old: |-----------| (bo_offset=n) + * + * 0 a 1 + * req: |-----------| (bo_offset=m) + * + * 0 a 1 + * new: |-----------| (bo_offset=m) + * + * + * 3) Existent mapping is replaced. + * -------------------------------- + * + * 0 a 1 + * old: |-----------| (bo_offset=n) + * + * 0 b 1 + * req: |-----------| (bo_offset=n) + * + * 0 b 1 + * new: |-----------| (bo_offset=n) + * + * + * 4) Existent mapping is replaced. + * -------------------------------- + * + * 0 a 1 + * old: |-----| (bo_offset=n) + * + * 0 a 2 + * req: |-----------| (bo_offset=n) + * + * 0 a 2 + * new: |-----------| (bo_offset=n) + * + * Note: We expect to see the same result for a request with a different bo + * and/or bo_offset. + * + * + * 5) Existent mapping is split. + * ----------------------------- + * + * 0 a 2 + * old: |-----------| (bo_offset=n) + * + * 0 b 1 + * req: |-----| (bo_offset=n) + * + * 0 b 1 a' 2 + * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1) + * + * Note: We expect to see the same result for a request with a different bo + * and/or non-contiguous bo_offset. + * + * + * 6) Existent mapping is kept. + * ---------------------------- + * + * 0 a 2 + * old: |-----------| (bo_offset=n) + * + * 0 a 1 + * req: |-----| (bo_offset=n) + * + * 0 a 2 + * new: |-----------| (bo_offset=n) + * + * + * 7) Existent mapping is split. + * ----------------------------- + * + * 0 a 2 + * old: |-----------| (bo_offset=n) + * + * 1 b 2 + * req: |-----| (bo_offset=m) + * + * 0 a 1 b 2 + * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m) + * + * + * 8) Existent mapping is kept. + * ---------------------------- + * + * 0 a 2 + * old: |-----------| (bo_offset=n) + * + * 1 a 2 + * req: |-----| (bo_offset=n+1) + * + * 0 a 2 + * new: |-----------| (bo_offset=n) + * + * + * 9) Existent mapping is split. + * ----------------------------- + * + * 0 a 2 + * old: |-----------| (bo_offset=n) + * + * 1 b 3 + * req: |-----------| (bo_offset=m) + * + * 0 a 1 b 3 + * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m) + * + * + * 10) Existent mapping is merged. + * ------------------------------- + * + * 0 a 2 + * old: |-----------| (bo_offset=n) + * + * 1 a 3 + * req: |-----------| (bo_offset=n+1) + * + * 0 a 3 + * new: |-----------------| (bo_offset=n) + * + * + * 11) Existent mapping is split. + * ------------------------------ + * + * 0 a 3 + * old: |-----------------| (bo_offset=n) + * + * 1 b 2 + * req: |-----| (bo_offset=m) + * + * 0 a 1 b 2 a' 3 + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2) + * + * + * 12) Existent mapping is kept. + * ----------------------------- + * + * 0 a 3 + * old: |-----------------| (bo_offset=n) + * + * 1 a 2 + * req: |-----| (bo_offset=n+1) + * + * 0 a 3 + * old: |-----------------| (bo_offset=n) + * + * + * 13) Existent mapping is replaced. + * --------------------------------- + * + * 1 a 2 + * old: |-----| (bo_offset=n) + * + * 0 a 2 + * req: |-----------| (bo_offset=n) + * + * 0 a 2 + * new: |-----------| (bo_offset=n) + * + * Note: We expect to see the same result for a request with a different bo + * and/or non-contiguous bo_offset. + * + * + * 14) Existent mapping is replaced. + * --------------------------------- + * + * 1 a 2 + * old: |-----| (bo_offset=n) + * + * 0 a 3 + * req: |----------------| (bo_offset=n) + * + * 0 a 3 + * new: |----------------| (bo_offset=n) + * + * Note: We expect to see the same result for a request with a different bo + * and/or non-contiguous bo_offset. + * + * + * 15) Existent mapping is split. + * ------------------------------ + * + * 1 a 3 + * old: |-----------| (bo_offset=n) + * + * 0 b 2 + * req: |-----------| (bo_offset=m) + * + * 0 b 2 a' 3 + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2) + * + * + * 16) Existent mappings are merged. + * --------------------------------- + * + * 0 a 1 + * old: |-----------| (bo_offset=n) + * + * 2 a 3 + * old': |-----------| (bo_offset=n+2) + * + * 1 a 2 + * req: |-----------| (bo_offset=n+1) + * + * a + * new: |----------------------------------| (bo_offset=n) + */ + +/** + * DOC: Locking + * + * Generally, the GPU VA manager does not take care of locking itself, it is + * the drivers responsibility to take care about locking. Drivers might want to + * protect the following operations: inserting, destroying and iterating + * &drm_gpuva and &drm_gpuva_region objects as well as generating split and merge + * operations. + * + * The GPU VA manager does take care of the locking of the backing + * &drm_gem_object buffers GPU VA lists though, unless the provided functions + * documentation claims otherwise. + */ + +/** + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager + * @mgr: pointer to the &drm_gpuva_manager to initialize + * @name: the name of the GPU VA space + * @start_offset: the start offset of the GPU VA space + * @range: the size of the GPU VA space + * @reserve_offset: the start of the kernel reserved GPU VA area + * @reserve_range: the size of the kernel reserved GPU VA area + * + * The &drm_gpuva_manager must be initialized with this function before use. + * + * Note that @mgr must be cleared to 0 before calling this function. The given + * &name is expected to be managed by the surrounding driver structures. + */ +void +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, + const char *name, + u64 start_offset, u64 range, + u64 reserve_offset, u64 reserve_range) +{ + drm_mm_init(&mgr->va_mm, start_offset, range); + drm_mm_init(&mgr->region_mm, start_offset, range); + + mgr->mm_start = start_offset; + mgr->mm_range = range; + + mgr->name = name ? name : "unknown"; + + memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_mm_node)); + mgr->kernel_alloc_node.start = reserve_offset; + mgr->kernel_alloc_node.size = reserve_range; + drm_mm_reserve_node(&mgr->region_mm, &mgr->kernel_alloc_node); +} +EXPORT_SYMBOL(drm_gpuva_manager_init); + +/** + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager + * @mgr: pointer to the &drm_gpuva_manager to clean up + * + * Note that it is a bug to call this function on a manager that still + * holds GPU VA mappings. + */ +void +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr) +{ + mgr->name = NULL; + drm_mm_remove_node(&mgr->kernel_alloc_node); + drm_mm_takedown(&mgr->va_mm); + drm_mm_takedown(&mgr->region_mm); +} +EXPORT_SYMBOL(drm_gpuva_manager_destroy); + +static struct drm_gpuva_region * +drm_gpuva_in_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) +{ + struct drm_gpuva_region *reg; + + /* Find the VA region the requested range is strictly enclosed by. */ + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) { + if (reg->node.start <= addr && + reg->node.start + reg->node.size >= addr + range && + ®->node != &mgr->kernel_alloc_node) + return reg; + } + + return NULL; +} + +static bool +drm_gpuva_in_any_region(struct drm_gpuva_manager *mgr, u64 addr, u64 range) +{ + return !!drm_gpuva_in_region(mgr, addr, range); +} + +/** + * drm_gpuva_insert - insert a &drm_gpuva + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in + * @va: the &drm_gpuva to insert + * @addr: the start address of the GPU VA + * @range: the range of the GPU VA + * + * Insert a &drm_gpuva with a given address and range into a + * &drm_gpuva_manager. + * + * The function assumes the caller does not hold the &drm_gem_object's + * GPU VA list mutex. + * + * Returns: 0 on success, negative error code on failure. + */ +int +drm_gpuva_insert(struct drm_gpuva_manager *mgr, + struct drm_gpuva *va, + u64 addr, u64 range) +{ + struct drm_gpuva_region *reg; + int ret; + + if (!va->gem.obj) + return -EINVAL; + + reg = drm_gpuva_in_region(mgr, addr, range); + if (!reg) + return -EINVAL; + + ret = drm_mm_insert_node_in_range(&mgr->va_mm, &va->node, + range, 0, + 0, addr, + addr + range, + DRM_MM_INSERT_LOW|DRM_MM_INSERT_ONCE); + if (ret) + return ret; + + va->mgr = mgr; + va->region = reg; + + return 0; +} +EXPORT_SYMBOL(drm_gpuva_insert); + +/** + * drm_gpuva_link_locked - link a &drm_gpuva + * @va: the &drm_gpuva to link + * + * This adds the given &va to the GPU VA list of the &drm_gem_object it is + * associated with. + * + * The function assumes the caller already holds the &drm_gem_object's + * GPU VA list mutex. + */ +void +drm_gpuva_link_locked(struct drm_gpuva *va) +{ + lockdep_assert_held(&va->gem.obj->gpuva.mutex); + list_add_tail(&va->head, &va->gem.obj->gpuva.list); +} +EXPORT_SYMBOL(drm_gpuva_link_locked); + +/** + * drm_gpuva_link_unlocked - unlink a &drm_gpuva + * @va: the &drm_gpuva to unlink + * + * This adds the given &va to the GPU VA list of the &drm_gem_object it is + * associated with. + * + * The function assumes the caller does not hold the &drm_gem_object's + * GPU VA list mutex. + */ +void +drm_gpuva_link_unlocked(struct drm_gpuva *va) +{ + drm_gem_gpuva_lock(va->gem.obj); + drm_gpuva_link_locked(va); + drm_gem_gpuva_unlock(va->gem.obj); +} +EXPORT_SYMBOL(drm_gpuva_link_unlocked); + +/** + * drm_gpuva_unlink_locked - unlink a &drm_gpuva + * @va: the &drm_gpuva to unlink + * + * This removes the given &va from the GPU VA list of the &drm_gem_object it is + * associated with. + * + * The function assumes the caller already holds the &drm_gem_object's + * GPU VA list mutex. + */ +void +drm_gpuva_unlink_locked(struct drm_gpuva *va) +{ + lockdep_assert_held(&va->gem.obj->gpuva.mutex); + list_del_init(&va->head); +} +EXPORT_SYMBOL(drm_gpuva_unlink_locked); + +/** + * drm_gpuva_unlink_unlocked - unlink a &drm_gpuva + * @va: the &drm_gpuva to unlink + * + * This removes the given &va from the GPU VA list of the &drm_gem_object it is + * associated with. + * + * The function assumes the caller does not hold the &drm_gem_object's + * GPU VA list mutex. + */ +void +drm_gpuva_unlink_unlocked(struct drm_gpuva *va) +{ + drm_gem_gpuva_lock(va->gem.obj); + drm_gpuva_unlink_locked(va); + drm_gem_gpuva_unlock(va->gem.obj); +} +EXPORT_SYMBOL(drm_gpuva_unlink_unlocked); + +/** + * drm_gpuva_destroy_locked - destroy a &drm_gpuva + * @va: the &drm_gpuva to destroy + * + * This removes the given &va from GPU VA list of the &drm_gem_object it is + * associated with and removes it from the underlaying range allocator. + * + * The function assumes the caller already holds the &drm_gem_object's + * GPU VA list mutex. + */ +void +drm_gpuva_destroy_locked(struct drm_gpuva *va) +{ + lockdep_assert_held(&va->gem.obj->gpuva.mutex); + + list_del(&va->head); + drm_mm_remove_node(&va->node); +} +EXPORT_SYMBOL(drm_gpuva_destroy_locked); + +/** + * drm_gpuva_destroy_unlocked - destroy a &drm_gpuva + * @va: the &drm_gpuva to destroy + * + * This removes the given &va from GPU VA list of the &drm_gem_object it is + * associated with and removes it from the underlaying range allocator. + * + * The function assumes the caller does not hold the &drm_gem_object's + * GPU VA list mutex. + */ +void +drm_gpuva_destroy_unlocked(struct drm_gpuva *va) +{ + drm_gem_gpuva_lock(va->gem.obj); + list_del(&va->head); + drm_gem_gpuva_unlock(va->gem.obj); + + drm_mm_remove_node(&va->node); +} +EXPORT_SYMBOL(drm_gpuva_destroy_unlocked); + +/** + * drm_gpuva_find - find a &drm_gpuva + * @mgr: the &drm_gpuva_manager to search in + * @addr: the &drm_gpuvas address + * @range: the &drm_gpuvas range + * + * Returns: the &drm_gpuva at a given &addr and with a given &range + */ +struct drm_gpuva * +drm_gpuva_find(struct drm_gpuva_manager *mgr, + u64 addr, u64 range) +{ + struct drm_gpuva *va; + + drm_gpuva_for_each_va_in_range(va, mgr, addr, range) { + if (va->node.start == addr && + va->node.size == range) + return va; + } + + return NULL; +} +EXPORT_SYMBOL(drm_gpuva_find); + +/** + * drm_gpuva_find_prev - find the &drm_gpuva before the given address + * @mgr: the &drm_gpuva_manager to search in + * @start: the given GPU VA's start address + * + * Find the adjacent &drm_gpuva before the GPU VA with given &start address. + * + * Note that if there is any free space between the GPU VA mappings no mapping + * is returned. + * + * Returns: a pointer to the found &drm_gpuva or NULL if none was found + */ +struct drm_gpuva * +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start) +{ + struct drm_mm_node *node; + + if (start <= mgr->mm_start || + start > (mgr->mm_start + mgr->mm_range)) + return NULL; + + node = __drm_mm_interval_first(&mgr->va_mm, start - 1, start); + if (node == &mgr->va_mm.head_node) + return NULL; + + return (struct drm_gpuva *)node; +} +EXPORT_SYMBOL(drm_gpuva_find_prev); + +/** + * drm_gpuva_find_next - find the &drm_gpuva after the given address + * @mgr: the &drm_gpuva_manager to search in + * @end: the given GPU VA's end address + * + * Find the adjacent &drm_gpuva after the GPU VA with given &end address. + * + * Note that if there is any free space between the GPU VA mappings no mapping + * is returned. + * + * Returns: a pointer to the found &drm_gpuva or NULL if none was found + */ +struct drm_gpuva * +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end) +{ + struct drm_mm_node *node; + + if (end < mgr->mm_start || + end >= (mgr->mm_start + mgr->mm_range)) + return NULL; + + node = __drm_mm_interval_first(&mgr->va_mm, end, end + 1); + if (node == &mgr->va_mm.head_node) + return NULL; + + return (struct drm_gpuva *)node; +} +EXPORT_SYMBOL(drm_gpuva_find_next); + +/** + * drm_gpuva_region_insert - insert a &drm_gpuva_region + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in + * @reg: the &drm_gpuva_region to insert + * @addr: the start address of the GPU VA + * @range: the range of the GPU VA + * + * Insert a &drm_gpuva_region with a given address and range into a + * &drm_gpuva_manager. + * + * Returns: 0 on success, negative error code on failure. + */ +int +drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, + struct drm_gpuva_region *reg, + u64 addr, u64 range) +{ + int ret; + + ret = drm_mm_insert_node_in_range(&mgr->region_mm, ®->node, + range, 0, + 0, addr, + addr + range, + DRM_MM_INSERT_LOW| + DRM_MM_INSERT_ONCE); + if (ret) + return ret; + + reg->mgr = mgr; + + return 0; +} +EXPORT_SYMBOL(drm_gpuva_region_insert); + +/** + * drm_gpuva_region_destroy - destroy a &drm_gpuva_region + * @mgr: the &drm_gpuva_manager holding the region + * @reg: the &drm_gpuva to destroy + * + * This removes the given ® from the underlaying range allocator. + */ +void +drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, + struct drm_gpuva_region *reg) +{ + struct drm_gpuva *va; + + drm_gpuva_for_each_va_in_range(va, mgr, + reg->node.start, + reg->node.size) { + WARN(1, "GPU VA region must be empty on destroy.\n"); + return; + } + + if (®->node == &mgr->kernel_alloc_node) { + WARN(1, "Can't destroy kernel reserved region.\n"); + return; + } + + drm_mm_remove_node(®->node); +} +EXPORT_SYMBOL(drm_gpuva_region_destroy); + +/** + * drm_gpuva_region_find - find a &drm_gpuva_region + * @mgr: the &drm_gpuva_manager to search in + * @addr: the &drm_gpuva_regions address + * @range: the &drm_gpuva_regions range + * + * Returns: the &drm_gpuva_region at a given &addr and with a given &range + */ +struct drm_gpuva_region * +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, + u64 addr, u64 range) +{ + struct drm_gpuva_region *reg; + + drm_gpuva_for_each_region_in_range(reg, mgr, addr, addr + range) + if (reg->node.start == addr && + reg->node.size == range) + return reg; + + return NULL; +} +EXPORT_SYMBOL(drm_gpuva_region_find); + +static int +gpuva_op_map_new(struct drm_gpuva_op **pop, + u64 addr, u64 range, + struct drm_gem_object *obj, u64 offset) +{ + struct drm_gpuva_op *op; + + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); + if (!op) + return -ENOMEM; + + op->op = DRM_GPUVA_OP_MAP; + op->map.va.addr = addr; + op->map.va.range = range; + op->map.gem.obj = obj; + op->map.gem.offset = offset; + + return 0; +} + +static int +gpuva_op_remap_new(struct drm_gpuva_op **pop, + struct drm_gpuva_op_map *prev, + struct drm_gpuva_op_map *next, + struct drm_gpuva_op_unmap *unmap) +{ + struct drm_gpuva_op *op; + struct drm_gpuva_op_remap *r; + + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); + if (!op) + return -ENOMEM; + + op->op = DRM_GPUVA_OP_REMAP; + r = &op->remap; + + if (prev) { + r->prev = kmemdup(prev, sizeof(*prev), GFP_KERNEL); + if (!r->prev) + goto err_free_op; + } + + if (next) { + r->next = kmemdup(next, sizeof(*next), GFP_KERNEL); + if (!r->next) + goto err_free_prev; + } + + r->unmap = kmemdup(unmap, sizeof(*unmap), GFP_KERNEL); + if (!r->unmap) + goto err_free_next; + + return 0; + +err_free_next: + if (next) + kfree(r->next); +err_free_prev: + if (prev) + kfree(r->prev); +err_free_op: + kfree(op); + *pop = NULL; + + return -ENOMEM; +} + +static int +gpuva_op_unmap_new(struct drm_gpuva_op **pop, + struct drm_gpuva *va, bool merge) +{ + struct drm_gpuva_op *op; + + op = *pop = kzalloc(sizeof(*op), GFP_KERNEL); + if (!op) + return -ENOMEM; + + op->op = DRM_GPUVA_OP_UNMAP; + op->unmap.va = va; + op->unmap.keep = merge; + + return 0; +} + +#define op_map_new_to_list(_ops, _addr, _range, \ + _obj, _offset) \ +do { \ + struct drm_gpuva_op *op; \ + \ + ret = gpuva_op_map_new(&op, _addr, _range, \ + _obj, _offset); \ + if (ret) \ + goto err_free_ops; \ + \ + list_add_tail(&op->entry, _ops); \ +} while (0) + +#define op_remap_new_to_list(_ops, _prev, _next, \ + _unmap) \ +do { \ + struct drm_gpuva_op *op; \ + \ + ret = gpuva_op_remap_new(&op, _prev, _next, \ + _unmap); \ + if (ret) \ + goto err_free_ops; \ + \ + list_add_tail(&op->entry, _ops); \ +} while (0) + +#define op_unmap_new_to_list(_ops, _gpuva, _merge) \ +do { \ + struct drm_gpuva_op *op; \ + \ + ret = gpuva_op_unmap_new(&op, _gpuva, _merge); \ + if (ret) \ + goto err_free_ops; \ + \ + list_add_tail(&op->entry, _ops); \ +} while (0) + +/** + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split and merge + * @mgr: the &drm_gpuva_manager representing the GPU VA space + * @req_addr: the start address of the new mapping + * @req_range: the range of the new mapping + * @req_obj: the &drm_gem_object to map + * @req_offset: the offset within the &drm_gem_object + * + * This function creates a list of operations to perform splitting and merging + * of existent mapping(s) with the newly requested one. + * + * The list can be iterated with &drm_gpuva_for_each_op and must be processed + * in the given order. It can contain map, unmap and remap operations, but it + * also can be empty if no operation is required, e.g. if the requested mapping + * already exists is the exact same way. + * + * There can be an arbitrary amount of unmap operations, a maximum of two remap + * operations and a single map operation. The latter one, if existent, + * represents the original map operation requested by the caller. Please note + * that the map operation might has been modified, e.g. if it was + * merged with an existent mapping. + * + * Note that before calling this function again with another mapping request it + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. + * The previously obtained operations must be either processed or abandoned. + * To update the &drm_gpuva_manager's view of the GPU VA space + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or + * drm_gpuva_destroy_unlocked() should be used. + * + * After the caller finished processing the returned &drm_gpuva_ops, they must + * be freed with &drm_gpuva_ops_free. + * + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure + */ +struct drm_gpuva_ops * +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, + u64 req_addr, u64 req_range, + struct drm_gem_object *req_obj, u64 req_offset) +{ + struct drm_gpuva_ops *ops; + struct drm_gpuva *va, *prev = NULL; + u64 req_end = req_addr + req_range; + bool skip_pmerge = false, skip_nmerge = false; + int ret; + + if (!drm_gpuva_in_any_region(mgr, req_addr, req_range)) + return ERR_PTR(-EINVAL); + + ops = kzalloc(sizeof(*ops), GFP_KERNEL); + if (!ops) + return ERR_PTR(-ENOMEM); + + INIT_LIST_HEAD(&ops->list); + + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { + struct drm_gem_object *obj = va->gem.obj; + u64 offset = va->gem.offset; + u64 addr = va->node.start; + u64 range = va->node.size; + u64 end = addr + range; + + /* Generally, we want to skip merging with potential mappings + * left and right of the requested one when we found a + * collision, since merging happens in this loop already. + * + * However, there is one exception when the requested mapping + * spans into a free VM area. If this is the case we might + * still hit the boundary of another mapping before and/or + * after the free VM area. + */ + skip_pmerge = true; + skip_nmerge = true; + + if (addr == req_addr) { + bool merge = obj == req_obj && + offset == req_offset; + if (end == req_end) { + if (merge) + goto done; + + op_unmap_new_to_list(&ops->list, va, false); + break; + } + + if (end < req_end) { + skip_nmerge = false; + op_unmap_new_to_list(&ops->list, va, merge); + goto next; + } + + if (end > req_end) { + struct drm_gpuva_op_map n = { + .va.addr = req_end, + .va.range = range - req_range, + .gem.obj = obj, + .gem.offset = offset + req_range, + }; + struct drm_gpuva_op_unmap u = { .va = va }; + + if (merge) + goto done; + + op_remap_new_to_list(&ops->list, NULL, &n, &u); + break; + } + } else if (addr < req_addr) { + u64 ls_range = req_addr - addr; + struct drm_gpuva_op_map p = { + .va.addr = addr, + .va.range = ls_range, + .gem.obj = obj, + .gem.offset = offset, + }; + struct drm_gpuva_op_unmap u = { .va = va }; + bool merge = obj == req_obj && + offset + ls_range == req_offset; + + if (end == req_end) { + if (merge) + goto done; + + op_remap_new_to_list(&ops->list, &p, NULL, &u); + break; + } + + if (end < req_end) { + u64 new_addr = addr; + u64 new_range = req_range + ls_range; + u64 new_offset = offset; + + /* We validated that the requested mapping is + * within a single VA region already. + * Since it overlaps the current mapping (which + * can't cross a VA region boundary) we can be + * sure that we're still within the boundaries + * of the same VA region after merging. + */ + if (merge) { + req_offset = new_offset; + req_addr = new_addr; + req_range = new_range; + op_unmap_new_to_list(&ops->list, va, true); + goto next; + } + + op_remap_new_to_list(&ops->list, &p, NULL, &u); + goto next; + } + + if (end > req_end) { + struct drm_gpuva_op_map n = { + .va.addr = req_end, + .va.range = end - req_end, + .gem.obj = obj, + .gem.offset = offset + ls_range + + req_range, + }; + + if (merge) + goto done; + + op_remap_new_to_list(&ops->list, &p, &n, &u); + break; + } + } else if (addr > req_addr) { + bool merge = obj == req_obj && + offset == req_offset + + (addr - req_addr); + if (!prev) + skip_pmerge = false; + + if (end == req_end) { + op_unmap_new_to_list(&ops->list, va, merge); + break; + } + + if (end < req_end) { + skip_nmerge = false; + op_unmap_new_to_list(&ops->list, va, merge); + goto next; + } + + if (end > req_end) { + struct drm_gpuva_op_map n = { + .va.addr = req_end, + .va.range = end - req_end, + .gem.obj = obj, + .gem.offset = offset + req_end - addr, + }; + struct drm_gpuva_op_unmap u = { .va = va }; + u64 new_end = end; + u64 new_range = new_end - req_addr; + + /* We validated that the requested mapping is + * within a single VA region already. + * Since it overlaps the current mapping (which + * can't cross a VA region boundary) we can be + * sure that we're still within the boundaries + * of the same VA region after merging. + */ + if (merge) { + req_end = new_end; + req_range = new_range; + op_unmap_new_to_list(&ops->list, va, true); + break; + } + + op_remap_new_to_list(&ops->list, NULL, &n, &u); + break; + } + } +next: + prev = va; + } + + va = skip_pmerge ? NULL : drm_gpuva_find_prev(mgr, req_addr); + if (va) { + struct drm_gem_object *obj = va->gem.obj; + u64 offset = va->gem.offset; + u64 addr = va->node.start; + u64 range = va->node.size; + u64 new_offset = offset; + u64 new_addr = addr; + u64 new_range = req_range + range; + bool merge = obj == req_obj && + offset + range == req_offset; + + /* Don't merge over VA region boundaries. */ + merge &= drm_gpuva_in_any_region(mgr, new_addr, new_range); + if (merge) { + op_unmap_new_to_list(&ops->list, va, true); + + req_offset = new_offset; + req_addr = new_addr; + req_range = new_range; + } + } + + va = skip_nmerge ? NULL : drm_gpuva_find_next(mgr, req_end); + if (va) { + struct drm_gem_object *obj = va->gem.obj; + u64 offset = va->gem.offset; + u64 addr = va->node.start; + u64 range = va->node.size; + u64 end = addr + range; + u64 new_range = req_range + range; + u64 new_end = end; + bool merge = obj == req_obj && + offset == req_offset + req_range; + + /* Don't merge over VA region boundaries. */ + merge &= drm_gpuva_in_any_region(mgr, req_addr, new_range); + if (merge) { + op_unmap_new_to_list(&ops->list, va, true); + + req_range = new_range; + req_end = new_end; + } + } + + op_map_new_to_list(&ops->list, + req_addr, req_range, + req_obj, req_offset); + +done: + return ops; + +err_free_ops: + drm_gpuva_ops_free(ops); + return ERR_PTR(ret); +} +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create); + +#undef op_map_new_to_list +#undef op_remap_new_to_list +#undef op_unmap_new_to_list + +/** + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to split on unmap + * @mgr: the &drm_gpuva_manager representing the GPU VA space + * @req_addr: the start address of the range to unmap + * @req_range: the range of the mappings to unmap + * + * This function creates a list of operations to perform unmapping and, if + * required, splitting of the mappings overlapping the unmap range. + * + * The list can be iterated with &drm_gpuva_for_each_op and must be processed + * in the given order. It can contain unmap and remap operations, depending on + * whether there are actual overlapping mappings to split. + * + * There can be an arbitrary amount of unmap operations and a maximum of two + * remap operations. + * + * Note that before calling this function again with another range to unmap it + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. + * The previously obtained operations must be processed or abandoned. + * To update the &drm_gpuva_manager's view of the GPU VA space + * drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or + * drm_gpuva_destroy_unlocked() should be used. + * + * After the caller finished processing the returned &drm_gpuva_ops, they must + * be freed with &drm_gpuva_ops_free. + * + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure + */ +struct drm_gpuva_ops * +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, + u64 req_addr, u64 req_range) +{ + struct drm_gpuva_ops *ops; + struct drm_gpuva_op *op; + struct drm_gpuva_op_remap *r; + struct drm_gpuva *va; + u64 req_end = req_addr + req_range; + int ret; + + ops = kzalloc(sizeof(*ops), GFP_KERNEL); + if (!ops) + return ERR_PTR(-ENOMEM); + + INIT_LIST_HEAD(&ops->list); + + drm_gpuva_for_each_va_in_range(va, mgr, req_addr, req_end) { + struct drm_gem_object *obj = va->gem.obj; + u64 offset = va->gem.offset; + u64 addr = va->node.start; + u64 range = va->node.size; + u64 end = addr + range; + + op = kzalloc(sizeof(*op), GFP_KERNEL); + if (!op) { + ret = -ENOMEM; + goto err_free_ops; + } + + r = &op->remap; + + if (addr < req_addr) { + r->prev = kzalloc(sizeof(*r->prev), GFP_KERNEL); + if (!r->prev) { + ret = -ENOMEM; + goto err_free_op; + } + + r->prev->va.addr = addr; + r->prev->va.range = req_addr - addr; + r->prev->gem.obj = obj; + r->prev->gem.offset = offset; + } + + if (end > req_end) { + r->next = kzalloc(sizeof(*r->next), GFP_KERNEL); + if (!r->next) { + ret = -ENOMEM; + goto err_free_prev; + } + + r->next->va.addr = req_end; + r->next->va.range = end - req_end; + r->next->gem.obj = obj; + r->next->gem.offset = offset + (req_end - addr); + } + + if (op->remap.prev || op->remap.next) { + op->op = DRM_GPUVA_OP_REMAP; + r->unmap = kzalloc(sizeof(*r->unmap), GFP_KERNEL); + if (!r->unmap) { + ret = -ENOMEM; + goto err_free_next; + } + + r->unmap->va = va; + } else { + op->op = DRM_GPUVA_OP_UNMAP; + op->unmap.va = va; + } + + list_add_tail(&op->entry, &ops->list); + } + + return ops; + +err_free_next: + if (r->next) + kfree(r->next); +err_free_prev: + if (r->prev) + kfree(r->prev); +err_free_op: + kfree(op); +err_free_ops: + drm_gpuva_ops_free(ops); + return ERR_PTR(ret); +} +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create); + +/** + * drm_gpuva_ops_free - free the given &drm_gpuva_ops + * @ops: the &drm_gpuva_ops to free + * + * Frees the given &drm_gpuva_ops structure including all the ops associated + * with it. + */ +void +drm_gpuva_ops_free(struct drm_gpuva_ops *ops) +{ + struct drm_gpuva_op *op, *next; + + drm_gpuva_for_each_op_safe(op, next, ops) { + list_del(&op->entry); + if (op->op == DRM_GPUVA_OP_REMAP) { + if (op->remap.prev) + kfree(op->remap.prev); + + if (op->remap.next) + kfree(op->remap.next); + + kfree(op->remap.unmap); + } + kfree(op); + } + + kfree(ops); +} +EXPORT_SYMBOL(drm_gpuva_ops_free); diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h index d7c521e8860f..6feacd93aca6 100644 --- a/include/drm/drm_drv.h +++ b/include/drm/drm_drv.h @@ -104,6 +104,12 @@ enum drm_driver_feature { * acceleration should be handled by two drivers that are connected using auxiliary bus. */ DRIVER_COMPUTE_ACCEL = BIT(7), + /** + * @DRIVER_GEM_GPUVA: + * + * Driver supports user defined GPU VA bindings for GEM objects. + */ + DRIVER_GEM_GPUVA = BIT(8), /* IMPORTANT: Below are all the legacy flags, add new ones above. */ diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h index 772a4adf5287..4a3679034966 100644 --- a/include/drm/drm_gem.h +++ b/include/drm/drm_gem.h @@ -36,6 +36,8 @@ #include <linux/kref.h> #include <linux/dma-resv.h> +#include <linux/list.h> +#include <linux/mutex.h> #include <drm/drm_vma_manager.h> @@ -337,6 +339,17 @@ struct drm_gem_object { */ struct dma_resv _resv; + /** + * @gpuva: + * + * Provides the list and list mutex of GPU VAs attached to this + * GEM object. + */ + struct { + struct list_head list; + struct mutex mutex; + } gpuva; + /** * @funcs: * @@ -479,4 +492,66 @@ void drm_gem_lru_move_tail(struct drm_gem_lru *lru, struct drm_gem_object *obj); unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru, unsigned nr_to_scan, bool (*shrink)(struct drm_gem_object *obj)); +/** + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object + * @obj: the &drm_gem_object + * + * This initializes the &drm_gem_object's &drm_gpuva list and the mutex + * protecting it. + * + * Calling this function is only necessary for drivers intending to support the + * &drm_driver_feature DRIVER_GEM_GPUVA. + */ +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj) +{ + INIT_LIST_HEAD(&obj->gpuva.list); + mutex_init(&obj->gpuva.mutex); +} + +/** + * drm_gem_gpuva_lock - lock the GEM's gpuva list mutex + * @obj: the &drm_gem_object + * + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. + */ +static inline void drm_gem_gpuva_lock(struct drm_gem_object *obj) +{ + mutex_lock(&obj->gpuva.mutex); +} + +/** + * drm_gem_gpuva_unlock - unlock the GEM's gpuva list mutex + * @obj: the &drm_gem_object + * + * This unlocks the mutex protecting the &drm_gem_object's &drm_gpuva list. + */ +static inline void drm_gem_gpuva_unlock(struct drm_gem_object *obj) +{ + mutex_unlock(&obj->gpuva.mutex); +} + +/** + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas + * @entry: &drm_gpuva structure to assign to in each iteration step + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with + * + * This iterator walks over all &drm_gpuva structures associated with the + * &drm_gpuva_manager. + */ +#define drm_gem_for_each_gpuva(entry, obj) \ + list_for_each_entry(entry, &obj->gpuva.list, head) + +/** + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list of gpuvas + * @entry: &drm_gpuva structure to assign to in each iteration step + * @next: &next &drm_gpuva to store the next step + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with + * + * This iterator walks over all &drm_gpuva structures associated with the + * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence + * it is save against removal of elements. + */ +#define drm_gem_for_each_gpuva_safe(entry, next, obj) \ + list_for_each_entry_safe(entry, next, &obj->gpuva.list, head) + #endif /* __DRM_GEM_H__ */ diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h new file mode 100644 index 000000000000..adeb0c916e91 --- /dev/null +++ b/include/drm/drm_gpuva_mgr.h @@ -0,0 +1,527 @@ +// SPDX-License-Identifier: GPL-2.0 + +#ifndef __DRM_GPUVA_MGR_H__ +#define __DRM_GPUVA_MGR_H__ + +/* + * Copyright (c) 2022 Red Hat. + * + * Permission is hereby granted, free of charge, to any person obtaining a + * copy of this software and associated documentation files (the "Software"), + * to deal in the Software without restriction, including without limitation + * the rights to use, copy, modify, merge, publish, distribute, sublicense, + * and/or sell copies of the Software, and to permit persons to whom the + * Software is furnished to do so, subject to the following conditions: + * + * The above copyright notice and this permission notice shall be included in + * all copies or substantial portions of the Software. + * + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR + * OTHER DEALINGS IN THE SOFTWARE. + */ + +#include <drm/drm_mm.h> +#include <linux/mm.h> +#include <linux/rbtree.h> +#include <linux/spinlock.h> +#include <linux/types.h> + +struct drm_gpuva_region; +struct drm_gpuva; +struct drm_gpuva_ops; + +/** + * struct drm_gpuva_manager - DRM GPU VA Manager + * + * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using + * the &drm_mm range allocator. Typically, this structure is embedded in bigger + * driver structures. + * + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or + * pages. + * + * There should be one manager instance per GPU virtual address space. + */ +struct drm_gpuva_manager { + /** + * @name: the name of the DRM GPU VA space + */ + const char *name; + + /** + * @mm_start: start of the VA space + */ + u64 mm_start; + + /** + * @mm_range: length of the VA space + */ + u64 mm_range; + + /** + * @region_mm: the &drm_mm range allocator to track GPU VA regions + */ + struct drm_mm region_mm; + + /** + * @va_mm: the &drm_mm range allocator to track GPU VA mappings + */ + struct drm_mm va_mm; + + /** + * @kernel_alloc_node: + * + * &drm_mm_node representing the address space cutout reserved for + * the kernel + */ + struct drm_mm_node kernel_alloc_node; +}; + +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr, + const char *name, + u64 start_offset, u64 range, + u64 reserve_offset, u64 reserve_range); +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr); + +/** + * struct drm_gpuva_region - structure to track a portion of GPU VA space + * + * This structure represents a portion of a GPUs VA space and is associated + * with a &drm_gpuva_manager. Internally it is based on a &drm_mm_node. + * + * GPU VA mappings, represented by &drm_gpuva objects, are restricted to be + * placed within a &drm_gpuva_region. + */ +struct drm_gpuva_region { + /** + * @node: the &drm_mm_node to track the GPU VA region + */ + struct drm_mm_node node; + + /** + * @mgr: the &drm_gpuva_manager this object is associated with + */ + struct drm_gpuva_manager *mgr; + + /** + * @sparse: indicates whether this region is sparse + */ + bool sparse; +}; + +struct drm_gpuva_region * +drm_gpuva_region_find(struct drm_gpuva_manager *mgr, + u64 addr, u64 range); +int drm_gpuva_region_insert(struct drm_gpuva_manager *mgr, + struct drm_gpuva_region *reg, + u64 addr, u64 range); +void drm_gpuva_region_destroy(struct drm_gpuva_manager *mgr, + struct drm_gpuva_region *reg); + +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, + struct drm_gpuva *va, + u64 addr, u64 range); +/** + * drm_gpuva_for_each_region_in_range - iternator to walk over a range of nodes + * @node__: &drm_gpuva_region structure to assign to in each iteration step + * @gpuva__: &drm_gpuva_manager structure to walk + * @start__: starting offset, the first node will overlap this + * @end__: ending offset, the last node will start before this (but may overlap) + * + * This iterator walks over all nodes in the range allocator that lie + * between @start and @end. It is implemented similarly to list_for_each(), + * but is using &drm_mm's internal interval tree to accelerate the search for + * the starting node, and hence isn't safe against removal of elements. It + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the + * backing &drm_mm, and may even continue indefinitely. + */ +#define drm_gpuva_for_each_region_in_range(node__, gpuva__, start__, end__) \ + for (node__ = (struct drm_gpuva_region *)__drm_mm_interval_first(&(gpuva__)->region_mm, \ + (start__), (end__)-1); \ + node__->node.start < (end__); \ + node__ = (struct drm_gpuva_region *)list_next_entry(&node__->node, node_list)) + +/** + * drm_gpuva_for_each_region - iternator to walk over a range of nodes + * @entry: &drm_gpuva_region structure to assign to in each iteration step + * @gpuva: &drm_gpuva_manager structure to walk + * + * This iterator walks over all &drm_gpuva_region structures associated with the + * &drm_gpuva_manager. + */ +#define drm_gpuva_for_each_region(entry, gpuva) \ + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) + +/** + * drm_gpuva_for_each_region_safe - iternator to safely walk over a range of + * nodes + * @entry: &drm_gpuva_region structure to assign to in each iteration step + * @next: &next &drm_gpuva_region to store the next step + * @gpuva: &drm_gpuva_manager structure to walk + * + * This iterator walks over all &drm_gpuva_region structures associated with the + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save + * against removal of elements. + */ +#define drm_gpuva_for_each_region_safe(entry, next, gpuva) \ + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->region_mm), node.node_list) + + +/** + * enum drm_gpuva_flags - flags for struct drm_gpuva + */ +enum drm_gpuva_flags { + /** + * @DRM_GPUVA_SWAPPED: flag indicating that the &drm_gpuva is swapped + */ + DRM_GPUVA_SWAPPED = (1 << 0), +}; + +/** + * struct drm_gpuva - structure to track a GPU VA mapping + * + * This structure represents a GPU VA mapping and is associated with a + * &drm_gpuva_manager. Internally it is based on a &drm_mm_node. + * + * Typically, this structure is embedded in bigger driver structures. + */ +struct drm_gpuva { + /** + * @node: the &drm_mm_node to track the GPU VA mapping + */ + struct drm_mm_node node; + + /** + * @mgr: the &drm_gpuva_manager this object is associated with + */ + struct drm_gpuva_manager *mgr; + + /** + * @region: the &drm_gpuva_region the &drm_gpuva is mapped in + */ + struct drm_gpuva_region *region; + + /** + * @head: the &list_head to attach this object to a &drm_gem_object + */ + struct list_head head; + + /** + * @flags: the &drm_gpuva_flags for this mapping + */ + enum drm_gpuva_flags flags; + + /** + * @gem: structure containing the &drm_gem_object and it's offset + */ + struct { + /** + * @offset: the offset within the &drm_gem_object + */ + u64 offset; + + /** + * @obj: the mapped &drm_gem_object + */ + struct drm_gem_object *obj; + } gem; +}; + +void drm_gpuva_link_locked(struct drm_gpuva *va); +void drm_gpuva_link_unlocked(struct drm_gpuva *va); +void drm_gpuva_unlink_locked(struct drm_gpuva *va); +void drm_gpuva_unlink_unlocked(struct drm_gpuva *va); + +void drm_gpuva_destroy_locked(struct drm_gpuva *va); +void drm_gpuva_destroy_unlocked(struct drm_gpuva *va); + +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr, + u64 addr, u64 range); +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start); +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end); + +/** + * drm_gpuva_swap - sets whether the backing BO of this &drm_gpuva is swapped + * @va: the &drm_gpuva to set the swap flag of + * @swap: indicates whether the &drm_gpuva is swapped + */ +static inline void drm_gpuva_swap(struct drm_gpuva *va, bool swap) +{ + if (swap) + va->flags |= DRM_GPUVA_SWAPPED; + else + va->flags &= ~DRM_GPUVA_SWAPPED; +} + +/** + * drm_gpuva_swapped - indicates whether the backing BO of this &drm_gpuva + * is swapped + * @va: the &drm_gpuva to check + */ +static inline bool drm_gpuva_swapped(struct drm_gpuva *va) +{ + return va->flags & DRM_GPUVA_SWAPPED; +} + +/** + * drm_gpuva_for_each_va_in_range - iternator to walk over a range of nodes + * @node__: &drm_gpuva structure to assign to in each iteration step + * @gpuva__: &drm_gpuva_manager structure to walk + * @start__: starting offset, the first node will overlap this + * @end__: ending offset, the last node will start before this (but may overlap) + * + * This iterator walks over all nodes in the range allocator that lie + * between @start and @end. It is implemented similarly to list_for_each(), + * but is using &drm_mm's internal interval tree to accelerate the search for + * the starting node, and hence isn't safe against removal of elements. It + * assumes that @end is within (or is the upper limit of) the &drm_gpuva_manager. + * If [@start, @end] are beyond the range of the &drm_gpuva_manager, the + * iterator may walk over the special _unallocated_ &drm_mm.head_node of the + * backing &drm_mm, and may even continue indefinitely. + */ +#define drm_gpuva_for_each_va_in_range(node__, gpuva__, start__, end__) \ + for (node__ = (struct drm_gpuva *)__drm_mm_interval_first(&(gpuva__)->va_mm, \ + (start__), (end__)-1); \ + node__->node.start < (end__); \ + node__ = (struct drm_gpuva *)list_next_entry(&node__->node, node_list)) + +/** + * drm_gpuva_for_each_va - iternator to walk over a range of nodes + * @entry: &drm_gpuva structure to assign to in each iteration step + * @gpuva: &drm_gpuva_manager structure to walk + * + * This iterator walks over all &drm_gpuva structures associated with the + * &drm_gpuva_manager. + */ +#define drm_gpuva_for_each_va(entry, gpuva) \ + list_for_each_entry(entry, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) + +/** + * drm_gpuva_for_each_va_safe - iternator to safely walk over a range of + * nodes + * @entry: &drm_gpuva structure to assign to in each iteration step + * @next: &next &drm_gpuva to store the next step + * @gpuva: &drm_gpuva_manager structure to walk + * + * This iterator walks over all &drm_gpuva structures associated with the + * &drm_gpuva_manager. It is implemented with list_for_each_safe(), so save + * against removal of elements. + */ +#define drm_gpuva_for_each_va_safe(entry, next, gpuva) \ + list_for_each_entry_safe(entry, next, drm_mm_nodes(&(gpuva)->va_mm), node.node_list) + +/** + * enum drm_gpuva_op_type - GPU VA operation type + * + * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager + * can be map, remap or unmap operations. + */ +enum drm_gpuva_op_type { + /** + * @DRM_GPUVA_OP_MAP: the map op type + */ + DRM_GPUVA_OP_MAP, + + /** + * @DRM_GPUVA_OP_REMAP: the remap op type + */ + DRM_GPUVA_OP_REMAP, + + /** + * @DRM_GPUVA_OP_UNMAP: the unmap op type + */ + DRM_GPUVA_OP_UNMAP, +}; + +/** + * struct drm_gpuva_op_map - GPU VA map operation + * + * This structure represents a single map operation generated by the + * DRM GPU VA manager. + */ +struct drm_gpuva_op_map { + /** + * @va: structure containing address and range of a map + * operation + */ + struct { + /** + * @addr: the base address of the new mapping + */ + u64 addr; + + /** + * @range: the range of the new mapping + */ + u64 range; + } va; + + /** + * @gem: structure containing the &drm_gem_object and it's offset + */ + struct { + /** + * @offset: the offset within the &drm_gem_object + */ + u64 offset; + + /** + * @obj: the &drm_gem_object to map + */ + struct drm_gem_object *obj; + } gem; +}; + +/** + * struct drm_gpuva_op_unmap - GPU VA unmap operation + * + * This structure represents a single unmap operation generated by the + * DRM GPU VA manager. + */ +struct drm_gpuva_op_unmap { + /** + * @va: the &drm_gpuva to unmap + */ + struct drm_gpuva *va; + + /** + * @keep: + * + * Indicates whether this &drm_gpuva is physically contiguous with the + * original mapping request. + * + * Optionally, if &keep is set, drivers may keep the actual page table + * mappings for this &drm_gpuva, adding the missing page table entries + * only and update the &drm_gpuva_manager accordingly. + */ + bool keep; +}; + +/** + * struct drm_gpuva_op_remap - GPU VA remap operation + * + * This represents a single remap operation generated by the DRM GPU VA manager. + * + * A remap operation is generated when an existing GPU VA mmapping is split up + * by inserting a new GPU VA mapping or by partially unmapping existent + * mapping(s), hence it consists of a maximum of two map and one unmap + * operation. + * + * The @unmap operation takes care of removing the original existing mapping. + * @prev is used to remap the preceding part, @next the subsequent part. + * + * If either a new mapping's start address is aligned with the start address + * of the old mapping or the new mapping's end address is aligned with the + * end address of the old mapping, either @prev or @next is NULL. + * + * Note, the reason for a dedicated remap operation, rather than arbitrary + * unmap and map operations, is to give drivers the chance of extracting driver + * specific data for creating the new mappings from the unmap operations's + * &drm_gpuva structure which typically is embedded in larger driver specific + * structures. + */ +struct drm_gpuva_op_remap { + /** + * @prev: the preceding part of a split mapping + */ + struct drm_gpuva_op_map *prev; + + /** + * @next: the subsequent part of a split mapping + */ + struct drm_gpuva_op_map *next; + + /** + * @unmap: the unmap operation for the original existing mapping + */ + struct drm_gpuva_op_unmap *unmap; +}; + +/** + * struct drm_gpuva_op - GPU VA operation + * + * This structure represents a single generic operation, which can be either + * map, unmap or remap. + * + * The particular type of the operation is defined by @op. + */ +struct drm_gpuva_op { + /** + * @entry: + * + * The &list_head used to distribute instances of this struct within + * &drm_gpuva_ops. + */ + struct list_head entry; + + /** + * @op: the type of the operation + */ + enum drm_gpuva_op_type op; + + union { + /** + * @map: the map operation + */ + struct drm_gpuva_op_map map; + + /** + * @unmap: the unmap operation + */ + struct drm_gpuva_op_unmap unmap; + + /** + * @remap: the remap operation + */ + struct drm_gpuva_op_remap remap; + }; +}; + +/** + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op + */ +struct drm_gpuva_ops { + /** + * @list: the &list_head + */ + struct list_head list; +}; + +/** + * drm_gpuva_for_each_op - iterator to walk over all ops + * @op: &drm_gpuva_op to assign in each iteration step + * @ops: &drm_gpuva_ops to walk + * + * This iterator walks over all ops within a given list of operations. + */ +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry) + +/** + * drm_gpuva_for_each_op_safe - iterator to safely walk over all ops + * @op: &drm_gpuva_op to assign in each iteration step + * @next: &next &drm_gpuva_op to store the next step + * @ops: &drm_gpuva_ops to walk + * + * This iterator walks over all ops within a given list of operations. It is + * implemented with list_for_each_safe(), so save against removal of elements. + */ +#define drm_gpuva_for_each_op_safe(op, next, ops) \ + list_for_each_entry_safe(op, next, &(ops)->list, entry) + +struct drm_gpuva_ops * +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr, + u64 addr, u64 range, + struct drm_gem_object *obj, u64 offset); +struct drm_gpuva_ops * +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr, + u64 addr, u64 range); +void drm_gpuva_ops_free(struct drm_gpuva_ops *ops); + +#endif /* __DRM_GPUVA_MGR_H__ */
This adds the infrastructure for a manager implementation to keep track of GPU virtual address (VA) mappings. New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers start implementing, allow userspace applications to request multiple and arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is intended to serve the following purposes in this context. 1) Provide a dedicated range allocator to track GPU VA allocations and mappings, making use of the drm_mm range allocator. 2) Generically connect GPU VA mappings to their backing buffers, in particular DRM GEM objects. 3) Provide a common implementation to perform more complex mapping operations on the GPU VA space. In particular splitting and merging of GPU VA mappings, e.g. for intersecting mapping requests or partial unmap requests. Idea-suggested-by: Dave Airlie <airlied@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> --- Documentation/gpu/drm-mm.rst | 31 + drivers/gpu/drm/Makefile | 1 + drivers/gpu/drm/drm_gem.c | 3 + drivers/gpu/drm/drm_gpuva_mgr.c | 1323 +++++++++++++++++++++++++++++++ include/drm/drm_drv.h | 6 + include/drm/drm_gem.h | 75 ++ include/drm/drm_gpuva_mgr.h | 527 ++++++++++++ 7 files changed, 1966 insertions(+) create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c create mode 100644 include/drm/drm_gpuva_mgr.h