Message ID | 20250218132356.1809075-6-rrichter@amd.com |
---|---|
State | New |
Headers | show |
Series | cxl: Address translation support, part 2: Generic support and AMD Zen5 platform enablement | expand |
On Tue, Feb 18, 2025 at 02:23:46PM +0100, Robert Richter wrote: > To find the correct region and root port of an endpoint of a system > needing address translation, the endpoint's HPA range must be > translated to each of the parent port address ranges up to the root > decoder. > > Calculate the SPA range using the newly introduced callback function > port->to_hpa() that translates the decoder's HPA range to its parent > port's HPA range of the next outer memory domain. Introduce the helper > function cxl_port_calc_hpa() for this to calculate address ranges > using the low-level port->to_hpa() callbacks. Determine the root port > SPA range by iterating all the ports up to the root. Store the > endpoint's SPA range for later use. > > Signed-off-by: Robert Richter <rrichter@amd.com> > > + /* > + * Address translation is only supported for auto-discovery of > + * decoders. There is no need to support address translation > + * here. That is, do not recalculate ctx.hpa_range here. > + */ Can we at least add why translation is not supported / needed? Just saying "there is no need" doesn't help devs after us understand the code. With that change: Reviewed-by: Gregory Price <gourry@gourry.net>
On 2/18/25 6:23 AM, Robert Richter wrote: > To find the correct region and root port of an endpoint of a system > needing address translation, the endpoint's HPA range must be > translated to each of the parent port address ranges up to the root > decoder. > > Calculate the SPA range using the newly introduced callback function > port->to_hpa() that translates the decoder's HPA range to its parent > port's HPA range of the next outer memory domain. Introduce the helper > function cxl_port_calc_hpa() for this to calculate address ranges > using the low-level port->to_hpa() callbacks. Determine the root port > SPA range by iterating all the ports up to the root. Store the > endpoint's SPA range for later use. > > Signed-off-by: Robert Richter <rrichter@amd.com> > --- > drivers/cxl/core/region.c | 81 ++++++++++++++++++++++++++++++++------- > drivers/cxl/cxl.h | 1 + > 2 files changed, 68 insertions(+), 14 deletions(-) > > diff --git a/drivers/cxl/core/region.c b/drivers/cxl/core/region.c > index 6f106bfa115f..d898c9f51113 100644 > --- a/drivers/cxl/core/region.c > +++ b/drivers/cxl/core/region.c > @@ -832,6 +832,44 @@ static int match_free_decoder(struct device *dev, const void *data) > return 1; > } > > +static int cxl_port_calc_hpa(struct cxl_port *port, struct cxl_decoder *cxld, > + struct range *hpa_range) > +{ > + struct range hpa = *hpa_range; > + u64 len = range_len(&hpa); > + > + if (!port->to_hpa) > + return 0; > + > + /* Translate HPA to the next upper domain. */ > + hpa.start = port->to_hpa(cxld, hpa.start); > + hpa.end = port->to_hpa(cxld, hpa.end); > + > + if (hpa.start == ULLONG_MAX || hpa.end == ULLONG_MAX) { > + dev_warn(&port->dev, > + "CXL address translation: HPA range invalid: %#llx-%#llx:%#llx-%#llx(%s)\n", > + hpa.start, hpa.end, hpa_range->start, > + hpa_range->end, dev_name(&cxld->dev)); > + return -ENXIO; > + } > + > + if (range_len(&hpa) != len * cxld->interleave_ways) { > + dev_warn(&port->dev, > + "CXL address translation: HPA range not contiguous: %#llx-%#llx:%#llx-%#llx(%s)\n", > + hpa.start, hpa.end, hpa_range->start, > + hpa_range->end, dev_name(&cxld->dev)); > + return -ENXIO; > + } > + > + if (hpa.start == hpa_range->start && hpa.end == hpa_range->end) > + return 0; > + > + *hpa_range = hpa; > + > + /* Return 1 if modified. */ > + return 1; > +} > + > static int match_auto_decoder(struct device *dev, const void *data) > { > const struct cxl_region_params *p = data; > @@ -1882,6 +1920,11 @@ static int cxl_calc_interleave_pos(struct cxl_endpoint_decoder *cxled) > .hpa_range = &cxled->cxld.hpa_range, > }; > > + /* > + * Address translation is only supported for auto-discovery of > + * decoders. There is no need to support address translation > + * here. That is, do not recalculate ctx.hpa_range here. > + */ Should this go with patch 3? > for (iter = cxled_to_port(cxled); pos >= 0 && iter; > iter = parent_port_of(iter)) > pos = cxl_port_calc_interleave(iter, &ctx); > @@ -3262,7 +3305,8 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) > { > struct cxl_memdev *cxlmd = cxled_to_memdev(cxled); > struct cxl_port *iter = cxled_to_port(cxled); > - struct cxl_decoder *root, *cxld = &cxled->cxld; > + struct cxl_port *parent = parent_port_of(iter); > + struct cxl_decoder *cxld = &cxled->cxld; > struct range hpa = cxld->hpa_range; > struct cxl_interleave_context ctx; > int rc; > @@ -3271,25 +3315,33 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) > .hpa_range = &hpa, > }; > > - while (iter && !is_cxl_root(iter)) { > + if (!iter || !parent) While parent_port_of() will check NULL and return NULL, it just seems icky checking the pointer after use. DJ > + return -ENXIO; > + > + while (iter && parent) { > + /* Translate HPA to the next upper memory domain. */ > + rc = cxl_port_calc_hpa(parent, cxld, &hpa); > + if (rc < 0) > + return rc; > + > /* Convert interleave settings to next port upstream. */ > rc = cxl_port_calc_interleave(iter, &ctx); > if (rc < 0) > return rc; > > - iter = parent_port_of(iter); > - } > + iter = parent; > + parent = parent_port_of(iter); > > - if (!iter) > - return -ENXIO; > + if (!parent || parent->to_hpa) > + cxld = cxl_port_find_switch_decoder(iter, &hpa); > > - root = cxl_port_find_switch_decoder(iter, hpa); > - if (!root) { > - dev_err(cxlmd->dev.parent, > - "%s:%s no CXL window for range %#llx:%#llx\n", > - dev_name(&cxlmd->dev), dev_name(&cxld->dev), > - cxld->hpa_range.start, cxld->hpa_range.end); > - return -ENXIO; > + if (!cxld) { > + dev_err(cxlmd->dev.parent, > + "%s:%s no CXL window for range %#llx:%#llx\n", > + dev_name(&cxlmd->dev), dev_name(&cxled->cxld.dev), > + hpa.start, hpa.end); > + return -ENXIO; > + } > } > > dev_dbg(cxld->dev.parent, > @@ -3297,7 +3349,8 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) > dev_name(&cxled->cxld.dev), dev_name(&cxld->dev), > hpa.start, hpa.end, ctx.pos); > > - cxled->cxlrd = to_cxl_root_decoder(&root->dev); > + cxled->cxlrd = to_cxl_root_decoder(&cxld->dev); > + cxled->spa_range = hpa; > cxled->pos = ctx.pos; > > return 0; > diff --git a/drivers/cxl/cxl.h b/drivers/cxl/cxl.h > index 17496784f021..7303aec1c31c 100644 > --- a/drivers/cxl/cxl.h > +++ b/drivers/cxl/cxl.h > @@ -394,6 +394,7 @@ struct cxl_endpoint_decoder { > struct cxl_decoder cxld; > struct cxl_root_decoder *cxlrd; > struct resource *dpa_res; > + struct range spa_range; > resource_size_t skip; > enum cxl_decoder_state state; > int part;
On Thu, Feb 20, 2025 at 03:31:45PM -0700, Dave Jiang wrote: > > for (iter = cxled_to_port(cxled); pos >= 0 && iter; > > iter = parent_port_of(iter)) > > pos = cxl_port_calc_interleave(iter, &ctx); > > @@ -3262,7 +3305,8 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) > > { > > struct cxl_memdev *cxlmd = cxled_to_memdev(cxled); > > struct cxl_port *iter = cxled_to_port(cxled); > > - struct cxl_decoder *root, *cxld = &cxled->cxld; > > + struct cxl_port *parent = parent_port_of(iter); > > + struct cxl_decoder *cxld = &cxled->cxld; > > struct range hpa = cxld->hpa_range; > > struct cxl_interleave_context ctx; > > int rc; > > @@ -3271,25 +3315,33 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) > > .hpa_range = &hpa, > > }; > > > > - while (iter && !is_cxl_root(iter)) { > > + if (!iter || !parent) > > While parent_port_of() will check NULL and return NULL, it just seems icky checking the pointer after use. > I briefly had the same thought and dug into parent_port_of, erred on the side of "How many places is cxl_endpoint_decoder_initialize going to be called?" - but you're probably right, we probably should do the null check if only for style. ~Gregory
diff --git a/drivers/cxl/core/region.c b/drivers/cxl/core/region.c index 6f106bfa115f..d898c9f51113 100644 --- a/drivers/cxl/core/region.c +++ b/drivers/cxl/core/region.c @@ -832,6 +832,44 @@ static int match_free_decoder(struct device *dev, const void *data) return 1; } +static int cxl_port_calc_hpa(struct cxl_port *port, struct cxl_decoder *cxld, + struct range *hpa_range) +{ + struct range hpa = *hpa_range; + u64 len = range_len(&hpa); + + if (!port->to_hpa) + return 0; + + /* Translate HPA to the next upper domain. */ + hpa.start = port->to_hpa(cxld, hpa.start); + hpa.end = port->to_hpa(cxld, hpa.end); + + if (hpa.start == ULLONG_MAX || hpa.end == ULLONG_MAX) { + dev_warn(&port->dev, + "CXL address translation: HPA range invalid: %#llx-%#llx:%#llx-%#llx(%s)\n", + hpa.start, hpa.end, hpa_range->start, + hpa_range->end, dev_name(&cxld->dev)); + return -ENXIO; + } + + if (range_len(&hpa) != len * cxld->interleave_ways) { + dev_warn(&port->dev, + "CXL address translation: HPA range not contiguous: %#llx-%#llx:%#llx-%#llx(%s)\n", + hpa.start, hpa.end, hpa_range->start, + hpa_range->end, dev_name(&cxld->dev)); + return -ENXIO; + } + + if (hpa.start == hpa_range->start && hpa.end == hpa_range->end) + return 0; + + *hpa_range = hpa; + + /* Return 1 if modified. */ + return 1; +} + static int match_auto_decoder(struct device *dev, const void *data) { const struct cxl_region_params *p = data; @@ -1882,6 +1920,11 @@ static int cxl_calc_interleave_pos(struct cxl_endpoint_decoder *cxled) .hpa_range = &cxled->cxld.hpa_range, }; + /* + * Address translation is only supported for auto-discovery of + * decoders. There is no need to support address translation + * here. That is, do not recalculate ctx.hpa_range here. + */ for (iter = cxled_to_port(cxled); pos >= 0 && iter; iter = parent_port_of(iter)) pos = cxl_port_calc_interleave(iter, &ctx); @@ -3262,7 +3305,8 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) { struct cxl_memdev *cxlmd = cxled_to_memdev(cxled); struct cxl_port *iter = cxled_to_port(cxled); - struct cxl_decoder *root, *cxld = &cxled->cxld; + struct cxl_port *parent = parent_port_of(iter); + struct cxl_decoder *cxld = &cxled->cxld; struct range hpa = cxld->hpa_range; struct cxl_interleave_context ctx; int rc; @@ -3271,25 +3315,33 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) .hpa_range = &hpa, }; - while (iter && !is_cxl_root(iter)) { + if (!iter || !parent) + return -ENXIO; + + while (iter && parent) { + /* Translate HPA to the next upper memory domain. */ + rc = cxl_port_calc_hpa(parent, cxld, &hpa); + if (rc < 0) + return rc; + /* Convert interleave settings to next port upstream. */ rc = cxl_port_calc_interleave(iter, &ctx); if (rc < 0) return rc; - iter = parent_port_of(iter); - } + iter = parent; + parent = parent_port_of(iter); - if (!iter) - return -ENXIO; + if (!parent || parent->to_hpa) + cxld = cxl_port_find_switch_decoder(iter, &hpa); - root = cxl_port_find_switch_decoder(iter, hpa); - if (!root) { - dev_err(cxlmd->dev.parent, - "%s:%s no CXL window for range %#llx:%#llx\n", - dev_name(&cxlmd->dev), dev_name(&cxld->dev), - cxld->hpa_range.start, cxld->hpa_range.end); - return -ENXIO; + if (!cxld) { + dev_err(cxlmd->dev.parent, + "%s:%s no CXL window for range %#llx:%#llx\n", + dev_name(&cxlmd->dev), dev_name(&cxled->cxld.dev), + hpa.start, hpa.end); + return -ENXIO; + } } dev_dbg(cxld->dev.parent, @@ -3297,7 +3349,8 @@ static int cxl_endpoint_decoder_initialize(struct cxl_endpoint_decoder *cxled) dev_name(&cxled->cxld.dev), dev_name(&cxld->dev), hpa.start, hpa.end, ctx.pos); - cxled->cxlrd = to_cxl_root_decoder(&root->dev); + cxled->cxlrd = to_cxl_root_decoder(&cxld->dev); + cxled->spa_range = hpa; cxled->pos = ctx.pos; return 0; diff --git a/drivers/cxl/cxl.h b/drivers/cxl/cxl.h index 17496784f021..7303aec1c31c 100644 --- a/drivers/cxl/cxl.h +++ b/drivers/cxl/cxl.h @@ -394,6 +394,7 @@ struct cxl_endpoint_decoder { struct cxl_decoder cxld; struct cxl_root_decoder *cxlrd; struct resource *dpa_res; + struct range spa_range; resource_size_t skip; enum cxl_decoder_state state; int part;
To find the correct region and root port of an endpoint of a system needing address translation, the endpoint's HPA range must be translated to each of the parent port address ranges up to the root decoder. Calculate the SPA range using the newly introduced callback function port->to_hpa() that translates the decoder's HPA range to its parent port's HPA range of the next outer memory domain. Introduce the helper function cxl_port_calc_hpa() for this to calculate address ranges using the low-level port->to_hpa() callbacks. Determine the root port SPA range by iterating all the ports up to the root. Store the endpoint's SPA range for later use. Signed-off-by: Robert Richter <rrichter@amd.com> --- drivers/cxl/core/region.c | 81 ++++++++++++++++++++++++++++++++------- drivers/cxl/cxl.h | 1 + 2 files changed, 68 insertions(+), 14 deletions(-)