Message ID | 20230209152628.129914-6-ajones@ventanamicro.com (mailing list archive) |
---|---|
State | Superseded |
Delegated to: | Palmer Dabbelt |
Headers | show |
Series | RISC-V: Apply Zicboz to clear_page | expand |
Context | Check | Description |
---|---|---|
conchuod/tree_selection | fail | Failed to apply to next/pending-fixes or riscv/for-next |
Hey Drew, On Thu, Feb 09, 2023 at 04:26:25PM +0100, Andrew Jones wrote: > When [ab]using alternatives as cpufeature "static keys", which can > be used in assembly, also put vendor_id to work as application- > specific data. This will be initially used in Zicboz's application to > clear_page(), as Zicboz's block size must also be considered. In that > case, vendor_id's role will be to convey the maximum block size which > the Zicboz clear_page() implementation supports. > > cpufeature alternative applications which need to check for the > existence or absence of other cpufeatures may also be able to make > use of this. > > Signed-off-by: Andrew Jones <ajones@ventanamicro.com> > --- > arch/riscv/kernel/cpufeature.c | 10 ++++++++-- > 1 file changed, 8 insertions(+), 2 deletions(-) > > diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c > index 0d2db03cf167..74736b4f0624 100644 > --- a/arch/riscv/kernel/cpufeature.c > +++ b/arch/riscv/kernel/cpufeature.c > @@ -278,6 +278,11 @@ void __init riscv_fill_hwcap(void) > } > > #ifdef CONFIG_RISCV_ALTERNATIVE I think a comment here about what "application check" means would be nice. That wording just feels clunky & the meaning is not immediately graspable? /* * riscv_cpufeature_application_check() - Check if a cpufeature applies. * The presence of a cpufeature does not mean it is necessarily * useable. This function is used to apply the alternative on a * case-by-case basis. */ Dunno, does something like that convey the intent? > +static bool riscv_cpufeature_application_check(u32 feature, u16 data) > +{ > + return data == 0; > +} > + > void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > struct alt_entry *end, > unsigned int stage) > @@ -289,8 +294,6 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > return; > > for (alt = begin; alt < end; alt++) { > - if (alt->vendor_id != 0) > - continue; Can you remind me what makes this "safe"? My understanding was that a vendor_id of zero was safe, as zero is reserved in JEDEC. What is stopping someone stuffing this with a given value and colliding with a real vendor's errata? for (alt = begin; alt < end; alt++) { if (alt->vendor_id != A_VENDOR_ID) continue; if (alt->errata_id >= ERRATA_A_NUMBER) continue; tmp = (1U << alt->errata_id); if (cpu_req_errata & tmp) { oldptr = ALT_OLD_PTR(alt); altptr = ALT_ALT_PTR(alt); /* On vm-alternatives, the mmu isn't running yet */ if (stage == RISCV_ALTERNATIVES_EARLY_BOOT) memcpy((void *)__pa_symbol(oldptr), (void *)__pa_symbol(altptr), alt->alt_len); else patch_text_nosync(oldptr, altptr, alt->alt_len); } } I've probably just missing something, my brain swapped out alternatives the other week. Hopefully whatever I missed isn't embarrassingly obvious! Cheers, Conor. > if (alt->errata_id >= RISCV_ISA_EXT_MAX) { > WARN(1, "This extension id:%d is not in ISA extension list", > alt->errata_id); > @@ -300,6 +303,9 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > if (!__riscv_isa_extension_available(NULL, alt->errata_id)) > continue; > > + if (!riscv_cpufeature_application_check(alt->errata_id, alt->vendor_id)) > + continue; > + > oldptr = ALT_OLD_PTR(alt); > altptr = ALT_ALT_PTR(alt); > patch_text_nosync(oldptr, altptr, alt->alt_len); > -- > 2.39.1 >
On Thu, Feb 09, 2023 at 07:04:59PM +0000, Conor Dooley wrote: > Hey Drew, > > On Thu, Feb 09, 2023 at 04:26:25PM +0100, Andrew Jones wrote: > > When [ab]using alternatives as cpufeature "static keys", which can > > be used in assembly, also put vendor_id to work as application- > > specific data. This will be initially used in Zicboz's application to > > clear_page(), as Zicboz's block size must also be considered. In that > > case, vendor_id's role will be to convey the maximum block size which > > the Zicboz clear_page() implementation supports. > > > > cpufeature alternative applications which need to check for the > > existence or absence of other cpufeatures may also be able to make > > use of this. > > > > Signed-off-by: Andrew Jones <ajones@ventanamicro.com> > > --- > > arch/riscv/kernel/cpufeature.c | 10 ++++++++-- > > 1 file changed, 8 insertions(+), 2 deletions(-) > > > > diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c > > index 0d2db03cf167..74736b4f0624 100644 > > --- a/arch/riscv/kernel/cpufeature.c > > +++ b/arch/riscv/kernel/cpufeature.c > > @@ -278,6 +278,11 @@ void __init riscv_fill_hwcap(void) > > } > > > > #ifdef CONFIG_RISCV_ALTERNATIVE > > I think a comment here about what "application check" means would be > nice. > That wording just feels clunky & the meaning is not immediately > graspable? > > /* > * riscv_cpufeature_application_check() - Check if a cpufeature applies. > * The presence of a cpufeature does not mean it is necessarily > * useable. This function is used to apply the alternative on a > * case-by-case basis. > */ > > Dunno, does something like that convey the intent? Indeed, a comment would be helpful. I'll put something similar to what you propose in the next version. > > > +static bool riscv_cpufeature_application_check(u32 feature, u16 data) > > +{ > > + return data == 0; > > +} > > + > > void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > > struct alt_entry *end, > > unsigned int stage) > > @@ -289,8 +294,6 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > > return; > > > > for (alt = begin; alt < end; alt++) { > > - if (alt->vendor_id != 0) > > - continue; > > Can you remind me what makes this "safe"? > My understanding was that a vendor_id of zero was safe, as zero is > reserved in JEDEC. > What is stopping someone stuffing this with a given value and > colliding with a real vendor's errata? > > for (alt = begin; alt < end; alt++) { > if (alt->vendor_id != A_VENDOR_ID) > continue; > if (alt->errata_id >= ERRATA_A_NUMBER) > continue; > > tmp = (1U << alt->errata_id); > if (cpu_req_errata & tmp) { > oldptr = ALT_OLD_PTR(alt); > altptr = ALT_ALT_PTR(alt); > > /* On vm-alternatives, the mmu isn't running yet */ > if (stage == RISCV_ALTERNATIVES_EARLY_BOOT) > memcpy((void *)__pa_symbol(oldptr), > (void *)__pa_symbol(altptr), > alt->alt_len); > else > patch_text_nosync(oldptr, altptr, alt->alt_len); > } > } > > I've probably just missing something, my brain swapped out alternatives > the other week. Hopefully whatever I missed isn't embarrassingly obvious! You're right. I was assuming the errata_id space for errata didn't overlap with the errata_id space for cpufeatures. It doesn't, atm, but by luck, not design. I could try to ensure that somehow, but probably the better approach would be to use the upper bits of errata_id for the application data and to leave vendor_id alone. Thanks for catching my oversight! Thanks, drew > > Cheers, > Conor. > > > if (alt->errata_id >= RISCV_ISA_EXT_MAX) { > > WARN(1, "This extension id:%d is not in ISA extension list", > > alt->errata_id); > > @@ -300,6 +303,9 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > > if (!__riscv_isa_extension_available(NULL, alt->errata_id)) > > continue; > > > > + if (!riscv_cpufeature_application_check(alt->errata_id, alt->vendor_id)) > > + continue; > > + > > oldptr = ALT_OLD_PTR(alt); > > altptr = ALT_ALT_PTR(alt); > > patch_text_nosync(oldptr, altptr, alt->alt_len); > > -- > > 2.39.1 > >
On Fri, Feb 10, 2023 at 08:58:18AM +0100, Andrew Jones wrote: > On Thu, Feb 09, 2023 at 07:04:59PM +0000, Conor Dooley wrote: > > On Thu, Feb 09, 2023 at 04:26:25PM +0100, Andrew Jones wrote: > > > +static bool riscv_cpufeature_application_check(u32 feature, u16 data) > > > +{ > > > + return data == 0; > > > +} > > > + > > > void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > > > struct alt_entry *end, > > > unsigned int stage) > > > @@ -289,8 +294,6 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > > > return; > > > > > > for (alt = begin; alt < end; alt++) { > > > - if (alt->vendor_id != 0) > > > - continue; > > > > Can you remind me what makes this "safe"? > > My understanding was that a vendor_id of zero was safe, as zero is > > reserved in JEDEC. > > What is stopping someone stuffing this with a given value and > > colliding with a real vendor's errata? > > > > for (alt = begin; alt < end; alt++) { > > if (alt->vendor_id != A_VENDOR_ID) > > continue; > > if (alt->errata_id >= ERRATA_A_NUMBER) > > continue; > > > > tmp = (1U << alt->errata_id); > > if (cpu_req_errata & tmp) { > > oldptr = ALT_OLD_PTR(alt); > > altptr = ALT_ALT_PTR(alt); > > > > /* On vm-alternatives, the mmu isn't running yet */ > > if (stage == RISCV_ALTERNATIVES_EARLY_BOOT) > > memcpy((void *)__pa_symbol(oldptr), > > (void *)__pa_symbol(altptr), > > alt->alt_len); > > else > > patch_text_nosync(oldptr, altptr, alt->alt_len); > > } > > } > > > > I've probably just missing something, my brain swapped out alternatives > > the other week. Hopefully whatever I missed isn't embarrassingly obvious! > > You're right. I was assuming the errata_id space for errata didn't overlap > with the errata_id space for cpufeatures. It doesn't, atm, but by luck, > not design. I could try to ensure that somehow, but probably the better > approach would be to use the upper bits of errata_id for the application > data and to leave vendor_id alone. Thanks for catching my oversight! Sounds like a better idea at least. I suppose the ideal would be to add another arg and not abuse things, but maybe that's one for the future, idk. Is this somewhere that an assertion should be used to make sure that we don't grow the list of extensions such that the regions collide?
On Fri, Feb 10, 2023 at 08:29:15PM +0000, Conor Dooley wrote: > On Fri, Feb 10, 2023 at 08:58:18AM +0100, Andrew Jones wrote: > > On Thu, Feb 09, 2023 at 07:04:59PM +0000, Conor Dooley wrote: > > > On Thu, Feb 09, 2023 at 04:26:25PM +0100, Andrew Jones wrote: > > > > > +static bool riscv_cpufeature_application_check(u32 feature, u16 data) > > > > +{ > > > > + return data == 0; > > > > +} > > > > + > > > > void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > > > > struct alt_entry *end, > > > > unsigned int stage) > > > > @@ -289,8 +294,6 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, > > > > return; > > > > > > > > for (alt = begin; alt < end; alt++) { > > > > - if (alt->vendor_id != 0) > > > > - continue; > > > > > > Can you remind me what makes this "safe"? > > > My understanding was that a vendor_id of zero was safe, as zero is > > > reserved in JEDEC. > > > What is stopping someone stuffing this with a given value and > > > colliding with a real vendor's errata? > > > > > > for (alt = begin; alt < end; alt++) { > > > if (alt->vendor_id != A_VENDOR_ID) > > > continue; > > > if (alt->errata_id >= ERRATA_A_NUMBER) > > > continue; > > > > > > tmp = (1U << alt->errata_id); > > > if (cpu_req_errata & tmp) { > > > oldptr = ALT_OLD_PTR(alt); > > > altptr = ALT_ALT_PTR(alt); > > > > > > /* On vm-alternatives, the mmu isn't running yet */ > > > if (stage == RISCV_ALTERNATIVES_EARLY_BOOT) > > > memcpy((void *)__pa_symbol(oldptr), > > > (void *)__pa_symbol(altptr), > > > alt->alt_len); > > > else > > > patch_text_nosync(oldptr, altptr, alt->alt_len); > > > } > > > } > > > > > > I've probably just missing something, my brain swapped out alternatives > > > the other week. Hopefully whatever I missed isn't embarrassingly obvious! > > > > You're right. I was assuming the errata_id space for errata didn't overlap > > with the errata_id space for cpufeatures. It doesn't, atm, but by luck, > > not design. I could try to ensure that somehow, but probably the better > > approach would be to use the upper bits of errata_id for the application > > data and to leave vendor_id alone. Thanks for catching my oversight! > > Sounds like a better idea at least. > I suppose the ideal would be to add another arg and not abuse things, > but maybe that's one for the future, idk. We could add another arg without too much trouble and even use some macro trickier to only pass it when it's needed. OTOH, the errata_id is 32-bits, which is overkill, since errata are assigned increasing integers from zero, and I don't want to over-bloat this structure. Maybe I can split errata_id into two 16-bit parameters to avoid completely adding yet another. > Is this somewhere that an assertion should be used to make sure that > we don't grow the list of extensions such that the regions collide? It's probably best to split errata_id for the application data and not worry about collisions. Thanks, drew
diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c index 0d2db03cf167..74736b4f0624 100644 --- a/arch/riscv/kernel/cpufeature.c +++ b/arch/riscv/kernel/cpufeature.c @@ -278,6 +278,11 @@ void __init riscv_fill_hwcap(void) } #ifdef CONFIG_RISCV_ALTERNATIVE +static bool riscv_cpufeature_application_check(u32 feature, u16 data) +{ + return data == 0; +} + void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, struct alt_entry *end, unsigned int stage) @@ -289,8 +294,6 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, return; for (alt = begin; alt < end; alt++) { - if (alt->vendor_id != 0) - continue; if (alt->errata_id >= RISCV_ISA_EXT_MAX) { WARN(1, "This extension id:%d is not in ISA extension list", alt->errata_id); @@ -300,6 +303,9 @@ void __init_or_module riscv_cpufeature_patch_func(struct alt_entry *begin, if (!__riscv_isa_extension_available(NULL, alt->errata_id)) continue; + if (!riscv_cpufeature_application_check(alt->errata_id, alt->vendor_id)) + continue; + oldptr = ALT_OLD_PTR(alt); altptr = ALT_ALT_PTR(alt); patch_text_nosync(oldptr, altptr, alt->alt_len);
When [ab]using alternatives as cpufeature "static keys", which can be used in assembly, also put vendor_id to work as application- specific data. This will be initially used in Zicboz's application to clear_page(), as Zicboz's block size must also be considered. In that case, vendor_id's role will be to convey the maximum block size which the Zicboz clear_page() implementation supports. cpufeature alternative applications which need to check for the existence or absence of other cpufeatures may also be able to make use of this. Signed-off-by: Andrew Jones <ajones@ventanamicro.com> --- arch/riscv/kernel/cpufeature.c | 10 ++++++++-- 1 file changed, 8 insertions(+), 2 deletions(-)