Message ID | 20220508160749.984-3-jszhang@kernel.org (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | unified way to use static key and optimize pgtable_l4_enabled | expand |
On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > Currently, riscv has several features why may not be supported on all > riscv platforms, for example, FPU, SV48 and so on. To support unified > kernel Image style, we need to check whether the feature is suportted > or not. If the check sits at hot code path, then performance will be > impacted a lot. static key can be used to solve the issue. In the past > FPU support has been converted to use static key mechanism. I believe > we will have similar cases in the future. It's not just FPU and Sv48. There are several others such as Svinval, Vector, Svnapot, Svpbmt, and many many others. Overall, I agree with the approach of using static key array but I disagree with the semantics and the duplicate stuff being added. Please see more comments below .. > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > patch tries to add an unified mechanism to use static keys for all > the cpu features by implementing an array of default-false static keys > and enabling them when detected. The cpus_have_*_cap() check uses the > static keys if riscv_const_caps_ready is finalized, otherwise the > compiler generates the bitmap test. First of all, we should stop calling this a feature (like ARM does). Rather, we should call these as isa extensions ("isaext") to align with the RISC-V priv spec and RISC-V profiles spec. For all the ISA optionalities which do not have distinct extension name, the RISC-V profiles spec is assigning names to all such optionalities. Another issue with semantics is that this patch assumes all features are enabled by default and we selectively disable it. This contrary to the approach taken by existing arch/riscv/kernel/cpufeature.c which assumes nothing is enabled by default and we selectively enable it. > > Signed-off-by: Jisheng Zhang <jszhang@kernel.org> > --- > arch/riscv/Makefile | 3 + > arch/riscv/include/asm/cpufeature.h | 94 +++++++++++++++++++++++++++++ > arch/riscv/kernel/cpufeature.c | 23 +++++++ > arch/riscv/tools/Makefile | 22 +++++++ > arch/riscv/tools/cpucaps | 5 ++ > arch/riscv/tools/gen-cpucaps.awk | 40 ++++++++++++ > 6 files changed, 187 insertions(+) > create mode 100644 arch/riscv/include/asm/cpufeature.h > create mode 100644 arch/riscv/tools/Makefile > create mode 100644 arch/riscv/tools/cpucaps > create mode 100755 arch/riscv/tools/gen-cpucaps.awk > > diff --git a/arch/riscv/Makefile b/arch/riscv/Makefile > index 7d81102cffd4..f4df67369d84 100644 > --- a/arch/riscv/Makefile > +++ b/arch/riscv/Makefile > @@ -154,3 +154,6 @@ PHONY += rv64_randconfig > rv64_randconfig: > $(Q)$(MAKE) KCONFIG_ALLCONFIG=$(srctree)/arch/riscv/configs/64-bit.config \ > -f $(srctree)/Makefile randconfig > + > +archprepare: > + $(Q)$(MAKE) $(build)=arch/riscv/tools kapi > diff --git a/arch/riscv/include/asm/cpufeature.h b/arch/riscv/include/asm/cpufeature.h > new file mode 100644 > index 000000000000..d80ddd2f3b49 > --- /dev/null > +++ b/arch/riscv/include/asm/cpufeature.h We don't need a separate header for this. All this belongs to arch/riscv/include/asm/hwcap.h > @@ -0,0 +1,94 @@ > +/* SPDX-License-Identifier: GPL-2.0-only */ > +/* > + * Copyright (C) 2014 Linaro Ltd. <ard.biesheuvel@linaro.org> > + * Copyright (C) 2022 Jisheng Zhang <jszhang@kernel.org> > + */ > + > +#ifndef __ASM_CPUFEATURE_H > +#define __ASM_CPUFEATURE_H > + > +#include <asm/cpucaps.h> > + > +#include <linux/bug.h> > +#include <linux/jump_label.h> > +#include <linux/kernel.h> > + > +extern DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); This is a redundant bitmap. Please re-use "riscv_isa" bitmap for this the ISA extensions. > +extern struct static_key_false cpu_hwcap_keys[RISCV_NCAPS]; This should be called "riscv_isa_keys" > +extern struct static_key_false riscv_const_caps_ready; This should be called "riscv_isa_keys_ready". > + > +static __always_inline bool system_capabilities_finalized(void) Another misaligned name. This should be called "riscv_isa_keys_finalized()". > +{ > + return static_branch_likely(&riscv_const_caps_ready); > +} > + > +/* > + * Test for a capability with a runtime check. > + * > + * Before the capability is detected, this returns false. > + */ > +static inline bool cpus_have_cap(unsigned int num) > +{ > + if (num >= RISCV_NCAPS) > + return false; > + return test_bit(num, cpu_hwcaps); > +} This should be called riscv_isa_have_extension() and it should internally call "__riscv_isa_extension_available(NULL, num)". > + > +/* > + * Test for a capability without a runtime check. > + * > + * Before capabilities are finalized, this returns false. > + * After capabilities are finalized, this is patched to avoid a runtime check. > + * > + * @num must be a compile-time constant. > + */ > +static __always_inline bool __cpus_have_const_cap(int num) This should be named "__riscv_isa_have_const_extension()" > +{ > + if (num >= RISCV_NCAPS) > + return false; > + return static_branch_unlikely(&cpu_hwcap_keys[num]); > +} > + > +/* > + * Test for a capability without a runtime check. > + * > + * Before capabilities are finalized, this will BUG(). > + * After capabilities are finalized, this is patched to avoid a runtime check. > + * > + * @num must be a compile-time constant. > + */ > +static __always_inline bool cpus_have_final_cap(int num) This should be called "riscv_isa_have_final_extension()" > +{ > + if (system_capabilities_finalized()) > + return __cpus_have_const_cap(num); > + else > + BUG(); > +} > + > +/* > + * Test for a capability, possibly with a runtime check. > + * > + * Before capabilities are finalized, this behaves as cpus_have_cap(). > + * After capabilities are finalized, this is patched to avoid a runtime check. > + * > + * @num must be a compile-time constant. > + */ > +static __always_inline bool cpus_have_const_cap(int num) Same comment as above. > +{ > + if (system_capabilities_finalized()) > + return __cpus_have_const_cap(num); > + else > + return cpus_have_cap(num); > +} > + > +static inline void cpus_set_cap(unsigned int num) Same comment as above. > +{ > + if (num >= RISCV_NCAPS) { > + pr_warn("Attempt to set an illegal CPU capability (%d >= %d)\n", > + num, RISCV_NCAPS); > + } else { > + __set_bit(num, cpu_hwcaps); > + } > +} > + > +#endif > diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c > index 1b2d42d7f589..e6c72cad0c1c 100644 > --- a/arch/riscv/kernel/cpufeature.c > +++ b/arch/riscv/kernel/cpufeature.c > @@ -9,6 +9,7 @@ > #include <linux/bitmap.h> > #include <linux/ctype.h> > #include <linux/of.h> > +#include <asm/cpufeature.h> > #include <asm/processor.h> > #include <asm/hwcap.h> > #include <asm/smp.h> > @@ -25,6 +26,15 @@ static DECLARE_BITMAP(riscv_isa, RISCV_ISA_EXT_MAX) __read_mostly; > __ro_after_init DEFINE_STATIC_KEY_FALSE(cpu_hwcap_fpu); > #endif > > +DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); > +EXPORT_SYMBOL(cpu_hwcaps); Just like the previous comment. This is a redundant bitmap. Please use "riscv_isa" bitmap for this purpose. > + > +DEFINE_STATIC_KEY_ARRAY_FALSE(cpu_hwcap_keys, RISCV_NCAPS); > +EXPORT_SYMBOL(cpu_hwcap_keys); > + > +DEFINE_STATIC_KEY_FALSE(riscv_const_caps_ready); > +EXPORT_SYMBOL(riscv_const_caps_ready); Please see comments above. > + > /** > * riscv_isa_extension_base() - Get base extension word > * > @@ -62,6 +72,17 @@ bool __riscv_isa_extension_available(const unsigned long *isa_bitmap, int bit) > } > EXPORT_SYMBOL_GPL(__riscv_isa_extension_available); > > +static void __init enable_cpu_capabilities(void) > +{ > + int i; > + > + for (i = 0; i < RISCV_NCAPS; i++) { > + if (!cpus_have_cap(i)) > + continue; > + static_branch_enable(&cpu_hwcap_keys[i]); > + } > +} > + > void __init riscv_fill_hwcap(void) > { > struct device_node *node; > @@ -236,4 +257,6 @@ void __init riscv_fill_hwcap(void) > if (elf_hwcap & (COMPAT_HWCAP_ISA_F | COMPAT_HWCAP_ISA_D)) > static_branch_enable(&cpu_hwcap_fpu); > #endif > + enable_cpu_capabilities(); > + static_branch_enable(&riscv_const_caps_ready); > } > diff --git a/arch/riscv/tools/Makefile b/arch/riscv/tools/Makefile > new file mode 100644 > index 000000000000..932b4fe5c768 > --- /dev/null > +++ b/arch/riscv/tools/Makefile > @@ -0,0 +1,22 @@ > +# SPDX-License-Identifier: GPL-2.0 > + > +gen := arch/$(ARCH)/include/generated > +kapi := $(gen)/asm > + > +kapi-hdrs-y := $(kapi)/cpucaps.h > + > +targets += $(addprefix ../../../,$(gen-y) $(kapi-hdrs-y)) > + > +PHONY += kapi > + > +kapi: $(kapi-hdrs-y) $(gen-y) > + > +# Create output directory if not already present > +_dummy := $(shell [ -d '$(kapi)' ] || mkdir -p '$(kapi)') > + > +quiet_cmd_gen_cpucaps = GEN $@ > + cmd_gen_cpucaps = mkdir -p $(dir $@) && \ > + $(AWK) -f $(filter-out $(PHONY),$^) > $@ > + > +$(kapi)/cpucaps.h: $(src)/gen-cpucaps.awk $(src)/cpucaps FORCE > + $(call if_changed,gen_cpucaps) > diff --git a/arch/riscv/tools/cpucaps b/arch/riscv/tools/cpucaps > new file mode 100644 > index 000000000000..cb1ff2747859 > --- /dev/null > +++ b/arch/riscv/tools/cpucaps > @@ -0,0 +1,5 @@ > +# SPDX-License-Identifier: GPL-2.0 > +# > +# Internal CPU capabilities constants, keep this list sorted > + > +HAS_NO_FPU How can "No FPU" be a CPU capability ? We have ISA extensions 'F' and 'D' which tells us whether an FPU is available or not. I think this file should be a table with two columns "<lower_case_extension_name> <parsed_from_isa_string_yes_no>" I this this file should look like this: i yes m yes a yes c yes f yes d yes h yes sv48 no sv57 no sstc yes svinval yes svpbmt yes svnapot yes sscofpmf yes ... > diff --git a/arch/riscv/tools/gen-cpucaps.awk b/arch/riscv/tools/gen-cpucaps.awk > new file mode 100755 > index 000000000000..52a1e1b064ad > --- /dev/null > +++ b/arch/riscv/tools/gen-cpucaps.awk > @@ -0,0 +1,40 @@ > +#!/bin/awk -f > +# SPDX-License-Identifier: GPL-2.0 > +# gen-cpucaps.awk: riscv cpucaps header generator > +# > +# Usage: awk -f gen-cpucaps.awk cpucaps.txt > + > +# Log an error and terminate > +function fatal(msg) { > + print "Error at line " NR ": " msg > "/dev/stderr" > + exit 1 > +} > + > +# skip blank lines and comment lines > +/^$/ { next } > +/^#/ { next } > + > +BEGIN { > + print "#ifndef __ASM_CPUCAPS_H" > + print "#define __ASM_CPUCAPS_H" > + print "" > + print "/* Generated file - do not edit */" > + cap_num = 0 > + print "" > +} > + > +/^[vA-Z0-9_]+$/ { > + printf("#define RISCV_%-30s\t%d\n", $0, cap_num++) > + next > +} > + > +END { > + printf("#define RISCV_NCAPS\t\t\t\t%d\n", cap_num) > + print "" > + print "#endif /* __ASM_CPUCAPS_H */" > +} This script need to change refer capabilities as extensions. For every extension, there should be two defines. For e.g. "sstc" extension should have following defines #define RISCV_ISA_EXT_sstc <#num> #define RISCV_ISA_EXT_FROMSTR_sstc <1|0> > + > +# Any lines not handled by previous rules are unexpected > +{ > + fatal("unhandled statement") > +} > -- > 2.34.1 > Regards, Anup
On Mon, May 09, 2022 at 09:17:10AM +0530, Anup Patel wrote: > On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > Currently, riscv has several features why may not be supported on all > > riscv platforms, for example, FPU, SV48 and so on. To support unified > > kernel Image style, we need to check whether the feature is suportted > > or not. If the check sits at hot code path, then performance will be > > impacted a lot. static key can be used to solve the issue. In the past > > FPU support has been converted to use static key mechanism. I believe > > we will have similar cases in the future. > > It's not just FPU and Sv48. There are several others such as Svinval, > Vector, Svnapot, Svpbmt, and many many others. > > Overall, I agree with the approach of using static key array but I > disagree with the semantics and the duplicate stuff being added. > > Please see more comments below .. > > > > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > > patch tries to add an unified mechanism to use static keys for all > > the cpu features by implementing an array of default-false static keys > > and enabling them when detected. The cpus_have_*_cap() check uses the > > static keys if riscv_const_caps_ready is finalized, otherwise the > > compiler generates the bitmap test. > > First of all, we should stop calling this a feature (like ARM does). Rather, > we should call these as isa extensions ("isaext") to align with the RISC-V > priv spec and RISC-V profiles spec. For all the ISA optionalities which do > not have distinct extension name, the RISC-V profiles spec is assigning > names to all such optionalities. Same as the reply a few minutes ago, the key problem here is do all CPU features belong to *ISA* extensions? For example, SV48, SV57 etc. I agree with Atish's comments here: "I think the cpu feature is a superset of the ISA extension. cpu feature != ISA extension" https://lore.kernel.org/linux-riscv/CAHBxVyF65jC_wvxcD6bueqpCY8-Kbahu1yxsSoBmO1s15dGkSQ@mail.gmail.com/ > > Another issue with semantics is that this patch assumes all features are > enabled by default and we selectively disable it. This contrary to the > approach taken by existing arch/riscv/kernel/cpufeature.c which assumes > nothing is enabled by default and we selectively enable it. This is implementation related, can be modified in next version. From another side, assuming some feature enabled by default can result in trivial performance improvement for most platforms. For example, if most platforms are FPU capable, we'd better assume FPU enabled by default. > > > > > Signed-off-by: Jisheng Zhang <jszhang@kernel.org> > > --- > > arch/riscv/Makefile | 3 + > > arch/riscv/include/asm/cpufeature.h | 94 +++++++++++++++++++++++++++++ > > arch/riscv/kernel/cpufeature.c | 23 +++++++ > > arch/riscv/tools/Makefile | 22 +++++++ > > arch/riscv/tools/cpucaps | 5 ++ > > arch/riscv/tools/gen-cpucaps.awk | 40 ++++++++++++ > > 6 files changed, 187 insertions(+) > > create mode 100644 arch/riscv/include/asm/cpufeature.h > > create mode 100644 arch/riscv/tools/Makefile > > create mode 100644 arch/riscv/tools/cpucaps > > create mode 100755 arch/riscv/tools/gen-cpucaps.awk > > > > diff --git a/arch/riscv/Makefile b/arch/riscv/Makefile > > index 7d81102cffd4..f4df67369d84 100644 > > --- a/arch/riscv/Makefile > > +++ b/arch/riscv/Makefile > > @@ -154,3 +154,6 @@ PHONY += rv64_randconfig > > rv64_randconfig: > > $(Q)$(MAKE) KCONFIG_ALLCONFIG=$(srctree)/arch/riscv/configs/64-bit.config \ > > -f $(srctree)/Makefile randconfig > > + > > +archprepare: > > + $(Q)$(MAKE) $(build)=arch/riscv/tools kapi > > diff --git a/arch/riscv/include/asm/cpufeature.h b/arch/riscv/include/asm/cpufeature.h > > new file mode 100644 > > index 000000000000..d80ddd2f3b49 > > --- /dev/null > > +++ b/arch/riscv/include/asm/cpufeature.h > > We don't need a separate header for this. > > All this belongs to arch/riscv/include/asm/hwcap.h > > > @@ -0,0 +1,94 @@ > > +/* SPDX-License-Identifier: GPL-2.0-only */ > > +/* > > + * Copyright (C) 2014 Linaro Ltd. <ard.biesheuvel@linaro.org> > > + * Copyright (C) 2022 Jisheng Zhang <jszhang@kernel.org> > > + */ > > + > > +#ifndef __ASM_CPUFEATURE_H > > +#define __ASM_CPUFEATURE_H > > + > > +#include <asm/cpucaps.h> > > + > > +#include <linux/bug.h> > > +#include <linux/jump_label.h> > > +#include <linux/kernel.h> > > + > > +extern DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); > > This is a redundant bitmap. Please re-use "riscv_isa" bitmap for this > the ISA extensions. > > > +extern struct static_key_false cpu_hwcap_keys[RISCV_NCAPS]; > > This should be called "riscv_isa_keys" > > > +extern struct static_key_false riscv_const_caps_ready; > > This should be called "riscv_isa_keys_ready". > > > + > > +static __always_inline bool system_capabilities_finalized(void) > > Another misaligned name. This should be called > "riscv_isa_keys_finalized()". > > > +{ > > + return static_branch_likely(&riscv_const_caps_ready); > > +} > > + > > +/* > > + * Test for a capability with a runtime check. > > + * > > + * Before the capability is detected, this returns false. > > + */ > > +static inline bool cpus_have_cap(unsigned int num) > > +{ > > + if (num >= RISCV_NCAPS) > > + return false; > > + return test_bit(num, cpu_hwcaps); > > +} > > This should be called riscv_isa_have_extension() and it should > internally call "__riscv_isa_extension_available(NULL, num)". > > > + > > +/* > > + * Test for a capability without a runtime check. > > + * > > + * Before capabilities are finalized, this returns false. > > + * After capabilities are finalized, this is patched to avoid a runtime check. > > + * > > + * @num must be a compile-time constant. > > + */ > > +static __always_inline bool __cpus_have_const_cap(int num) > > This should be named "__riscv_isa_have_const_extension()" > > > +{ > > + if (num >= RISCV_NCAPS) > > + return false; > > + return static_branch_unlikely(&cpu_hwcap_keys[num]); > > +} > > + > > +/* > > + * Test for a capability without a runtime check. > > + * > > + * Before capabilities are finalized, this will BUG(). > > + * After capabilities are finalized, this is patched to avoid a runtime check. > > + * > > + * @num must be a compile-time constant. > > + */ > > +static __always_inline bool cpus_have_final_cap(int num) > > This should be called "riscv_isa_have_final_extension()" > > > +{ > > + if (system_capabilities_finalized()) > > + return __cpus_have_const_cap(num); > > + else > > + BUG(); > > +} > > + > > +/* > > + * Test for a capability, possibly with a runtime check. > > + * > > + * Before capabilities are finalized, this behaves as cpus_have_cap(). > > + * After capabilities are finalized, this is patched to avoid a runtime check. > > + * > > + * @num must be a compile-time constant. > > + */ > > +static __always_inline bool cpus_have_const_cap(int num) > > Same comment as above. > > > +{ > > + if (system_capabilities_finalized()) > > + return __cpus_have_const_cap(num); > > + else > > + return cpus_have_cap(num); > > +} > > + > > +static inline void cpus_set_cap(unsigned int num) > > Same comment as above. > > > +{ > > + if (num >= RISCV_NCAPS) { > > + pr_warn("Attempt to set an illegal CPU capability (%d >= %d)\n", > > + num, RISCV_NCAPS); > > + } else { > > + __set_bit(num, cpu_hwcaps); > > + } > > +} > > + > > +#endif > > diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c > > index 1b2d42d7f589..e6c72cad0c1c 100644 > > --- a/arch/riscv/kernel/cpufeature.c > > +++ b/arch/riscv/kernel/cpufeature.c > > @@ -9,6 +9,7 @@ > > #include <linux/bitmap.h> > > #include <linux/ctype.h> > > #include <linux/of.h> > > +#include <asm/cpufeature.h> > > #include <asm/processor.h> > > #include <asm/hwcap.h> > > #include <asm/smp.h> > > @@ -25,6 +26,15 @@ static DECLARE_BITMAP(riscv_isa, RISCV_ISA_EXT_MAX) __read_mostly; > > __ro_after_init DEFINE_STATIC_KEY_FALSE(cpu_hwcap_fpu); > > #endif > > > > +DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); > > +EXPORT_SYMBOL(cpu_hwcaps); > > Just like the previous comment. This is a redundant bitmap. > Please use "riscv_isa" bitmap for this purpose. > > > + > > +DEFINE_STATIC_KEY_ARRAY_FALSE(cpu_hwcap_keys, RISCV_NCAPS); > > +EXPORT_SYMBOL(cpu_hwcap_keys); > > + > > +DEFINE_STATIC_KEY_FALSE(riscv_const_caps_ready); > > +EXPORT_SYMBOL(riscv_const_caps_ready); > > Please see comments above. > > > + > > /** > > * riscv_isa_extension_base() - Get base extension word > > * > > @@ -62,6 +72,17 @@ bool __riscv_isa_extension_available(const unsigned long *isa_bitmap, int bit) > > } > > EXPORT_SYMBOL_GPL(__riscv_isa_extension_available); > > > > +static void __init enable_cpu_capabilities(void) > > +{ > > + int i; > > + > > + for (i = 0; i < RISCV_NCAPS; i++) { > > + if (!cpus_have_cap(i)) > > + continue; > > + static_branch_enable(&cpu_hwcap_keys[i]); > > + } > > +} > > + > > void __init riscv_fill_hwcap(void) > > { > > struct device_node *node; > > @@ -236,4 +257,6 @@ void __init riscv_fill_hwcap(void) > > if (elf_hwcap & (COMPAT_HWCAP_ISA_F | COMPAT_HWCAP_ISA_D)) > > static_branch_enable(&cpu_hwcap_fpu); > > #endif > > + enable_cpu_capabilities(); > > + static_branch_enable(&riscv_const_caps_ready); > > } > > diff --git a/arch/riscv/tools/Makefile b/arch/riscv/tools/Makefile > > new file mode 100644 > > index 000000000000..932b4fe5c768 > > --- /dev/null > > +++ b/arch/riscv/tools/Makefile > > @@ -0,0 +1,22 @@ > > +# SPDX-License-Identifier: GPL-2.0 > > + > > +gen := arch/$(ARCH)/include/generated > > +kapi := $(gen)/asm > > + > > +kapi-hdrs-y := $(kapi)/cpucaps.h > > + > > +targets += $(addprefix ../../../,$(gen-y) $(kapi-hdrs-y)) > > + > > +PHONY += kapi > > + > > +kapi: $(kapi-hdrs-y) $(gen-y) > > + > > +# Create output directory if not already present > > +_dummy := $(shell [ -d '$(kapi)' ] || mkdir -p '$(kapi)') > > + > > +quiet_cmd_gen_cpucaps = GEN $@ > > + cmd_gen_cpucaps = mkdir -p $(dir $@) && \ > > + $(AWK) -f $(filter-out $(PHONY),$^) > $@ > > + > > +$(kapi)/cpucaps.h: $(src)/gen-cpucaps.awk $(src)/cpucaps FORCE > > + $(call if_changed,gen_cpucaps) > > diff --git a/arch/riscv/tools/cpucaps b/arch/riscv/tools/cpucaps > > new file mode 100644 > > index 000000000000..cb1ff2747859 > > --- /dev/null > > +++ b/arch/riscv/tools/cpucaps > > @@ -0,0 +1,5 @@ > > +# SPDX-License-Identifier: GPL-2.0 > > +# > > +# Internal CPU capabilities constants, keep this list sorted > > + > > +HAS_NO_FPU > > How can "No FPU" be a CPU capability ? > > We have ISA extensions 'F' and 'D' which tells us whether an FPU is available > or not. > > I think this file should be a table with two columns > "<lower_case_extension_name> <parsed_from_isa_string_yes_no>" > I this this file should look like this: > > i yes > m yes > a yes > c yes > f yes > d yes > h yes > sv48 no > sv57 no > sstc yes > svinval yes > svpbmt yes > svnapot yes > sscofpmf yes > ... > > > diff --git a/arch/riscv/tools/gen-cpucaps.awk b/arch/riscv/tools/gen-cpucaps.awk > > new file mode 100755 > > index 000000000000..52a1e1b064ad > > --- /dev/null > > +++ b/arch/riscv/tools/gen-cpucaps.awk > > @@ -0,0 +1,40 @@ > > +#!/bin/awk -f > > +# SPDX-License-Identifier: GPL-2.0 > > +# gen-cpucaps.awk: riscv cpucaps header generator > > +# > > +# Usage: awk -f gen-cpucaps.awk cpucaps.txt > > + > > +# Log an error and terminate > > +function fatal(msg) { > > + print "Error at line " NR ": " msg > "/dev/stderr" > > + exit 1 > > +} > > + > > +# skip blank lines and comment lines > > +/^$/ { next } > > +/^#/ { next } > > + > > +BEGIN { > > + print "#ifndef __ASM_CPUCAPS_H" > > + print "#define __ASM_CPUCAPS_H" > > + print "" > > + print "/* Generated file - do not edit */" > > + cap_num = 0 > > + print "" > > +} > > + > > +/^[vA-Z0-9_]+$/ { > > + printf("#define RISCV_%-30s\t%d\n", $0, cap_num++) > > + next > > +} > > + > > +END { > > + printf("#define RISCV_NCAPS\t\t\t\t%d\n", cap_num) > > + print "" > > + print "#endif /* __ASM_CPUCAPS_H */" > > +} > > This script need to change refer capabilities as extensions. > > For every extension, there should be two defines. > For e.g. "sstc" extension should have following defines > #define RISCV_ISA_EXT_sstc <#num> > #define RISCV_ISA_EXT_FROMSTR_sstc <1|0> > > > + > > +# Any lines not handled by previous rules are unexpected > > +{ > > + fatal("unhandled statement") > > +} > > -- > > 2.34.1 > > > > Regards, > Anup
On Mon, May 9, 2022 at 7:50 AM Jisheng Zhang <jszhang@kernel.org> wrote: > > On Mon, May 09, 2022 at 09:17:10AM +0530, Anup Patel wrote: > > On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > Currently, riscv has several features why may not be supported on all > > > riscv platforms, for example, FPU, SV48 and so on. To support unified > > > kernel Image style, we need to check whether the feature is suportted > > > or not. If the check sits at hot code path, then performance will be > > > impacted a lot. static key can be used to solve the issue. In the past > > > FPU support has been converted to use static key mechanism. I believe > > > we will have similar cases in the future. > > > > It's not just FPU and Sv48. There are several others such as Svinval, > > Vector, Svnapot, Svpbmt, and many many others. > > > > Overall, I agree with the approach of using static key array but I > > disagree with the semantics and the duplicate stuff being added. > > > > Please see more comments below .. > > > > > > > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > > > patch tries to add an unified mechanism to use static keys for all > > > the cpu features by implementing an array of default-false static keys > > > and enabling them when detected. The cpus_have_*_cap() check uses the > > > static keys if riscv_const_caps_ready is finalized, otherwise the > > > compiler generates the bitmap test. > > > > First of all, we should stop calling this a feature (like ARM does). Rather, > > we should call these as isa extensions ("isaext") to align with the RISC-V > > priv spec and RISC-V profiles spec. For all the ISA optionalities which do > > not have distinct extension name, the RISC-V profiles spec is assigning > > names to all such optionalities. > > Same as the reply a few minutes ago, the key problem here is do all > CPU features belong to *ISA* extensions? For example, SV48, SV57 etc. > I agree with Atish's comments here: > > "I think the cpu feature is a superset of the ISA extension. > cpu feature != ISA extension" > It seems to be accurate at that point in time. However, the latest profile spec seems to define everything as an extension including sv48. https://github.com/riscv/riscv-profiles/blob/main/profiles.adoc#623-rva22s64-supported-optional-extensions It may be a redundant effort and confusing to create two sets i.e. feature and extension in this case. But this specification is not frozen yet and may change in the future. We at least know that that is the current intention. Array of static keys is definitely useful and should be used for all well defined ISA extensions by the ratified priv spec. This will simplify this patch as well. For any feature/extensions (i.e. sv48/sv57) which was never defined as an extension in the priv spec but profile seems to define it now, I would leave it alone for the time being. Converting the existing code to static key probably has value but please do not include it in the static key array setup. Once the profile spec is frozen, we can decide which direction the Linux kernel should go. > https://lore.kernel.org/linux-riscv/CAHBxVyF65jC_wvxcD6bueqpCY8-Kbahu1yxsSoBmO1s15dGkSQ@mail.gmail.com/ > > > > > Another issue with semantics is that this patch assumes all features are > > enabled by default and we selectively disable it. This contrary to the > > approach taken by existing arch/riscv/kernel/cpufeature.c which assumes > > nothing is enabled by default and we selectively enable it. > > This is implementation related, can be modified in next version. From > another side, assuming some feature enabled by default can result in > trivial performance improvement for most platforms. For example, if > most platforms are FPU capable, we'd better assume FPU enabled by default. > > > > > > > > > Signed-off-by: Jisheng Zhang <jszhang@kernel.org> > > > --- > > > arch/riscv/Makefile | 3 + > > > arch/riscv/include/asm/cpufeature.h | 94 +++++++++++++++++++++++++++++ > > > arch/riscv/kernel/cpufeature.c | 23 +++++++ > > > arch/riscv/tools/Makefile | 22 +++++++ > > > arch/riscv/tools/cpucaps | 5 ++ > > > arch/riscv/tools/gen-cpucaps.awk | 40 ++++++++++++ > > > 6 files changed, 187 insertions(+) > > > create mode 100644 arch/riscv/include/asm/cpufeature.h > > > create mode 100644 arch/riscv/tools/Makefile > > > create mode 100644 arch/riscv/tools/cpucaps > > > create mode 100755 arch/riscv/tools/gen-cpucaps.awk > > > > > > diff --git a/arch/riscv/Makefile b/arch/riscv/Makefile > > > index 7d81102cffd4..f4df67369d84 100644 > > > --- a/arch/riscv/Makefile > > > +++ b/arch/riscv/Makefile > > > @@ -154,3 +154,6 @@ PHONY += rv64_randconfig > > > rv64_randconfig: > > > $(Q)$(MAKE) KCONFIG_ALLCONFIG=$(srctree)/arch/riscv/configs/64-bit.config \ > > > -f $(srctree)/Makefile randconfig > > > + > > > +archprepare: > > > + $(Q)$(MAKE) $(build)=arch/riscv/tools kapi > > > diff --git a/arch/riscv/include/asm/cpufeature.h b/arch/riscv/include/asm/cpufeature.h > > > new file mode 100644 > > > index 000000000000..d80ddd2f3b49 > > > --- /dev/null > > > +++ b/arch/riscv/include/asm/cpufeature.h > > > > We don't need a separate header for this. > > > > All this belongs to arch/riscv/include/asm/hwcap.h > > > > > @@ -0,0 +1,94 @@ > > > +/* SPDX-License-Identifier: GPL-2.0-only */ > > > +/* > > > + * Copyright (C) 2014 Linaro Ltd. <ard.biesheuvel@linaro.org> > > > + * Copyright (C) 2022 Jisheng Zhang <jszhang@kernel.org> > > > + */ > > > + > > > +#ifndef __ASM_CPUFEATURE_H > > > +#define __ASM_CPUFEATURE_H > > > + > > > +#include <asm/cpucaps.h> > > > + > > > +#include <linux/bug.h> > > > +#include <linux/jump_label.h> > > > +#include <linux/kernel.h> > > > + > > > +extern DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); > > > > This is a redundant bitmap. Please re-use "riscv_isa" bitmap for this > > the ISA extensions. > > > > > +extern struct static_key_false cpu_hwcap_keys[RISCV_NCAPS]; > > > > This should be called "riscv_isa_keys" > > > > > +extern struct static_key_false riscv_const_caps_ready; > > > > This should be called "riscv_isa_keys_ready". > > > > > + > > > +static __always_inline bool system_capabilities_finalized(void) > > > > Another misaligned name. This should be called > > "riscv_isa_keys_finalized()". > > > > > +{ > > > + return static_branch_likely(&riscv_const_caps_ready); > > > +} > > > + > > > +/* > > > + * Test for a capability with a runtime check. > > > + * > > > + * Before the capability is detected, this returns false. > > > + */ > > > +static inline bool cpus_have_cap(unsigned int num) > > > +{ > > > + if (num >= RISCV_NCAPS) > > > + return false; > > > + return test_bit(num, cpu_hwcaps); > > > +} > > > > This should be called riscv_isa_have_extension() and it should > > internally call "__riscv_isa_extension_available(NULL, num)". > > > > > + > > > +/* > > > + * Test for a capability without a runtime check. > > > + * > > > + * Before capabilities are finalized, this returns false. > > > + * After capabilities are finalized, this is patched to avoid a runtime check. > > > + * > > > + * @num must be a compile-time constant. > > > + */ > > > +static __always_inline bool __cpus_have_const_cap(int num) > > > > This should be named "__riscv_isa_have_const_extension()" > > > > > +{ > > > + if (num >= RISCV_NCAPS) > > > + return false; > > > + return static_branch_unlikely(&cpu_hwcap_keys[num]); > > > +} > > > + > > > +/* > > > + * Test for a capability without a runtime check. > > > + * > > > + * Before capabilities are finalized, this will BUG(). > > > + * After capabilities are finalized, this is patched to avoid a runtime check. > > > + * > > > + * @num must be a compile-time constant. > > > + */ > > > +static __always_inline bool cpus_have_final_cap(int num) > > > > This should be called "riscv_isa_have_final_extension()" > > > > > +{ > > > + if (system_capabilities_finalized()) > > > + return __cpus_have_const_cap(num); > > > + else > > > + BUG(); > > > +} > > > + > > > +/* > > > + * Test for a capability, possibly with a runtime check. > > > + * > > > + * Before capabilities are finalized, this behaves as cpus_have_cap(). > > > + * After capabilities are finalized, this is patched to avoid a runtime check. > > > + * > > > + * @num must be a compile-time constant. > > > + */ > > > +static __always_inline bool cpus_have_const_cap(int num) > > > > Same comment as above. > > > > > +{ > > > + if (system_capabilities_finalized()) > > > + return __cpus_have_const_cap(num); > > > + else > > > + return cpus_have_cap(num); > > > +} > > > + > > > +static inline void cpus_set_cap(unsigned int num) > > > > Same comment as above. > > > > > +{ > > > + if (num >= RISCV_NCAPS) { > > > + pr_warn("Attempt to set an illegal CPU capability (%d >= %d)\n", > > > + num, RISCV_NCAPS); > > > + } else { > > > + __set_bit(num, cpu_hwcaps); > > > + } > > > +} > > > + > > > +#endif > > > diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c > > > index 1b2d42d7f589..e6c72cad0c1c 100644 > > > --- a/arch/riscv/kernel/cpufeature.c > > > +++ b/arch/riscv/kernel/cpufeature.c > > > @@ -9,6 +9,7 @@ > > > #include <linux/bitmap.h> > > > #include <linux/ctype.h> > > > #include <linux/of.h> > > > +#include <asm/cpufeature.h> > > > #include <asm/processor.h> > > > #include <asm/hwcap.h> > > > #include <asm/smp.h> > > > @@ -25,6 +26,15 @@ static DECLARE_BITMAP(riscv_isa, RISCV_ISA_EXT_MAX) __read_mostly; > > > __ro_after_init DEFINE_STATIC_KEY_FALSE(cpu_hwcap_fpu); > > > #endif > > > > > > +DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); > > > +EXPORT_SYMBOL(cpu_hwcaps); > > > > Just like the previous comment. This is a redundant bitmap. > > Please use "riscv_isa" bitmap for this purpose. > > > > > + > > > +DEFINE_STATIC_KEY_ARRAY_FALSE(cpu_hwcap_keys, RISCV_NCAPS); > > > +EXPORT_SYMBOL(cpu_hwcap_keys); > > > + > > > +DEFINE_STATIC_KEY_FALSE(riscv_const_caps_ready); > > > +EXPORT_SYMBOL(riscv_const_caps_ready); > > > > Please see comments above. > > > > > + > > > /** > > > * riscv_isa_extension_base() - Get base extension word > > > * > > > @@ -62,6 +72,17 @@ bool __riscv_isa_extension_available(const unsigned long *isa_bitmap, int bit) > > > } > > > EXPORT_SYMBOL_GPL(__riscv_isa_extension_available); > > > > > > +static void __init enable_cpu_capabilities(void) > > > +{ > > > + int i; > > > + > > > + for (i = 0; i < RISCV_NCAPS; i++) { > > > + if (!cpus_have_cap(i)) > > > + continue; > > > + static_branch_enable(&cpu_hwcap_keys[i]); > > > + } > > > +} > > > + > > > void __init riscv_fill_hwcap(void) > > > { > > > struct device_node *node; > > > @@ -236,4 +257,6 @@ void __init riscv_fill_hwcap(void) > > > if (elf_hwcap & (COMPAT_HWCAP_ISA_F | COMPAT_HWCAP_ISA_D)) > > > static_branch_enable(&cpu_hwcap_fpu); > > > #endif > > > + enable_cpu_capabilities(); > > > + static_branch_enable(&riscv_const_caps_ready); > > > } > > > diff --git a/arch/riscv/tools/Makefile b/arch/riscv/tools/Makefile > > > new file mode 100644 > > > index 000000000000..932b4fe5c768 > > > --- /dev/null > > > +++ b/arch/riscv/tools/Makefile > > > @@ -0,0 +1,22 @@ > > > +# SPDX-License-Identifier: GPL-2.0 > > > + > > > +gen := arch/$(ARCH)/include/generated > > > +kapi := $(gen)/asm > > > + > > > +kapi-hdrs-y := $(kapi)/cpucaps.h > > > + > > > +targets += $(addprefix ../../../,$(gen-y) $(kapi-hdrs-y)) > > > + > > > +PHONY += kapi > > > + > > > +kapi: $(kapi-hdrs-y) $(gen-y) > > > + > > > +# Create output directory if not already present > > > +_dummy := $(shell [ -d '$(kapi)' ] || mkdir -p '$(kapi)') > > > + > > > +quiet_cmd_gen_cpucaps = GEN $@ > > > + cmd_gen_cpucaps = mkdir -p $(dir $@) && \ > > > + $(AWK) -f $(filter-out $(PHONY),$^) > $@ > > > + > > > +$(kapi)/cpucaps.h: $(src)/gen-cpucaps.awk $(src)/cpucaps FORCE > > > + $(call if_changed,gen_cpucaps) > > > diff --git a/arch/riscv/tools/cpucaps b/arch/riscv/tools/cpucaps > > > new file mode 100644 > > > index 000000000000..cb1ff2747859 > > > --- /dev/null > > > +++ b/arch/riscv/tools/cpucaps > > > @@ -0,0 +1,5 @@ > > > +# SPDX-License-Identifier: GPL-2.0 > > > +# > > > +# Internal CPU capabilities constants, keep this list sorted > > > + > > > +HAS_NO_FPU > > > > How can "No FPU" be a CPU capability ? > > > > We have ISA extensions 'F' and 'D' which tells us whether an FPU is available > > or not. > > > > I think this file should be a table with two columns > > "<lower_case_extension_name> <parsed_from_isa_string_yes_no>" > > I this this file should look like this: > > > > i yes > > m yes > > a yes > > c yes > > f yes > > d yes > > h yes > > sv48 no > > sv57 no > > sstc yes > > svinval yes > > svpbmt yes > > svnapot yes > > sscofpmf yes > > ... > > > > > diff --git a/arch/riscv/tools/gen-cpucaps.awk b/arch/riscv/tools/gen-cpucaps.awk > > > new file mode 100755 > > > index 000000000000..52a1e1b064ad > > > --- /dev/null > > > +++ b/arch/riscv/tools/gen-cpucaps.awk > > > @@ -0,0 +1,40 @@ > > > +#!/bin/awk -f > > > +# SPDX-License-Identifier: GPL-2.0 > > > +# gen-cpucaps.awk: riscv cpucaps header generator > > > +# > > > +# Usage: awk -f gen-cpucaps.awk cpucaps.txt > > > + > > > +# Log an error and terminate > > > +function fatal(msg) { > > > + print "Error at line " NR ": " msg > "/dev/stderr" > > > + exit 1 > > > +} > > > + > > > +# skip blank lines and comment lines > > > +/^$/ { next } > > > +/^#/ { next } > > > + > > > +BEGIN { > > > + print "#ifndef __ASM_CPUCAPS_H" > > > + print "#define __ASM_CPUCAPS_H" > > > + print "" > > > + print "/* Generated file - do not edit */" > > > + cap_num = 0 > > > + print "" > > > +} > > > + > > > +/^[vA-Z0-9_]+$/ { > > > + printf("#define RISCV_%-30s\t%d\n", $0, cap_num++) > > > + next > > > +} > > > + > > > +END { > > > + printf("#define RISCV_NCAPS\t\t\t\t%d\n", cap_num) > > > + print "" > > > + print "#endif /* __ASM_CPUCAPS_H */" > > > +} > > > > This script need to change refer capabilities as extensions. > > > > For every extension, there should be two defines. > > For e.g. "sstc" extension should have following defines > > #define RISCV_ISA_EXT_sstc <#num> > > #define RISCV_ISA_EXT_FROMSTR_sstc <1|0> > > > > > + > > > +# Any lines not handled by previous rules are unexpected > > > +{ > > > + fatal("unhandled statement") > > > +} > > > -- > > > 2.34.1 > > > > > > > Regards, > > Anup > > _______________________________________________ > linux-riscv mailing list > linux-riscv@lists.infradead.org > http://lists.infradead.org/mailman/listinfo/linux-riscv
On Wed, May 11, 2022 at 11:29:32PM -0700, Atish Patra wrote: > On Mon, May 9, 2022 at 7:50 AM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > On Mon, May 09, 2022 at 09:17:10AM +0530, Anup Patel wrote: > > > On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > > > Currently, riscv has several features why may not be supported on all > > > > riscv platforms, for example, FPU, SV48 and so on. To support unified > > > > kernel Image style, we need to check whether the feature is suportted > > > > or not. If the check sits at hot code path, then performance will be > > > > impacted a lot. static key can be used to solve the issue. In the past > > > > FPU support has been converted to use static key mechanism. I believe > > > > we will have similar cases in the future. > > > > > > It's not just FPU and Sv48. There are several others such as Svinval, > > > Vector, Svnapot, Svpbmt, and many many others. > > > > > > Overall, I agree with the approach of using static key array but I > > > disagree with the semantics and the duplicate stuff being added. > > > > > > Please see more comments below .. > > > > > > > > > > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > > > > patch tries to add an unified mechanism to use static keys for all > > > > the cpu features by implementing an array of default-false static keys > > > > and enabling them when detected. The cpus_have_*_cap() check uses the > > > > static keys if riscv_const_caps_ready is finalized, otherwise the > > > > compiler generates the bitmap test. > > > > > > First of all, we should stop calling this a feature (like ARM does). Rather, > > > we should call these as isa extensions ("isaext") to align with the RISC-V > > > priv spec and RISC-V profiles spec. For all the ISA optionalities which do > > > not have distinct extension name, the RISC-V profiles spec is assigning > > > names to all such optionalities. > > > > Same as the reply a few minutes ago, the key problem here is do all > > CPU features belong to *ISA* extensions? For example, SV48, SV57 etc. > > I agree with Atish's comments here: > > > > "I think the cpu feature is a superset of the ISA extension. > > cpu feature != ISA extension" > > > > It seems to be accurate at that point in time. However, the latest > profile spec seems to > define everything as an extension including sv48. > > https://github.com/riscv/riscv-profiles/blob/main/profiles.adoc#623-rva22s64-supported-optional-extensions > > It may be a redundant effort and confusing to create two sets i.e. > feature and extension in this case. > But this specification is not frozen yet and may change in the future. > We at least know that that is the current intention. > > Array of static keys is definitely useful and should be used for all > well defined ISA extensions by the ratified priv spec. > This will simplify this patch as well. For any feature/extensions > (i.e. sv48/sv57) which was never defined as an extension > in the priv spec but profile seems to define it now, I would leave it > alone for the time being. Converting the existing code > to static key probably has value but please do not include it in the > static key array setup. > > Once the profile spec is frozen, we can decide which direction the > Linux kernel should go. > Hi Atish, Anup, I see your points and thanks for the information of the profile spec. Now, I have other two points about isa VS features: 1. Not all isa extenstions need static key mechanism, so if we make a static key array with 1:1 riscv_isa <-> static key relationship there may be waste. For example, the 'a', 'c', 'i', 'm' and so on don't have static key usage. 2.We may need riscv architecture static keys for non-isa, this is usually related with the linux os itself, for example a static key for "unmap kernelspace at userspace". static keys for "spectre CVE mitigations" etc. In summary, I can see riscv_isa doesn't cover features which need static keys, and vice vesa. Could you please comment? Thanks in advance, Jisheng
On Sun, May 15, 2022 at 12:54 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > On Wed, May 11, 2022 at 11:29:32PM -0700, Atish Patra wrote: > > On Mon, May 9, 2022 at 7:50 AM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > On Mon, May 09, 2022 at 09:17:10AM +0530, Anup Patel wrote: > > > > On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > > > > > Currently, riscv has several features why may not be supported on all > > > > > riscv platforms, for example, FPU, SV48 and so on. To support unified > > > > > kernel Image style, we need to check whether the feature is suportted > > > > > or not. If the check sits at hot code path, then performance will be > > > > > impacted a lot. static key can be used to solve the issue. In the past > > > > > FPU support has been converted to use static key mechanism. I believe > > > > > we will have similar cases in the future. > > > > > > > > It's not just FPU and Sv48. There are several others such as Svinval, > > > > Vector, Svnapot, Svpbmt, and many many others. > > > > > > > > Overall, I agree with the approach of using static key array but I > > > > disagree with the semantics and the duplicate stuff being added. > > > > > > > > Please see more comments below .. > > > > > > > > > > > > > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > > > > > patch tries to add an unified mechanism to use static keys for all > > > > > the cpu features by implementing an array of default-false static keys > > > > > and enabling them when detected. The cpus_have_*_cap() check uses the > > > > > static keys if riscv_const_caps_ready is finalized, otherwise the > > > > > compiler generates the bitmap test. > > > > > > > > First of all, we should stop calling this a feature (like ARM does). Rather, > > > > we should call these as isa extensions ("isaext") to align with the RISC-V > > > > priv spec and RISC-V profiles spec. For all the ISA optionalities which do > > > > not have distinct extension name, the RISC-V profiles spec is assigning > > > > names to all such optionalities. > > > > > > Same as the reply a few minutes ago, the key problem here is do all > > > CPU features belong to *ISA* extensions? For example, SV48, SV57 etc. > > > I agree with Atish's comments here: > > > > > > "I think the cpu feature is a superset of the ISA extension. > > > cpu feature != ISA extension" > > > > > > > It seems to be accurate at that point in time. However, the latest > > profile spec seems to > > define everything as an extension including sv48. > > > > https://github.com/riscv/riscv-profiles/blob/main/profiles.adoc#623-rva22s64-supported-optional-extensions > > > > It may be a redundant effort and confusing to create two sets i.e. > > feature and extension in this case. > > But this specification is not frozen yet and may change in the future. > > We at least know that that is the current intention. > > > > Array of static keys is definitely useful and should be used for all > > well defined ISA extensions by the ratified priv spec. > > This will simplify this patch as well. For any feature/extensions > > (i.e. sv48/sv57) which was never defined as an extension > > in the priv spec but profile seems to define it now, I would leave it > > alone for the time being. Converting the existing code > > to static key probably has value but please do not include it in the > > static key array setup. > > > > Once the profile spec is frozen, we can decide which direction the > > Linux kernel should go. > > > > Hi Atish, Anup, > > I see your points and thanks for the information of the profile > spec. Now, I have other two points about isa VS features: > > 1. Not all isa extenstions need static key mechanism, so if we > make a static key array with 1:1 riscv_isa <-> static key relationship > there may be waste. > > For example, the 'a', 'c', 'i', 'm' and so on don't have static > key usage. Not all isa extensions but a large number of them will need a static key. It's better to always have one static key per ISA extension defined in cpufeatures.c For example, F, D, V, Sstc, Svinval, Ssofpmt, Zb*, AIA, etc. > > 2.We may need riscv architecture static keys for non-isa, this is > usually related with the linux os itself, for example > a static key for "unmap kernelspace at userspace". > static keys for "spectre CVE mitigations" > etc. These things look more like errata or workarounds so better to use that framework instead of ISA extensions (or features). Some of these things might even use ALTERNATIVEs instead of static keys. > > In summary, I can see riscv_isa doesn't cover features which need static > keys, and vice vesa. > > Could you please comment? > > Thanks in advance, > Jisheng Regards, Anup
On Sun, May 15, 2022 at 08:19:37PM +0530, Anup Patel wrote: > On Sun, May 15, 2022 at 12:54 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > On Wed, May 11, 2022 at 11:29:32PM -0700, Atish Patra wrote: > > > On Mon, May 9, 2022 at 7:50 AM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > > > On Mon, May 09, 2022 at 09:17:10AM +0530, Anup Patel wrote: > > > > > On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > > > > > > > Currently, riscv has several features why may not be supported on all > > > > > > riscv platforms, for example, FPU, SV48 and so on. To support unified > > > > > > kernel Image style, we need to check whether the feature is suportted > > > > > > or not. If the check sits at hot code path, then performance will be > > > > > > impacted a lot. static key can be used to solve the issue. In the past > > > > > > FPU support has been converted to use static key mechanism. I believe > > > > > > we will have similar cases in the future. > > > > > > > > > > It's not just FPU and Sv48. There are several others such as Svinval, > > > > > Vector, Svnapot, Svpbmt, and many many others. > > > > > > > > > > Overall, I agree with the approach of using static key array but I > > > > > disagree with the semantics and the duplicate stuff being added. > > > > > > > > > > Please see more comments below .. > > > > > > > > > > > > > > > > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > > > > > > patch tries to add an unified mechanism to use static keys for all > > > > > > the cpu features by implementing an array of default-false static keys > > > > > > and enabling them when detected. The cpus_have_*_cap() check uses the > > > > > > static keys if riscv_const_caps_ready is finalized, otherwise the > > > > > > compiler generates the bitmap test. > > > > > > > > > > First of all, we should stop calling this a feature (like ARM does). Rather, > > > > > we should call these as isa extensions ("isaext") to align with the RISC-V > > > > > priv spec and RISC-V profiles spec. For all the ISA optionalities which do > > > > > not have distinct extension name, the RISC-V profiles spec is assigning > > > > > names to all such optionalities. > > > > > > > > Same as the reply a few minutes ago, the key problem here is do all > > > > CPU features belong to *ISA* extensions? For example, SV48, SV57 etc. > > > > I agree with Atish's comments here: > > > > > > > > "I think the cpu feature is a superset of the ISA extension. > > > > cpu feature != ISA extension" > > > > > > > > > > It seems to be accurate at that point in time. However, the latest > > > profile spec seems to > > > define everything as an extension including sv48. > > > > > > https://github.com/riscv/riscv-profiles/blob/main/profiles.adoc#623-rva22s64-supported-optional-extensions > > > > > > It may be a redundant effort and confusing to create two sets i.e. > > > feature and extension in this case. > > > But this specification is not frozen yet and may change in the future. > > > We at least know that that is the current intention. > > > > > > Array of static keys is definitely useful and should be used for all > > > well defined ISA extensions by the ratified priv spec. > > > This will simplify this patch as well. For any feature/extensions > > > (i.e. sv48/sv57) which was never defined as an extension > > > in the priv spec but profile seems to define it now, I would leave it > > > alone for the time being. Converting the existing code > > > to static key probably has value but please do not include it in the > > > static key array setup. > > > > > > Once the profile spec is frozen, we can decide which direction the > > > Linux kernel should go. > > > > > > > Hi Atish, Anup, > > > > I see your points and thanks for the information of the profile > > spec. Now, I have other two points about isa VS features: > > > > 1. Not all isa extenstions need static key mechanism, so if we > > make a static key array with 1:1 riscv_isa <-> static key relationship > > there may be waste. > > > > For example, the 'a', 'c', 'i', 'm' and so on don't have static > > key usage. > > Not all isa extensions but a large number of them will need a static > key. It's better to always have one static key per ISA extension > defined in cpufeatures.c Currently, RISCV_ISA_EXT_MAX equals to 64 while the base ID is 26. In those 26 base IDs, only F/D and V need static key, it means we waste at least 24 static keys. > > For example, F, D, V, Sstc, Svinval, Ssofpmt, Zb*, AIA, etc. > > > > > 2.We may need riscv architecture static keys for non-isa, this is > > usually related with the linux os itself, for example > > a static key for "unmap kernelspace at userspace". > > static keys for "spectre CVE mitigations" > > etc. > > These things look more like errata or workarounds so better > to use that framework instead of ISA extensions (or features). Currently, the errata workarounds are implemented with ALTERNATIVEs but I believe sometime we may need static key to implement the workarounds. However this can be checked later. Now I worried about the static key waste above. Thanks
On Mon, May 16, 2022 at 11:02 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > On Sun, May 15, 2022 at 08:19:37PM +0530, Anup Patel wrote: > > On Sun, May 15, 2022 at 12:54 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > On Wed, May 11, 2022 at 11:29:32PM -0700, Atish Patra wrote: > > > > On Mon, May 9, 2022 at 7:50 AM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > > > > > On Mon, May 09, 2022 at 09:17:10AM +0530, Anup Patel wrote: > > > > > > On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > > > > > > > > > > > > > Currently, riscv has several features why may not be supported on all > > > > > > > riscv platforms, for example, FPU, SV48 and so on. To support unified > > > > > > > kernel Image style, we need to check whether the feature is suportted > > > > > > > or not. If the check sits at hot code path, then performance will be > > > > > > > impacted a lot. static key can be used to solve the issue. In the past > > > > > > > FPU support has been converted to use static key mechanism. I believe > > > > > > > we will have similar cases in the future. > > > > > > > > > > > > It's not just FPU and Sv48. There are several others such as Svinval, > > > > > > Vector, Svnapot, Svpbmt, and many many others. > > > > > > > > > > > > Overall, I agree with the approach of using static key array but I > > > > > > disagree with the semantics and the duplicate stuff being added. > > > > > > > > > > > > Please see more comments below .. > > > > > > > > > > > > > > > > > > > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > > > > > > > patch tries to add an unified mechanism to use static keys for all > > > > > > > the cpu features by implementing an array of default-false static keys > > > > > > > and enabling them when detected. The cpus_have_*_cap() check uses the > > > > > > > static keys if riscv_const_caps_ready is finalized, otherwise the > > > > > > > compiler generates the bitmap test. > > > > > > > > > > > > First of all, we should stop calling this a feature (like ARM does). Rather, > > > > > > we should call these as isa extensions ("isaext") to align with the RISC-V > > > > > > priv spec and RISC-V profiles spec. For all the ISA optionalities which do > > > > > > not have distinct extension name, the RISC-V profiles spec is assigning > > > > > > names to all such optionalities. > > > > > > > > > > Same as the reply a few minutes ago, the key problem here is do all > > > > > CPU features belong to *ISA* extensions? For example, SV48, SV57 etc. > > > > > I agree with Atish's comments here: > > > > > > > > > > "I think the cpu feature is a superset of the ISA extension. > > > > > cpu feature != ISA extension" > > > > > > > > > > > > > It seems to be accurate at that point in time. However, the latest > > > > profile spec seems to > > > > define everything as an extension including sv48. > > > > > > > > https://github.com/riscv/riscv-profiles/blob/main/profiles.adoc#623-rva22s64-supported-optional-extensions > > > > > > > > It may be a redundant effort and confusing to create two sets i.e. > > > > feature and extension in this case. > > > > But this specification is not frozen yet and may change in the future. > > > > We at least know that that is the current intention. > > > > > > > > Array of static keys is definitely useful and should be used for all > > > > well defined ISA extensions by the ratified priv spec. > > > > This will simplify this patch as well. For any feature/extensions > > > > (i.e. sv48/sv57) which was never defined as an extension > > > > in the priv spec but profile seems to define it now, I would leave it > > > > alone for the time being. Converting the existing code > > > > to static key probably has value but please do not include it in the > > > > static key array setup. > > > > > > > > Once the profile spec is frozen, we can decide which direction the > > > > Linux kernel should go. > > > > > > > > > > Hi Atish, Anup, > > > > > > I see your points and thanks for the information of the profile > > > spec. Now, I have other two points about isa VS features: > > > > > > 1. Not all isa extenstions need static key mechanism, so if we > > > make a static key array with 1:1 riscv_isa <-> static key relationship > > > there may be waste. > > > > > > For example, the 'a', 'c', 'i', 'm' and so on don't have static > > > key usage. > > > > Not all isa extensions but a large number of them will need a static > > key. It's better to always have one static key per ISA extension > > defined in cpufeatures.c > > Currently, RISCV_ISA_EXT_MAX equals to 64 while the base ID is 26. > In those 26 base IDs, only F/D and V need static key, it means > we waste at least 24 static keys. If you want to save space of unused static keys then there are other ways. For example, you can create a small static key array which has many-to-one relation with the ISA extension numbers. For ISA extension which are always ON or always OFF, we can use fixed FALSE and TRUE keys. Something like below. enum riscv_isa_ext_key { RISCV_ISA_EXT_KEY_FALSE = 0, RISCV_ISA_EXT_KEY_TRUE, RISCV_ISA_EXT_KEY_FLOAD, /* For 'F' and 'D' */ RISCV_ISA_EXT_KEY_VECTOR, /* For all vector extensions */ RISCV_ISA_EXT_KEY_SVINVAL, RISCV_ISA_EXT_KEY_SSCOFPMT, RISCV_ISA_EXT_KEY_MAX, }; extern unsigned char __riscv_isa_ext_id2key[RISCV_ISA_EXT_ID_MAX]; extern struct static_key_false __riscv_isa_ext_keys[RISCV_ISA_EXT_KEY_MAX]; static __always_inline bool __riscv_isa_extension_keycheck(unsigned int ext) { if (RISCV_ISA_EXT_ID_MAX <= ext) return false; return static_branch_unlikely(&__riscv_isa_ext_keys[__riscv_isa_ext_id2key[ext]]); } #define riscv_isa_extension_keycheck(ext) \ __riscv_isa_extension_keycheck(RISCV_ISA_EXT_##ext) > > > > > For example, F, D, V, Sstc, Svinval, Ssofpmt, Zb*, AIA, etc. > > > > > > > > 2.We may need riscv architecture static keys for non-isa, this is > > > usually related with the linux os itself, for example > > > a static key for "unmap kernelspace at userspace". > > > static keys for "spectre CVE mitigations" > > > etc. > > > > These things look more like errata or workarounds so better > > to use that framework instead of ISA extensions (or features). > > Currently, the errata workarounds are implemented with ALTERNATIVEs > but I believe sometime we may need static key to implement the > workarounds. However this can be checked later. Now I worried about > the static key waste above. That's a separate topic and for now what we need is a simple and extensible approach to have static keys for ISA extensions. Regards, Anup > > Thanks
On Tue, May 17, 2022 at 09:31:50AM +0530, Anup Patel wrote: > On Mon, May 16, 2022 at 11:02 PM Jisheng Zhang <jszhang@kernel.org> wrote: > > ... > > Currently, RISCV_ISA_EXT_MAX equals to 64 while the base ID is 26. > > In those 26 base IDs, only F/D and V need static key, it means > > we waste at least 24 static keys. > > If you want to save space of unused static keys then there are other > ways. > > For example, you can create a small static key array which has > many-to-one relation with the ISA extension numbers. For ISA extension "any problem in computer science can be solved with another layer of indirection" ;) I see your points, thanks very much! But I think the array should be a static inline function to make use of compiler optimization to avoid the array references for performance. And the static key check maybe used in modules, I want to export less vars. I'm cooking the patches, will send out for review soon. > which are always ON or always OFF, we can use fixed FALSE and > TRUE keys. Something like below. > > enum riscv_isa_ext_key { > RISCV_ISA_EXT_KEY_FALSE = 0, > RISCV_ISA_EXT_KEY_TRUE, > RISCV_ISA_EXT_KEY_FLOAD, /* For 'F' and 'D' */ > RISCV_ISA_EXT_KEY_VECTOR, /* For all vector extensions */ > RISCV_ISA_EXT_KEY_SVINVAL, > RISCV_ISA_EXT_KEY_SSCOFPMT, > RISCV_ISA_EXT_KEY_MAX, > }; > > extern unsigned char __riscv_isa_ext_id2key[RISCV_ISA_EXT_ID_MAX]; > extern struct static_key_false __riscv_isa_ext_keys[RISCV_ISA_EXT_KEY_MAX]; > > static __always_inline bool __riscv_isa_extension_keycheck(unsigned int ext) > { > if (RISCV_ISA_EXT_ID_MAX <= ext) > return false; > return static_branch_unlikely(&__riscv_isa_ext_keys[__riscv_isa_ext_id2key[ext]]); > } > #define riscv_isa_extension_keycheck(ext) \ > __riscv_isa_extension_keycheck(RISCV_ISA_EXT_##ext) >
diff --git a/arch/riscv/Makefile b/arch/riscv/Makefile index 7d81102cffd4..f4df67369d84 100644 --- a/arch/riscv/Makefile +++ b/arch/riscv/Makefile @@ -154,3 +154,6 @@ PHONY += rv64_randconfig rv64_randconfig: $(Q)$(MAKE) KCONFIG_ALLCONFIG=$(srctree)/arch/riscv/configs/64-bit.config \ -f $(srctree)/Makefile randconfig + +archprepare: + $(Q)$(MAKE) $(build)=arch/riscv/tools kapi diff --git a/arch/riscv/include/asm/cpufeature.h b/arch/riscv/include/asm/cpufeature.h new file mode 100644 index 000000000000..d80ddd2f3b49 --- /dev/null +++ b/arch/riscv/include/asm/cpufeature.h @@ -0,0 +1,94 @@ +/* SPDX-License-Identifier: GPL-2.0-only */ +/* + * Copyright (C) 2014 Linaro Ltd. <ard.biesheuvel@linaro.org> + * Copyright (C) 2022 Jisheng Zhang <jszhang@kernel.org> + */ + +#ifndef __ASM_CPUFEATURE_H +#define __ASM_CPUFEATURE_H + +#include <asm/cpucaps.h> + +#include <linux/bug.h> +#include <linux/jump_label.h> +#include <linux/kernel.h> + +extern DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); +extern struct static_key_false cpu_hwcap_keys[RISCV_NCAPS]; +extern struct static_key_false riscv_const_caps_ready; + +static __always_inline bool system_capabilities_finalized(void) +{ + return static_branch_likely(&riscv_const_caps_ready); +} + +/* + * Test for a capability with a runtime check. + * + * Before the capability is detected, this returns false. + */ +static inline bool cpus_have_cap(unsigned int num) +{ + if (num >= RISCV_NCAPS) + return false; + return test_bit(num, cpu_hwcaps); +} + +/* + * Test for a capability without a runtime check. + * + * Before capabilities are finalized, this returns false. + * After capabilities are finalized, this is patched to avoid a runtime check. + * + * @num must be a compile-time constant. + */ +static __always_inline bool __cpus_have_const_cap(int num) +{ + if (num >= RISCV_NCAPS) + return false; + return static_branch_unlikely(&cpu_hwcap_keys[num]); +} + +/* + * Test for a capability without a runtime check. + * + * Before capabilities are finalized, this will BUG(). + * After capabilities are finalized, this is patched to avoid a runtime check. + * + * @num must be a compile-time constant. + */ +static __always_inline bool cpus_have_final_cap(int num) +{ + if (system_capabilities_finalized()) + return __cpus_have_const_cap(num); + else + BUG(); +} + +/* + * Test for a capability, possibly with a runtime check. + * + * Before capabilities are finalized, this behaves as cpus_have_cap(). + * After capabilities are finalized, this is patched to avoid a runtime check. + * + * @num must be a compile-time constant. + */ +static __always_inline bool cpus_have_const_cap(int num) +{ + if (system_capabilities_finalized()) + return __cpus_have_const_cap(num); + else + return cpus_have_cap(num); +} + +static inline void cpus_set_cap(unsigned int num) +{ + if (num >= RISCV_NCAPS) { + pr_warn("Attempt to set an illegal CPU capability (%d >= %d)\n", + num, RISCV_NCAPS); + } else { + __set_bit(num, cpu_hwcaps); + } +} + +#endif diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c index 1b2d42d7f589..e6c72cad0c1c 100644 --- a/arch/riscv/kernel/cpufeature.c +++ b/arch/riscv/kernel/cpufeature.c @@ -9,6 +9,7 @@ #include <linux/bitmap.h> #include <linux/ctype.h> #include <linux/of.h> +#include <asm/cpufeature.h> #include <asm/processor.h> #include <asm/hwcap.h> #include <asm/smp.h> @@ -25,6 +26,15 @@ static DECLARE_BITMAP(riscv_isa, RISCV_ISA_EXT_MAX) __read_mostly; __ro_after_init DEFINE_STATIC_KEY_FALSE(cpu_hwcap_fpu); #endif +DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); +EXPORT_SYMBOL(cpu_hwcaps); + +DEFINE_STATIC_KEY_ARRAY_FALSE(cpu_hwcap_keys, RISCV_NCAPS); +EXPORT_SYMBOL(cpu_hwcap_keys); + +DEFINE_STATIC_KEY_FALSE(riscv_const_caps_ready); +EXPORT_SYMBOL(riscv_const_caps_ready); + /** * riscv_isa_extension_base() - Get base extension word * @@ -62,6 +72,17 @@ bool __riscv_isa_extension_available(const unsigned long *isa_bitmap, int bit) } EXPORT_SYMBOL_GPL(__riscv_isa_extension_available); +static void __init enable_cpu_capabilities(void) +{ + int i; + + for (i = 0; i < RISCV_NCAPS; i++) { + if (!cpus_have_cap(i)) + continue; + static_branch_enable(&cpu_hwcap_keys[i]); + } +} + void __init riscv_fill_hwcap(void) { struct device_node *node; @@ -236,4 +257,6 @@ void __init riscv_fill_hwcap(void) if (elf_hwcap & (COMPAT_HWCAP_ISA_F | COMPAT_HWCAP_ISA_D)) static_branch_enable(&cpu_hwcap_fpu); #endif + enable_cpu_capabilities(); + static_branch_enable(&riscv_const_caps_ready); } diff --git a/arch/riscv/tools/Makefile b/arch/riscv/tools/Makefile new file mode 100644 index 000000000000..932b4fe5c768 --- /dev/null +++ b/arch/riscv/tools/Makefile @@ -0,0 +1,22 @@ +# SPDX-License-Identifier: GPL-2.0 + +gen := arch/$(ARCH)/include/generated +kapi := $(gen)/asm + +kapi-hdrs-y := $(kapi)/cpucaps.h + +targets += $(addprefix ../../../,$(gen-y) $(kapi-hdrs-y)) + +PHONY += kapi + +kapi: $(kapi-hdrs-y) $(gen-y) + +# Create output directory if not already present +_dummy := $(shell [ -d '$(kapi)' ] || mkdir -p '$(kapi)') + +quiet_cmd_gen_cpucaps = GEN $@ + cmd_gen_cpucaps = mkdir -p $(dir $@) && \ + $(AWK) -f $(filter-out $(PHONY),$^) > $@ + +$(kapi)/cpucaps.h: $(src)/gen-cpucaps.awk $(src)/cpucaps FORCE + $(call if_changed,gen_cpucaps) diff --git a/arch/riscv/tools/cpucaps b/arch/riscv/tools/cpucaps new file mode 100644 index 000000000000..cb1ff2747859 --- /dev/null +++ b/arch/riscv/tools/cpucaps @@ -0,0 +1,5 @@ +# SPDX-License-Identifier: GPL-2.0 +# +# Internal CPU capabilities constants, keep this list sorted + +HAS_NO_FPU diff --git a/arch/riscv/tools/gen-cpucaps.awk b/arch/riscv/tools/gen-cpucaps.awk new file mode 100755 index 000000000000..52a1e1b064ad --- /dev/null +++ b/arch/riscv/tools/gen-cpucaps.awk @@ -0,0 +1,40 @@ +#!/bin/awk -f +# SPDX-License-Identifier: GPL-2.0 +# gen-cpucaps.awk: riscv cpucaps header generator +# +# Usage: awk -f gen-cpucaps.awk cpucaps.txt + +# Log an error and terminate +function fatal(msg) { + print "Error at line " NR ": " msg > "/dev/stderr" + exit 1 +} + +# skip blank lines and comment lines +/^$/ { next } +/^#/ { next } + +BEGIN { + print "#ifndef __ASM_CPUCAPS_H" + print "#define __ASM_CPUCAPS_H" + print "" + print "/* Generated file - do not edit */" + cap_num = 0 + print "" +} + +/^[vA-Z0-9_]+$/ { + printf("#define RISCV_%-30s\t%d\n", $0, cap_num++) + next +} + +END { + printf("#define RISCV_NCAPS\t\t\t\t%d\n", cap_num) + print "" + print "#endif /* __ASM_CPUCAPS_H */" +} + +# Any lines not handled by previous rules are unexpected +{ + fatal("unhandled statement") +}
Currently, riscv has several features why may not be supported on all riscv platforms, for example, FPU, SV48 and so on. To support unified kernel Image style, we need to check whether the feature is suportted or not. If the check sits at hot code path, then performance will be impacted a lot. static key can be used to solve the issue. In the past FPU support has been converted to use static key mechanism. I believe we will have similar cases in the future. Similar as arm64 does(in fact, some code is borrowed from arm64), this patch tries to add an unified mechanism to use static keys for all the cpu features by implementing an array of default-false static keys and enabling them when detected. The cpus_have_*_cap() check uses the static keys if riscv_const_caps_ready is finalized, otherwise the compiler generates the bitmap test. Signed-off-by: Jisheng Zhang <jszhang@kernel.org> --- arch/riscv/Makefile | 3 + arch/riscv/include/asm/cpufeature.h | 94 +++++++++++++++++++++++++++++ arch/riscv/kernel/cpufeature.c | 23 +++++++ arch/riscv/tools/Makefile | 22 +++++++ arch/riscv/tools/cpucaps | 5 ++ arch/riscv/tools/gen-cpucaps.awk | 40 ++++++++++++ 6 files changed, 187 insertions(+) create mode 100644 arch/riscv/include/asm/cpufeature.h create mode 100644 arch/riscv/tools/Makefile create mode 100644 arch/riscv/tools/cpucaps create mode 100755 arch/riscv/tools/gen-cpucaps.awk