Message ID | 1473373615-51427-2-git-send-email-srinivas.pandruvada@linux.intel.com (mailing list archive) |
---|---|
State | Changes Requested, archived |
Headers | show |
On Thu, 8 Sep 2016, Srinivas Pandruvada wrote: > From: Tim Chen <tim.c.chen@linux.intel.com> > > We uses ASYM_PACKING feature in the scheduler to move tasks to more > capable cpus that can be boosted to higher frequency. This is enabled by > Intel Turbo Boost Max Technology 3.0 (ITMT). We mark the sched domain > topology level with SD_ASYM_PACKING flag for such systems to indicate > scheduler can use the ASYM_PACKING feature to move load to the > more capable cpus. Sigh. This changelog does not tell anything about the nature of the patch, the rationale for it etc. It's just a meaningless blurb. > +unsigned int __read_mostly sysctl_sched_itmt_enabled; > + > static inline void smpboot_setup_warm_reset_vector(unsigned long start_eip) > { > unsigned long flags; > @@ -471,31 +473,57 @@ static bool match_die(struct cpuinfo_x86 *c, struct cpuinfo_x86 *o) > return false; > } > > -static struct sched_domain_topology_level numa_inside_package_topology[] = { > +#ifdef CONFIG_SCHED_ITMT > +static int x86_core_flags(void) > +{ > + int flags = cpu_core_flags(); > + > + if (sysctl_sched_itmt_enabled) > + flags |= SD_ASYM_PACKING; > + > + return flags; > +} > + > +static int x86_smt_flags(void) > +{ > + int flags = cpu_smt_flags(); > + > + if (sysctl_sched_itmt_enabled) > + flags |= SD_ASYM_PACKING; > + > + return flags; > +} > +#else > +#define x86_core_flags cpu_core_flags > +#define x86_smt_flags cpu_smt_flags > +#endif No. We first rework the code so that the IMT stuff can be added in a later patch easily. Thanks, tglx -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Thu, 8 Sep 2016, Srinivas Pandruvada wrote: > > void set_sched_topology(struct sched_domain_topology_level *tl) > { > + if (WARN_ON_ONCE(sched_smp_initialized)) > + return; > + > sched_domain_topology = tl; > } This change has nothing to do with $subject. Seperate patch please. Thanks, tglx -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Sat, 2016-09-10 at 15:10 +0200, Thomas Gleixner wrote: > On Thu, 8 Sep 2016, Srinivas Pandruvada wrote: > > > > From: Tim Chen <tim.c.chen@linux.intel.com> > > > > We uses ASYM_PACKING feature in the scheduler to move tasks to more > > capable cpus that can be boosted to higher frequency. This is enabled by > > Intel Turbo Boost Max Technology 3.0 (ITMT). We mark the sched domain > > topology level with SD_ASYM_PACKING flag for such systems to indicate > > scheduler can use the ASYM_PACKING feature to move load to the > > more capable cpus. > Sigh. This changelog does not tell anything about the nature of the patch, > the rationale for it etc. It's just a meaningless blurb. Okay, I can add more details about ITMT. Will also be clearer if the ITMT patch (patch 5) comes before this one. > > +} > > +#else > > +#define x86_core_flags cpu_core_flags > > +#define x86_smt_flags cpu_smt_flags > > +#endif > No. We first rework the code so that the IMT stuff can be added in a later > patch easily. I'll move this patch to come after current patch 5. Thanks. Tim -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, Sep 12, 2016 at 01:50:15PM -0700, Tim Chen wrote: > > No. We first rework the code so that the IMT stuff can be added in a later > > patch easily. > > I'll move this patch to come after current patch 5. Split the patch, one doing the topo cleanup, one adding the x86_*_flags(). -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Tue, Sep 13, 2016 at 09:26:13AM +0200, Peter Zijlstra wrote: > On Mon, Sep 12, 2016 at 01:50:15PM -0700, Tim Chen wrote: > > > No. We first rework the code so that the IMT stuff can be added in a later > > > patch easily. > > > > I'll move this patch to come after current patch 5. > > Split the patch, one doing the topo cleanup, one adding the > x86_*_flags(). hi, could you please CC me on the next version, I'm interested in that cleanup patch.. thanks a lot, jirka -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, 2016-09-19 at 11:11 +0200, Jiri Olsa wrote: > On Tue, Sep 13, 2016 at 09:26:13AM +0200, Peter Zijlstra wrote: > > > > On Mon, Sep 12, 2016 at 01:50:15PM -0700, Tim Chen wrote: > > > > > > > > > > > No. We first rework the code so that the IMT stuff can be added in a later > > > > patch easily. > > > I'll move this patch to come after current patch 5. > > Split the patch, one doing the topo cleanup, one adding the > > x86_*_flags(). > hi, > could you please CC me on the next version, > I'm interested in that cleanup patch.. Sure. Will do. Tim -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/arch/x86/include/asm/topology.h b/arch/x86/include/asm/topology.h index cf75871..98b669d 100644 --- a/arch/x86/include/asm/topology.h +++ b/arch/x86/include/asm/topology.h @@ -146,4 +146,8 @@ struct pci_bus; int x86_pci_root_bus_node(int bus); void x86_pci_root_bus_resources(int bus, struct list_head *resources); +#ifdef CONFIG_SCHED_ITMT +extern unsigned int __read_mostly sysctl_sched_itmt_enabled; +#endif /* CONFIG_SCHED_ITMT */ + #endif /* _ASM_X86_TOPOLOGY_H */ diff --git a/arch/x86/kernel/smpboot.c b/arch/x86/kernel/smpboot.c index 4296beb..3782bd4 100644 --- a/arch/x86/kernel/smpboot.c +++ b/arch/x86/kernel/smpboot.c @@ -109,6 +109,8 @@ static bool logical_packages_frozen __read_mostly; /* Maximum number of SMT threads on any online core */ int __max_smt_threads __read_mostly; +unsigned int __read_mostly sysctl_sched_itmt_enabled; + static inline void smpboot_setup_warm_reset_vector(unsigned long start_eip) { unsigned long flags; @@ -471,31 +473,57 @@ static bool match_die(struct cpuinfo_x86 *c, struct cpuinfo_x86 *o) return false; } -static struct sched_domain_topology_level numa_inside_package_topology[] = { +#ifdef CONFIG_SCHED_ITMT +static int x86_core_flags(void) +{ + int flags = cpu_core_flags(); + + if (sysctl_sched_itmt_enabled) + flags |= SD_ASYM_PACKING; + + return flags; +} + +static int x86_smt_flags(void) +{ + int flags = cpu_smt_flags(); + + if (sysctl_sched_itmt_enabled) + flags |= SD_ASYM_PACKING; + + return flags; +} +#else +#define x86_core_flags cpu_core_flags +#define x86_smt_flags cpu_smt_flags +#endif + +static struct sched_domain_topology_level x86_topology[] = { +#ifdef CONFIG_SCHED_SMT + { cpu_smt_mask, x86_smt_flags, SD_INIT_NAME(SMT) }, +#endif +#ifdef CONFIG_SCHED_MC + { cpu_coregroup_mask, x86_core_flags, SD_INIT_NAME(MC) }, +#endif + { cpu_cpu_mask, SD_INIT_NAME(DIE) }, + { NULL, }, +}; + +static struct sched_domain_topology_level x86_numa_in_package_topology[] = { #ifdef CONFIG_SCHED_SMT - { cpu_smt_mask, cpu_smt_flags, SD_INIT_NAME(SMT) }, + { cpu_smt_mask, x86_smt_flags, SD_INIT_NAME(SMT) }, #endif #ifdef CONFIG_SCHED_MC - { cpu_coregroup_mask, cpu_core_flags, SD_INIT_NAME(MC) }, + { cpu_coregroup_mask, x86_core_flags, SD_INIT_NAME(MC) }, #endif { NULL, }, }; + /* - * set_sched_topology() sets the topology internal to a CPU. The - * NUMA topologies are layered on top of it to build the full - * system topology. - * - * If NUMA nodes are observed to occur within a CPU package, this - * function should be called. It forces the sched domain code to - * only use the SMT level for the CPU portion of the topology. - * This essentially falls back to relying on NUMA information - * from the SRAT table to describe the entire system topology - * (except for hyperthreads). + * Set if a package/die has multiple NUMA nodes inside. + * AMD Magny-Cours and Intel Cluster-on-Die have this. */ -static void primarily_use_numa_for_topology(void) -{ - set_sched_topology(numa_inside_package_topology); -} +static bool x86_has_numa_in_package; void set_cpu_sibling_map(int cpu) { @@ -558,7 +586,7 @@ void set_cpu_sibling_map(int cpu) c->booted_cores = cpu_data(i).booted_cores; } if (match_die(c, o) && !topology_same_node(c, o)) - primarily_use_numa_for_topology(); + x86_has_numa_in_package = true; } threads = cpumask_weight(topology_sibling_cpumask(cpu)); @@ -1304,6 +1332,16 @@ void __init native_smp_prepare_cpus(unsigned int max_cpus) zalloc_cpumask_var(&per_cpu(cpu_core_map, i), GFP_KERNEL); zalloc_cpumask_var(&per_cpu(cpu_llc_shared_map, i), GFP_KERNEL); } + + /* + * Set 'default' x86 topology, this matches default_topology() in that + * it has NUMA nodes as a topology level. See also + * native_smp_cpus_done(). + * + * Must be done before set_cpus_sibling_map() is ran. + */ + set_sched_topology(x86_topology); + set_cpu_sibling_map(0); switch (smp_sanity_check(max_cpus)) { @@ -1370,6 +1408,9 @@ void __init native_smp_cpus_done(unsigned int max_cpus) { pr_debug("Boot done\n"); + if (x86_has_numa_in_package) + set_sched_topology(x86_numa_in_package_topology); + nmi_selftest(); impress_friends(); setup_ioapic_dest(); diff --git a/kernel/sched/core.c b/kernel/sched/core.c index 2a906f2..e86c4a5 100644 --- a/kernel/sched/core.c +++ b/kernel/sched/core.c @@ -6487,6 +6487,9 @@ static struct sched_domain_topology_level *sched_domain_topology = void set_sched_topology(struct sched_domain_topology_level *tl) { + if (WARN_ON_ONCE(sched_smp_initialized)) + return; + sched_domain_topology = tl; }