Message ID | 20230912021615.2086698-4-matthew.brost@intel.com (mailing list archive)
---|---
State | New, archived
Series | DRM scheduler changes for Xe
On Mon, 11 Sep 2023 19:16:05 -0700
Matthew Brost <matthew.brost@intel.com> wrote:

> Rather than a global modparam for scheduling policy, move the scheduling
> policy to scheduler / entity so user can control each scheduler / entity
> policy.

I'm a bit confused by the commit message (I think I'm okay with the
diff though). Sounds like entity is involved in the sched policy
choice, but AFAICT, it just has to live with the scheduler policy chosen
by the driver at init time. If my understanding is correct, I'd just
drop the ' / entity'.
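In other words, under this patch the policy is fixed once per scheduler when the driver calls drm_sched_init(); an entity only ever inherits the policy of the scheduler(s) it is attached to. A rough sketch with a hypothetical driver (my_dev, my_sched_ops, ctx and the limits below are illustrative, not from the patch):

	struct drm_gpu_scheduler *sched_list[] = { &my_dev->sched };
	int ret;

	/* The scheduling policy is chosen here, once, for the whole scheduler. */
	ret = drm_sched_init(&my_dev->sched, &my_sched_ops, NULL,
			     hw_submission_limit, hang_limit,
			     msecs_to_jiffies(timeout_ms), NULL,
			     NULL, "my_sched", DRM_SCHED_POLICY_FIFO,
			     my_dev->drm.dev);
	if (ret)
		return ret;

	/* No policy argument here: the entity lives with sched_list[0]'s policy. */
	ret = drm_sched_entity_init(&ctx->entity, DRM_SCHED_PRIORITY_NORMAL,
				    sched_list, ARRAY_SIZE(sched_list), NULL);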
Hi Matthew,

kernel test robot noticed the following build errors:

[auto build test ERROR on drm/drm-next]
[also build test ERROR on drm-exynos/exynos-drm-next drm-intel/for-linux-next-fixes drm-tip/drm-tip linus/master v6.6-rc1 next-20230912]
[cannot apply to drm-misc/drm-misc-next drm-intel/for-linux-next]
[If your patch is applied to the wrong git tree, kindly drop us a note.
And when submitting patch, we suggest to use '--base' as documented in
https://git-scm.com/docs/git-format-patch#_base_tree_information]

url:    https://github.com/intel-lab-lkp/linux/commits/Matthew-Brost/drm-sched-Add-drm_sched_submit_-helpers/20230912-102001
base:   git://anongit.freedesktop.org/drm/drm drm-next
patch link:    https://lore.kernel.org/r/20230912021615.2086698-4-matthew.brost%40intel.com
patch subject: [PATCH v3 03/13] drm/sched: Move schedule policy to scheduler / entity
config: arm64-randconfig-r032-20230912 (https://download.01.org/0day-ci/archive/20230912/202309122100.HAEi8ytJ-lkp@intel.com/config)
compiler: clang version 16.0.4 (https://github.com/llvm/llvm-project.git ae42196bc493ffe877a7e3dff8be32035dea4d07)
reproduce (this is a W=1 build): (https://download.01.org/0day-ci/archive/20230912/202309122100.HAEi8ytJ-lkp@intel.com/reproduce)

If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <lkp@intel.com>
| Closes: https://lore.kernel.org/oe-kbuild-all/202309122100.HAEi8ytJ-lkp@intel.com/

All errors (new ones prefixed by >>):

>> drivers/gpu/drm/v3d/v3d_sched.c:403:9: error: use of undeclared identifier 'ULL'
                                ULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT,
                                ^
   1 error generated.


vim +/ULL +403 drivers/gpu/drm/v3d/v3d_sched.c

   381
   382	int
   383	v3d_sched_init(struct v3d_dev *v3d)
   384	{
   385		int hw_jobs_limit = 1;
   386		int job_hang_limit = 0;
   387		int hang_limit_ms = 500;
   388		int ret;
   389
   390		ret = drm_sched_init(&v3d->queue[V3D_BIN].sched,
   391				     &v3d_bin_sched_ops, NULL,
   392				     hw_jobs_limit, job_hang_limit,
   393				     msecs_to_jiffies(hang_limit_ms), NULL,
   394				     NULL, "v3d_bin", DRM_SCHED_POLICY_DEFAULT,
   395				     v3d->drm.dev);
   396		if (ret)
   397			return ret;
   398
   399		ret = drm_sched_init(&v3d->queue[V3D_RENDER].sched,
   400				     &v3d_render_sched_ops, NULL,
   401				     hw_jobs_limit, job_hang_limit,
   402				     msecs_to_jiffies(hang_limit_ms), NULL,
 > 403				     ULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT,
   404				     v3d->drm.dev);
   405		if (ret)
   406			goto fail;
   407
   408		ret = drm_sched_init(&v3d->queue[V3D_TFU].sched,
   409				     &v3d_tfu_sched_ops, NULL,
   410				     hw_jobs_limit, job_hang_limit,
   411				     msecs_to_jiffies(hang_limit_ms), NULL,
   412				     NULL, "v3d_tfu", DRM_SCHED_POLICY_DEFAULT,
   413				     v3d->drm.dev);
   414		if (ret)
   415			goto fail;
   416
   417		if (v3d_has_csd(v3d)) {
   418			ret = drm_sched_init(&v3d->queue[V3D_CSD].sched,
   419					     &v3d_csd_sched_ops, NULL,
   420					     hw_jobs_limit, job_hang_limit,
   421					     msecs_to_jiffies(hang_limit_ms), NULL,
   422					     NULL, "v3d_csd", DRM_SCHED_POLICY_DEFAULT,
   423					     v3d->drm.dev);
   424			if (ret)
   425				goto fail;
   426
   427			ret = drm_sched_init(&v3d->queue[V3D_CACHE_CLEAN].sched,
   428					     &v3d_cache_clean_sched_ops, NULL,
   429					     hw_jobs_limit, job_hang_limit,
   430					     msecs_to_jiffies(hang_limit_ms), NULL,
   431					     NULL, "v3d_cache_clean",
   432					     DRM_SCHED_POLICY_DEFAULT, v3d->drm.dev);
   433			if (ret)
   434				goto fail;
   435		}
   436
   437		return 0;
   438
   439	fail:
   440		v3d_sched_fini(v3d);
   441		return ret;
   442	}
   443
On Tue, Sep 12, 2023 at 09:37:06AM +0200, Boris Brezillon wrote:
> On Mon, 11 Sep 2023 19:16:05 -0700
> Matthew Brost <matthew.brost@intel.com> wrote:
>
> > Rather than a global modparam for scheduling policy, move the scheduling
> > policy to scheduler / entity so user can control each scheduler / entity
> > policy.
>
> I'm a bit confused by the commit message (I think I'm okay with the
> diff though). Sounds like entity is involved in the sched policy
> choice, but AFAICT, it just has to live with the scheduler policy chosen
> by the driver at init time. If my understanding is correct, I'd just
> drop the ' / entity'.

Yep, stale commit message. Will fix in next rev.

Matt
On Tue, Sep 12, 2023 at 10:11:56PM +0800, kernel test robot wrote: > Hi Matthew, > > kernel test robot noticed the following build errors: > > [auto build test ERROR on drm/drm-next] > [also build test ERROR on drm-exynos/exynos-drm-next drm-intel/for-linux-next-fixes drm-tip/drm-tip linus/master v6.6-rc1 next-20230912] > [cannot apply to drm-misc/drm-misc-next drm-intel/for-linux-next] > [If your patch is applied to the wrong git tree, kindly drop us a note. > And when submitting patch, we suggest to use '--base' as documented in > https://git-scm.com/docs/git-format-patch#_base_tree_information] > > url: https://github.com/intel-lab-lkp/linux/commits/Matthew-Brost/drm-sched-Add-drm_sched_submit_-helpers/20230912-102001 > base: git://anongit.freedesktop.org/drm/drm drm-next > patch link: https://lore.kernel.org/r/20230912021615.2086698-4-matthew.brost%40intel.com > patch subject: [PATCH v3 03/13] drm/sched: Move schedule policy to scheduler / entity > config: arm64-randconfig-r032-20230912 (https://download.01.org/0day-ci/archive/20230912/202309122100.HAEi8ytJ-lkp@intel.com/config) > compiler: clang version 16.0.4 (https://github.com/llvm/llvm-project.git ae42196bc493ffe877a7e3dff8be32035dea4d07) > reproduce (this is a W=1 build): (https://download.01.org/0day-ci/archive/20230912/202309122100.HAEi8ytJ-lkp@intel.com/reproduce) > > If you fix the issue in a separate patch/commit (i.e. not just a new version of > the same patch/commit), kindly add following tags > | Reported-by: kernel test robot <lkp@intel.com> > | Closes: https://lore.kernel.org/oe-kbuild-all/202309122100.HAEi8ytJ-lkp@intel.com/ > > All errors (new ones prefixed by >>): > > >> drivers/gpu/drm/v3d/v3d_sched.c:403:9: error: use of undeclared identifier 'ULL' > ULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT, > ^ Typos, s/ULL/NULL in next rev. Matt > 1 error generated. 
> > > vim +/ULL +403 drivers/gpu/drm/v3d/v3d_sched.c > > 381 > 382 int > 383 v3d_sched_init(struct v3d_dev *v3d) > 384 { > 385 int hw_jobs_limit = 1; > 386 int job_hang_limit = 0; > 387 int hang_limit_ms = 500; > 388 int ret; > 389 > 390 ret = drm_sched_init(&v3d->queue[V3D_BIN].sched, > 391 &v3d_bin_sched_ops, NULL, > 392 hw_jobs_limit, job_hang_limit, > 393 msecs_to_jiffies(hang_limit_ms), NULL, > 394 NULL, "v3d_bin", DRM_SCHED_POLICY_DEFAULT, > 395 v3d->drm.dev); > 396 if (ret) > 397 return ret; > 398 > 399 ret = drm_sched_init(&v3d->queue[V3D_RENDER].sched, > 400 &v3d_render_sched_ops, NULL, > 401 hw_jobs_limit, job_hang_limit, > 402 msecs_to_jiffies(hang_limit_ms), NULL, > > 403 ULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT, > 404 v3d->drm.dev); > 405 if (ret) > 406 goto fail; > 407 > 408 ret = drm_sched_init(&v3d->queue[V3D_TFU].sched, > 409 &v3d_tfu_sched_ops, NULL, > 410 hw_jobs_limit, job_hang_limit, > 411 msecs_to_jiffies(hang_limit_ms), NULL, > 412 NULL, "v3d_tfu", DRM_SCHED_POLICY_DEFAULT, > 413 v3d->drm.dev); > 414 if (ret) > 415 goto fail; > 416 > 417 if (v3d_has_csd(v3d)) { > 418 ret = drm_sched_init(&v3d->queue[V3D_CSD].sched, > 419 &v3d_csd_sched_ops, NULL, > 420 hw_jobs_limit, job_hang_limit, > 421 msecs_to_jiffies(hang_limit_ms), NULL, > 422 NULL, "v3d_csd", DRM_SCHED_POLICY_DEFAULT, > 423 v3d->drm.dev); > 424 if (ret) > 425 goto fail; > 426 > 427 ret = drm_sched_init(&v3d->queue[V3D_CACHE_CLEAN].sched, > 428 &v3d_cache_clean_sched_ops, NULL, > 429 hw_jobs_limit, job_hang_limit, > 430 msecs_to_jiffies(hang_limit_ms), NULL, > 431 NULL, "v3d_cache_clean", > 432 DRM_SCHED_POLICY_DEFAULT, v3d->drm.dev); > 433 if (ret) > 434 goto fail; > 435 } > 436 > 437 return 0; > 438 > 439 fail: > 440 v3d_sched_fini(v3d); > 441 return ret; > 442 } > 443 > > -- > 0-DAY CI Kernel Test Service > https://github.com/intel/lkp-tests/wiki
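The fix itself is just restoring the NULL score argument that got mangled into "ULL" in the v3d_render call; the corrected hunk would presumably read:

	ret = drm_sched_init(&v3d->queue[V3D_RENDER].sched,
			     &v3d_render_sched_ops, NULL,
			     hw_jobs_limit, job_hang_limit,
			     msecs_to_jiffies(hang_limit_ms), NULL,
			     NULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT,
			     v3d->drm.dev);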
On 2023-09-11 22:16, Matthew Brost wrote: > Rather than a global modparam for scheduling policy, move the scheduling > policy to scheduler / entity so user can control each scheduler / entity > policy. > > v2: > - s/DRM_SCHED_POLICY_MAX/DRM_SCHED_POLICY_COUNT (Luben) > - Only include policy in scheduler (Luben) > > Signed-off-by: Matthew Brost <matthew.brost@intel.com> > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + > drivers/gpu/drm/etnaviv/etnaviv_sched.c | 3 ++- > drivers/gpu/drm/lima/lima_sched.c | 3 ++- > drivers/gpu/drm/msm/msm_ringbuffer.c | 3 ++- > drivers/gpu/drm/nouveau/nouveau_sched.c | 3 ++- > drivers/gpu/drm/panfrost/panfrost_job.c | 3 ++- > drivers/gpu/drm/scheduler/sched_entity.c | 24 ++++++++++++++++++---- > drivers/gpu/drm/scheduler/sched_main.c | 23 +++++++++++++++------ > drivers/gpu/drm/v3d/v3d_sched.c | 15 +++++++++----- > include/drm/gpu_scheduler.h | 20 ++++++++++++------ > 10 files changed, 72 insertions(+), 26 deletions(-) > > diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > index c83a76bccc1d..ecb00991dd51 100644 > --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > @@ -2309,6 +2309,7 @@ static int amdgpu_device_init_schedulers(struct amdgpu_device *adev) > ring->num_hw_submission, 0, > timeout, adev->reset_domain->wq, > ring->sched_score, ring->name, > + DRM_SCHED_POLICY_DEFAULT, > adev->dev); > if (r) { > DRM_ERROR("Failed to create scheduler on ring %s.\n", > diff --git a/drivers/gpu/drm/etnaviv/etnaviv_sched.c b/drivers/gpu/drm/etnaviv/etnaviv_sched.c > index 618a804ddc34..3646f995ca94 100644 > --- a/drivers/gpu/drm/etnaviv/etnaviv_sched.c > +++ b/drivers/gpu/drm/etnaviv/etnaviv_sched.c > @@ -137,7 +137,8 @@ int etnaviv_sched_init(struct etnaviv_gpu *gpu) > ret = drm_sched_init(&gpu->sched, &etnaviv_sched_ops, NULL, > etnaviv_hw_jobs_limit, etnaviv_job_hang_limit, > msecs_to_jiffies(500), NULL, NULL, > - dev_name(gpu->dev), gpu->dev); > + dev_name(gpu->dev), DRM_SCHED_POLICY_DEFAULT, > + gpu->dev); > if (ret) > return ret; > > diff --git a/drivers/gpu/drm/lima/lima_sched.c b/drivers/gpu/drm/lima/lima_sched.c > index 8d858aed0e56..465d4bf3882b 100644 > --- a/drivers/gpu/drm/lima/lima_sched.c > +++ b/drivers/gpu/drm/lima/lima_sched.c > @@ -491,7 +491,8 @@ int lima_sched_pipe_init(struct lima_sched_pipe *pipe, const char *name) > return drm_sched_init(&pipe->base, &lima_sched_ops, NULL, 1, > lima_job_hang_limit, > msecs_to_jiffies(timeout), NULL, > - NULL, name, pipe->ldev->dev); > + NULL, name, DRM_SCHED_POLICY_DEFAULT, > + pipe->ldev->dev); > } > > void lima_sched_pipe_fini(struct lima_sched_pipe *pipe) > diff --git a/drivers/gpu/drm/msm/msm_ringbuffer.c b/drivers/gpu/drm/msm/msm_ringbuffer.c > index b8865e61b40f..f45e674a0aaf 100644 > --- a/drivers/gpu/drm/msm/msm_ringbuffer.c > +++ b/drivers/gpu/drm/msm/msm_ringbuffer.c > @@ -96,7 +96,8 @@ struct msm_ringbuffer *msm_ringbuffer_new(struct msm_gpu *gpu, int id, > > ret = drm_sched_init(&ring->sched, &msm_sched_ops, NULL, > num_hw_submissions, 0, sched_timeout, > - NULL, NULL, to_msm_bo(ring->bo)->name, gpu->dev->dev); > + NULL, NULL, to_msm_bo(ring->bo)->name, > + DRM_SCHED_POLICY_DEFAULT, gpu->dev->dev); > if (ret) { > goto fail; > } > diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.c b/drivers/gpu/drm/nouveau/nouveau_sched.c > index d458c2227d4f..70e497e40c70 100644 > --- a/drivers/gpu/drm/nouveau/nouveau_sched.c > +++ b/drivers/gpu/drm/nouveau/nouveau_sched.c > @@ -431,7 +431,8 @@ int 
nouveau_sched_init(struct nouveau_drm *drm) > > return drm_sched_init(sched, &nouveau_sched_ops, NULL, > NOUVEAU_SCHED_HW_SUBMISSIONS, 0, job_hang_limit, > - NULL, NULL, "nouveau_sched", drm->dev->dev); > + NULL, NULL, "nouveau_sched", > + DRM_SCHED_POLICY_DEFAULT, drm->dev->dev); > } > > void nouveau_sched_fini(struct nouveau_drm *drm) > diff --git a/drivers/gpu/drm/panfrost/panfrost_job.c b/drivers/gpu/drm/panfrost/panfrost_job.c > index 326ca1ddf1d7..ad36bf3a4699 100644 > --- a/drivers/gpu/drm/panfrost/panfrost_job.c > +++ b/drivers/gpu/drm/panfrost/panfrost_job.c > @@ -835,7 +835,8 @@ int panfrost_job_init(struct panfrost_device *pfdev) > nentries, 0, > msecs_to_jiffies(JOB_TIMEOUT_MS), > pfdev->reset.wq, > - NULL, "pan_js", pfdev->dev); > + NULL, "pan_js", DRM_SCHED_POLICY_DEFAULT, > + pfdev->dev); > if (ret) { > dev_err(pfdev->dev, "Failed to create scheduler: %d.", ret); > goto err_sched; > diff --git a/drivers/gpu/drm/scheduler/sched_entity.c b/drivers/gpu/drm/scheduler/sched_entity.c > index a42763e1429d..65a972b52eda 100644 > --- a/drivers/gpu/drm/scheduler/sched_entity.c > +++ b/drivers/gpu/drm/scheduler/sched_entity.c > @@ -33,6 +33,20 @@ > #define to_drm_sched_job(sched_job) \ > container_of((sched_job), struct drm_sched_job, queue_node) > > +static bool bad_policies(struct drm_gpu_scheduler **sched_list, > + unsigned int num_sched_list) Rename the function to the status quo, drm_sched_policy_mismatch(... > +{ > + enum drm_sched_policy sched_policy = sched_list[0]->sched_policy; > + unsigned int i; > + > + /* All schedule policies must match */ > + for (i = 1; i < num_sched_list; ++i) > + if (sched_policy != sched_list[i]->sched_policy) > + return true; > + > + return false; > +} > + > /** > * drm_sched_entity_init - Init a context entity used by scheduler when > * submit to HW ring. > @@ -62,7 +76,8 @@ int drm_sched_entity_init(struct drm_sched_entity *entity, > unsigned int num_sched_list, > atomic_t *guilty) > { > - if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0]))) > + if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0])) || > + bad_policies(sched_list, num_sched_list)) > return -EINVAL; > > memset(entity, 0, sizeof(struct drm_sched_entity)); > @@ -486,7 +501,7 @@ struct drm_sched_job *drm_sched_entity_pop_job(struct drm_sched_entity *entity) > * Update the entity's location in the min heap according to > * the timestamp of the next job, if any. 
> */ > - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) { > + if (entity->rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) { > struct drm_sched_job *next; > > next = to_drm_sched_job(spsc_queue_peek(&entity->job_queue)); > @@ -558,7 +573,8 @@ void drm_sched_entity_select_rq(struct drm_sched_entity *entity) > void drm_sched_entity_push_job(struct drm_sched_job *sched_job) > { > struct drm_sched_entity *entity = sched_job->entity; > - bool first; > + bool first, fifo = entity->rq->sched->sched_policy == > + DRM_SCHED_POLICY_FIFO; > ktime_t submit_ts; > > trace_drm_sched_job(sched_job, entity); > @@ -587,7 +603,7 @@ void drm_sched_entity_push_job(struct drm_sched_job *sched_job) > drm_sched_rq_add_entity(entity->rq, entity); > spin_unlock(&entity->rq_lock); > > - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) > + if (fifo) > drm_sched_rq_update_fifo(entity, submit_ts); > > drm_sched_wakeup_if_can_queue(entity->rq->sched); > diff --git a/drivers/gpu/drm/scheduler/sched_main.c b/drivers/gpu/drm/scheduler/sched_main.c > index 614e8c97a622..545d5298c086 100644 > --- a/drivers/gpu/drm/scheduler/sched_main.c > +++ b/drivers/gpu/drm/scheduler/sched_main.c > @@ -66,14 +66,14 @@ > #define to_drm_sched_job(sched_job) \ > container_of((sched_job), struct drm_sched_job, queue_node) > > -int drm_sched_policy = DRM_SCHED_POLICY_FIFO; > +int default_drm_sched_policy = DRM_SCHED_POLICY_FIFO; > > /** > * DOC: sched_policy (int) > * Used to override default entities scheduling policy in a run queue. > */ > -MODULE_PARM_DESC(sched_policy, "Specify the scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); > -module_param_named(sched_policy, drm_sched_policy, int, 0444); > +MODULE_PARM_DESC(sched_policy, "Specify the default scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); Note, that you don't need to add "default" in the text as it is already there at the very end "FIFO (default)." Else, it gets confusing what is meant by "default". Like this: Specify the default scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). See "default" appear twice and it creates confusion? We don't need our internal "default" play to get exported all the way to the casual user reading this. It is much clear, however, Specify the scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). To mean, if unset, the default one would be used. But this is all internal code stuff. So I'd say leave this one alone. > +module_param_named(sched_policy, default_drm_sched_policy, int, 0444); Put "default" as a postfix: default_drm_sched_policy --> drm_sched_policy_default > > static __always_inline bool drm_sched_entity_compare_before(struct rb_node *a, > const struct rb_node *b) > @@ -177,7 +177,7 @@ void drm_sched_rq_remove_entity(struct drm_sched_rq *rq, > if (rq->current_entity == entity) > rq->current_entity = NULL; > > - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) > + if (rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) > drm_sched_rq_remove_fifo_locked(entity); > > spin_unlock(&rq->lock); > @@ -898,7 +898,7 @@ drm_sched_select_entity(struct drm_gpu_scheduler *sched) > > /* Kernel run queue has higher priority than normal run queue*/ > for (i = DRM_SCHED_PRIORITY_COUNT - 1; i >= DRM_SCHED_PRIORITY_MIN; i--) { > - entity = drm_sched_policy == DRM_SCHED_POLICY_FIFO ? 
> + entity = sched->sched_policy == DRM_SCHED_POLICY_FIFO ? > drm_sched_rq_select_entity_fifo(&sched->sched_rq[i]) : > drm_sched_rq_select_entity_rr(&sched->sched_rq[i]); > if (entity) > @@ -1071,6 +1071,7 @@ static void drm_sched_main(struct work_struct *w) > * used > * @score: optional score atomic shared with other schedulers > * @name: name used for debugging > + * @sched_policy: schedule policy > * @dev: target &struct device > * > * Return 0 on success, otherwise error code. > @@ -1080,9 +1081,15 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > struct workqueue_struct *submit_wq, > unsigned hw_submission, unsigned hang_limit, > long timeout, struct workqueue_struct *timeout_wq, > - atomic_t *score, const char *name, struct device *dev) > + atomic_t *score, const char *name, > + enum drm_sched_policy sched_policy, > + struct device *dev) > { > int i; > + > + if (sched_policy >= DRM_SCHED_POLICY_COUNT) > + return -EINVAL; > + > sched->ops = ops; > sched->hw_submission_limit = hw_submission; > sched->name = name; > @@ -1092,6 +1099,10 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > sched->hang_limit = hang_limit; > sched->score = score ? score : &sched->_score; > sched->dev = dev; > + if (sched_policy == DRM_SCHED_POLICY_DEFAULT) > + sched->sched_policy = default_drm_sched_policy; > + else > + sched->sched_policy = sched_policy; > for (i = DRM_SCHED_PRIORITY_MIN; i < DRM_SCHED_PRIORITY_COUNT; i++) > drm_sched_rq_init(sched, &sched->sched_rq[i]); > > diff --git a/drivers/gpu/drm/v3d/v3d_sched.c b/drivers/gpu/drm/v3d/v3d_sched.c > index 38e092ea41e6..5e3fe77fa991 100644 > --- a/drivers/gpu/drm/v3d/v3d_sched.c > +++ b/drivers/gpu/drm/v3d/v3d_sched.c > @@ -391,7 +391,8 @@ v3d_sched_init(struct v3d_dev *v3d) > &v3d_bin_sched_ops, NULL, > hw_jobs_limit, job_hang_limit, > msecs_to_jiffies(hang_limit_ms), NULL, > - NULL, "v3d_bin", v3d->drm.dev); > + NULL, "v3d_bin", DRM_SCHED_POLICY_DEFAULT, > + v3d->drm.dev); > if (ret) > return ret; > > @@ -399,7 +400,8 @@ v3d_sched_init(struct v3d_dev *v3d) > &v3d_render_sched_ops, NULL, > hw_jobs_limit, job_hang_limit, > msecs_to_jiffies(hang_limit_ms), NULL, > - NULL, "v3d_render", v3d->drm.dev); > + ULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT, > + v3d->drm.dev); > if (ret) > goto fail; > > @@ -407,7 +409,8 @@ v3d_sched_init(struct v3d_dev *v3d) > &v3d_tfu_sched_ops, NULL, > hw_jobs_limit, job_hang_limit, > msecs_to_jiffies(hang_limit_ms), NULL, > - NULL, "v3d_tfu", v3d->drm.dev); > + NULL, "v3d_tfu", DRM_SCHED_POLICY_DEFAULT, > + v3d->drm.dev); > if (ret) > goto fail; > > @@ -416,7 +419,8 @@ v3d_sched_init(struct v3d_dev *v3d) > &v3d_csd_sched_ops, NULL, > hw_jobs_limit, job_hang_limit, > msecs_to_jiffies(hang_limit_ms), NULL, > - NULL, "v3d_csd", v3d->drm.dev); > + NULL, "v3d_csd", DRM_SCHED_POLICY_DEFAULT, > + v3d->drm.dev); > if (ret) > goto fail; > > @@ -424,7 +428,8 @@ v3d_sched_init(struct v3d_dev *v3d) > &v3d_cache_clean_sched_ops, NULL, > hw_jobs_limit, job_hang_limit, > msecs_to_jiffies(hang_limit_ms), NULL, > - NULL, "v3d_cache_clean", v3d->drm.dev); > + NULL, "v3d_cache_clean", > + DRM_SCHED_POLICY_DEFAULT, v3d->drm.dev); > if (ret) > goto fail; > } > diff --git a/include/drm/gpu_scheduler.h b/include/drm/gpu_scheduler.h > index 278106e358a8..897d52a4ff4f 100644 > --- a/include/drm/gpu_scheduler.h > +++ b/include/drm/gpu_scheduler.h > @@ -72,11 +72,15 @@ enum drm_sched_priority { > DRM_SCHED_PRIORITY_UNSET = -2 > }; > > -/* Used to chose between FIFO and RR jobs scheduling */ > -extern int drm_sched_policy; > - > -#define 
DRM_SCHED_POLICY_RR 0 > -#define DRM_SCHED_POLICY_FIFO 1 > +/* Used to chose default scheduling policy*/ > +extern int default_drm_sched_policy; > + > +enum drm_sched_policy { > + DRM_SCHED_POLICY_DEFAULT, > + DRM_SCHED_POLICY_RR, > + DRM_SCHED_POLICY_FIFO, > + DRM_SCHED_POLICY_COUNT, > +}; No. Use as the first (0th) element name "DRM_SCHED_POLICY_UNSET". The DRM scheduling policies are, * unset, meaning no preference, whatever the default is, (but that's NOT the "default"), * Round-Robin, and * FIFO. "Default" is a _result_ of the policy being _unset_. "Default" is not a policy. IOW, we want to say, "If you don't set the policy (i.e. it's unset), we'll set it to the default one, which could be either Round-Robin, or FIFO." It may look a bit strange in function calls up there, "What do you mean `unset'? What is it?" but it needs to be understood that the _policy_ is "unset", "rr", or "fifo", and if it is "unset", we'll set it to whatever the default one was set to, at boot/compile time, RR or FIFO. Note that "unset" is equivalent to a function not having the policy parameter altogether (as right now). Now that you're adding it, you can extend that, as opposed to renaming the enum to "DEFAULT" to tell the caller that it will be set to the default one. But we don't need to tell function behaviour in the name of a function parameter/enum element. > > /** > * struct drm_sched_entity - A wrapper around a job queue (typically > @@ -489,6 +493,7 @@ struct drm_sched_backend_ops { > * guilty and it will no longer be considered for scheduling. > * @score: score to help loadbalancer pick a idle sched > * @_score: score used when the driver doesn't provide one > + * @sched_policy: Schedule policy for scheduler > * @ready: marks if the underlying HW is ready to work > * @free_guilty: A hit to time out handler to free the guilty job. > * @pause_submit: pause queuing of @work_submit on @submit_wq > @@ -514,6 +519,7 @@ struct drm_gpu_scheduler { > int hang_limit; > atomic_t *score; > atomic_t _score; > + enum drm_sched_policy sched_policy; > bool ready; > bool free_guilty; > bool pause_submit; > @@ -525,7 +531,9 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > struct workqueue_struct *submit_wq, > uint32_t hw_submission, unsigned hang_limit, > long timeout, struct workqueue_struct *timeout_wq, > - atomic_t *score, const char *name, struct device *dev); > + atomic_t *score, const char *name, > + enum drm_sched_policy sched_policy, > + struct device *dev); > > void drm_sched_fini(struct drm_gpu_scheduler *sched); > int drm_sched_job_init(struct drm_sched_job *job,
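Concretely, what Luben is asking for is roughly the following shape (a sketch of the suggestion, not the actual next revision):

	enum drm_sched_policy {
		DRM_SCHED_POLICY_UNSET,	/* no preference; resolved to the default */
		DRM_SCHED_POLICY_RR,
		DRM_SCHED_POLICY_FIFO,
		DRM_SCHED_POLICY_COUNT,
	};

A driver that does not care passes DRM_SCHED_POLICY_UNSET to drm_sched_init(), and the scheduler core resolves that to whatever the boot/compile-time default is (RR or FIFO).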
On 2023-09-14 00:18, Luben Tuikov wrote: > On 2023-09-11 22:16, Matthew Brost wrote: >> Rather than a global modparam for scheduling policy, move the scheduling >> policy to scheduler / entity so user can control each scheduler / entity >> policy. >> >> v2: >> - s/DRM_SCHED_POLICY_MAX/DRM_SCHED_POLICY_COUNT (Luben) >> - Only include policy in scheduler (Luben) >> >> Signed-off-by: Matthew Brost <matthew.brost@intel.com> >> --- >> drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + >> drivers/gpu/drm/etnaviv/etnaviv_sched.c | 3 ++- >> drivers/gpu/drm/lima/lima_sched.c | 3 ++- >> drivers/gpu/drm/msm/msm_ringbuffer.c | 3 ++- >> drivers/gpu/drm/nouveau/nouveau_sched.c | 3 ++- >> drivers/gpu/drm/panfrost/panfrost_job.c | 3 ++- >> drivers/gpu/drm/scheduler/sched_entity.c | 24 ++++++++++++++++++---- >> drivers/gpu/drm/scheduler/sched_main.c | 23 +++++++++++++++------ >> drivers/gpu/drm/v3d/v3d_sched.c | 15 +++++++++----- >> include/drm/gpu_scheduler.h | 20 ++++++++++++------ >> 10 files changed, 72 insertions(+), 26 deletions(-) >> >> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c >> index c83a76bccc1d..ecb00991dd51 100644 >> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c >> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c >> @@ -2309,6 +2309,7 @@ static int amdgpu_device_init_schedulers(struct amdgpu_device *adev) >> ring->num_hw_submission, 0, >> timeout, adev->reset_domain->wq, >> ring->sched_score, ring->name, >> + DRM_SCHED_POLICY_DEFAULT, >> adev->dev); >> if (r) { >> DRM_ERROR("Failed to create scheduler on ring %s.\n", >> diff --git a/drivers/gpu/drm/etnaviv/etnaviv_sched.c b/drivers/gpu/drm/etnaviv/etnaviv_sched.c >> index 618a804ddc34..3646f995ca94 100644 >> --- a/drivers/gpu/drm/etnaviv/etnaviv_sched.c >> +++ b/drivers/gpu/drm/etnaviv/etnaviv_sched.c >> @@ -137,7 +137,8 @@ int etnaviv_sched_init(struct etnaviv_gpu *gpu) >> ret = drm_sched_init(&gpu->sched, &etnaviv_sched_ops, NULL, >> etnaviv_hw_jobs_limit, etnaviv_job_hang_limit, >> msecs_to_jiffies(500), NULL, NULL, >> - dev_name(gpu->dev), gpu->dev); >> + dev_name(gpu->dev), DRM_SCHED_POLICY_DEFAULT, >> + gpu->dev); >> if (ret) >> return ret; >> >> diff --git a/drivers/gpu/drm/lima/lima_sched.c b/drivers/gpu/drm/lima/lima_sched.c >> index 8d858aed0e56..465d4bf3882b 100644 >> --- a/drivers/gpu/drm/lima/lima_sched.c >> +++ b/drivers/gpu/drm/lima/lima_sched.c >> @@ -491,7 +491,8 @@ int lima_sched_pipe_init(struct lima_sched_pipe *pipe, const char *name) >> return drm_sched_init(&pipe->base, &lima_sched_ops, NULL, 1, >> lima_job_hang_limit, >> msecs_to_jiffies(timeout), NULL, >> - NULL, name, pipe->ldev->dev); >> + NULL, name, DRM_SCHED_POLICY_DEFAULT, >> + pipe->ldev->dev); >> } >> >> void lima_sched_pipe_fini(struct lima_sched_pipe *pipe) >> diff --git a/drivers/gpu/drm/msm/msm_ringbuffer.c b/drivers/gpu/drm/msm/msm_ringbuffer.c >> index b8865e61b40f..f45e674a0aaf 100644 >> --- a/drivers/gpu/drm/msm/msm_ringbuffer.c >> +++ b/drivers/gpu/drm/msm/msm_ringbuffer.c >> @@ -96,7 +96,8 @@ struct msm_ringbuffer *msm_ringbuffer_new(struct msm_gpu *gpu, int id, >> >> ret = drm_sched_init(&ring->sched, &msm_sched_ops, NULL, >> num_hw_submissions, 0, sched_timeout, >> - NULL, NULL, to_msm_bo(ring->bo)->name, gpu->dev->dev); >> + NULL, NULL, to_msm_bo(ring->bo)->name, >> + DRM_SCHED_POLICY_DEFAULT, gpu->dev->dev); >> if (ret) { >> goto fail; >> } >> diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.c b/drivers/gpu/drm/nouveau/nouveau_sched.c >> index d458c2227d4f..70e497e40c70 100644 >> --- 
a/drivers/gpu/drm/nouveau/nouveau_sched.c >> +++ b/drivers/gpu/drm/nouveau/nouveau_sched.c >> @@ -431,7 +431,8 @@ int nouveau_sched_init(struct nouveau_drm *drm) >> >> return drm_sched_init(sched, &nouveau_sched_ops, NULL, >> NOUVEAU_SCHED_HW_SUBMISSIONS, 0, job_hang_limit, >> - NULL, NULL, "nouveau_sched", drm->dev->dev); >> + NULL, NULL, "nouveau_sched", >> + DRM_SCHED_POLICY_DEFAULT, drm->dev->dev); >> } >> >> void nouveau_sched_fini(struct nouveau_drm *drm) >> diff --git a/drivers/gpu/drm/panfrost/panfrost_job.c b/drivers/gpu/drm/panfrost/panfrost_job.c >> index 326ca1ddf1d7..ad36bf3a4699 100644 >> --- a/drivers/gpu/drm/panfrost/panfrost_job.c >> +++ b/drivers/gpu/drm/panfrost/panfrost_job.c >> @@ -835,7 +835,8 @@ int panfrost_job_init(struct panfrost_device *pfdev) >> nentries, 0, >> msecs_to_jiffies(JOB_TIMEOUT_MS), >> pfdev->reset.wq, >> - NULL, "pan_js", pfdev->dev); >> + NULL, "pan_js", DRM_SCHED_POLICY_DEFAULT, >> + pfdev->dev); >> if (ret) { >> dev_err(pfdev->dev, "Failed to create scheduler: %d.", ret); >> goto err_sched; >> diff --git a/drivers/gpu/drm/scheduler/sched_entity.c b/drivers/gpu/drm/scheduler/sched_entity.c >> index a42763e1429d..65a972b52eda 100644 >> --- a/drivers/gpu/drm/scheduler/sched_entity.c >> +++ b/drivers/gpu/drm/scheduler/sched_entity.c >> @@ -33,6 +33,20 @@ >> #define to_drm_sched_job(sched_job) \ >> container_of((sched_job), struct drm_sched_job, queue_node) >> >> +static bool bad_policies(struct drm_gpu_scheduler **sched_list, >> + unsigned int num_sched_list) > > Rename the function to the status quo, > drm_sched_policy_mismatch(... > >> +{ >> + enum drm_sched_policy sched_policy = sched_list[0]->sched_policy; >> + unsigned int i; >> + >> + /* All schedule policies must match */ >> + for (i = 1; i < num_sched_list; ++i) >> + if (sched_policy != sched_list[i]->sched_policy) >> + return true; >> + >> + return false; >> +} >> + >> /** >> * drm_sched_entity_init - Init a context entity used by scheduler when >> * submit to HW ring. >> @@ -62,7 +76,8 @@ int drm_sched_entity_init(struct drm_sched_entity *entity, >> unsigned int num_sched_list, >> atomic_t *guilty) >> { >> - if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0]))) >> + if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0])) || >> + bad_policies(sched_list, num_sched_list)) >> return -EINVAL; >> >> memset(entity, 0, sizeof(struct drm_sched_entity)); >> @@ -486,7 +501,7 @@ struct drm_sched_job *drm_sched_entity_pop_job(struct drm_sched_entity *entity) >> * Update the entity's location in the min heap according to >> * the timestamp of the next job, if any. 
>> */ >> - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) { >> + if (entity->rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) { >> struct drm_sched_job *next; >> >> next = to_drm_sched_job(spsc_queue_peek(&entity->job_queue)); >> @@ -558,7 +573,8 @@ void drm_sched_entity_select_rq(struct drm_sched_entity *entity) >> void drm_sched_entity_push_job(struct drm_sched_job *sched_job) >> { >> struct drm_sched_entity *entity = sched_job->entity; >> - bool first; >> + bool first, fifo = entity->rq->sched->sched_policy == >> + DRM_SCHED_POLICY_FIFO; >> ktime_t submit_ts; >> >> trace_drm_sched_job(sched_job, entity); >> @@ -587,7 +603,7 @@ void drm_sched_entity_push_job(struct drm_sched_job *sched_job) >> drm_sched_rq_add_entity(entity->rq, entity); >> spin_unlock(&entity->rq_lock); >> >> - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) >> + if (fifo) >> drm_sched_rq_update_fifo(entity, submit_ts); >> >> drm_sched_wakeup_if_can_queue(entity->rq->sched); >> diff --git a/drivers/gpu/drm/scheduler/sched_main.c b/drivers/gpu/drm/scheduler/sched_main.c >> index 614e8c97a622..545d5298c086 100644 >> --- a/drivers/gpu/drm/scheduler/sched_main.c >> +++ b/drivers/gpu/drm/scheduler/sched_main.c >> @@ -66,14 +66,14 @@ >> #define to_drm_sched_job(sched_job) \ >> container_of((sched_job), struct drm_sched_job, queue_node) >> >> -int drm_sched_policy = DRM_SCHED_POLICY_FIFO; >> +int default_drm_sched_policy = DRM_SCHED_POLICY_FIFO; >> >> /** >> * DOC: sched_policy (int) >> * Used to override default entities scheduling policy in a run queue. >> */ >> -MODULE_PARM_DESC(sched_policy, "Specify the scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); >> -module_param_named(sched_policy, drm_sched_policy, int, 0444); >> +MODULE_PARM_DESC(sched_policy, "Specify the default scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); > > Note, that you don't need to add "default" in the text as it is already there at the very end "FIFO (default)." > Else, it gets confusing what is meant by "default". Like this: > > Specify the default scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). > > See "default" appear twice and it creates confusion? We don't need our internal "default" play to get > exported all the way to the casual user reading this. It is much clear, however, > > Specify the scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). > > To mean, if unset, the default one would be used. But this is all internal code stuff. > > So I'd say leave this one alone. 
> >> +module_param_named(sched_policy, default_drm_sched_policy, int, 0444); > > Put "default" as a postfix: > default_drm_sched_policy --> drm_sched_policy_default > >> >> static __always_inline bool drm_sched_entity_compare_before(struct rb_node *a, >> const struct rb_node *b) >> @@ -177,7 +177,7 @@ void drm_sched_rq_remove_entity(struct drm_sched_rq *rq, >> if (rq->current_entity == entity) >> rq->current_entity = NULL; >> >> - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) >> + if (rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) >> drm_sched_rq_remove_fifo_locked(entity); >> >> spin_unlock(&rq->lock); >> @@ -898,7 +898,7 @@ drm_sched_select_entity(struct drm_gpu_scheduler *sched) >> >> /* Kernel run queue has higher priority than normal run queue*/ >> for (i = DRM_SCHED_PRIORITY_COUNT - 1; i >= DRM_SCHED_PRIORITY_MIN; i--) { >> - entity = drm_sched_policy == DRM_SCHED_POLICY_FIFO ? >> + entity = sched->sched_policy == DRM_SCHED_POLICY_FIFO ? >> drm_sched_rq_select_entity_fifo(&sched->sched_rq[i]) : >> drm_sched_rq_select_entity_rr(&sched->sched_rq[i]); >> if (entity) >> @@ -1071,6 +1071,7 @@ static void drm_sched_main(struct work_struct *w) >> * used >> * @score: optional score atomic shared with other schedulers >> * @name: name used for debugging >> + * @sched_policy: schedule policy >> * @dev: target &struct device >> * >> * Return 0 on success, otherwise error code. >> @@ -1080,9 +1081,15 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, >> struct workqueue_struct *submit_wq, >> unsigned hw_submission, unsigned hang_limit, >> long timeout, struct workqueue_struct *timeout_wq, >> - atomic_t *score, const char *name, struct device *dev) >> + atomic_t *score, const char *name, >> + enum drm_sched_policy sched_policy, >> + struct device *dev) >> { >> int i; >> + >> + if (sched_policy >= DRM_SCHED_POLICY_COUNT) >> + return -EINVAL; >> + >> sched->ops = ops; >> sched->hw_submission_limit = hw_submission; >> sched->name = name; >> @@ -1092,6 +1099,10 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, >> sched->hang_limit = hang_limit; >> sched->score = score ? score : &sched->_score; >> sched->dev = dev; >> + if (sched_policy == DRM_SCHED_POLICY_DEFAULT) >> + sched->sched_policy = default_drm_sched_policy; >> + else >> + sched->sched_policy = sched_policy; Note also that here you can use a ternary operator as opposed to an if-control. sched->sched_policy = sched_policy == DRM_SCHED_POLICY_UNSET ? drm_sched_policy_default : sched_policy;
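Putting the two suggestions above together (the "default" postfix rename and the ternary), the scheduler-core side would presumably end up looking something like this sketch:

	/* modparam storage, with "default" as a postfix per the review */
	int drm_sched_policy_default = DRM_SCHED_POLICY_FIFO;

	/* ... and in drm_sched_init(): resolve an unset policy to the default */
	sched->sched_policy = sched_policy == DRM_SCHED_POLICY_UNSET ?
			      drm_sched_policy_default : sched_policy;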
On Thu, Sep 14, 2023 at 12:23:35AM -0400, Luben Tuikov wrote: > On 2023-09-14 00:18, Luben Tuikov wrote: > > On 2023-09-11 22:16, Matthew Brost wrote: > >> Rather than a global modparam for scheduling policy, move the scheduling > >> policy to scheduler / entity so user can control each scheduler / entity > >> policy. > >> > >> v2: > >> - s/DRM_SCHED_POLICY_MAX/DRM_SCHED_POLICY_COUNT (Luben) > >> - Only include policy in scheduler (Luben) > >> > >> Signed-off-by: Matthew Brost <matthew.brost@intel.com> > >> --- > >> drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + > >> drivers/gpu/drm/etnaviv/etnaviv_sched.c | 3 ++- > >> drivers/gpu/drm/lima/lima_sched.c | 3 ++- > >> drivers/gpu/drm/msm/msm_ringbuffer.c | 3 ++- > >> drivers/gpu/drm/nouveau/nouveau_sched.c | 3 ++- > >> drivers/gpu/drm/panfrost/panfrost_job.c | 3 ++- > >> drivers/gpu/drm/scheduler/sched_entity.c | 24 ++++++++++++++++++---- > >> drivers/gpu/drm/scheduler/sched_main.c | 23 +++++++++++++++------ > >> drivers/gpu/drm/v3d/v3d_sched.c | 15 +++++++++----- > >> include/drm/gpu_scheduler.h | 20 ++++++++++++------ > >> 10 files changed, 72 insertions(+), 26 deletions(-) > >> > >> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > >> index c83a76bccc1d..ecb00991dd51 100644 > >> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > >> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > >> @@ -2309,6 +2309,7 @@ static int amdgpu_device_init_schedulers(struct amdgpu_device *adev) > >> ring->num_hw_submission, 0, > >> timeout, adev->reset_domain->wq, > >> ring->sched_score, ring->name, > >> + DRM_SCHED_POLICY_DEFAULT, > >> adev->dev); > >> if (r) { > >> DRM_ERROR("Failed to create scheduler on ring %s.\n", > >> diff --git a/drivers/gpu/drm/etnaviv/etnaviv_sched.c b/drivers/gpu/drm/etnaviv/etnaviv_sched.c > >> index 618a804ddc34..3646f995ca94 100644 > >> --- a/drivers/gpu/drm/etnaviv/etnaviv_sched.c > >> +++ b/drivers/gpu/drm/etnaviv/etnaviv_sched.c > >> @@ -137,7 +137,8 @@ int etnaviv_sched_init(struct etnaviv_gpu *gpu) > >> ret = drm_sched_init(&gpu->sched, &etnaviv_sched_ops, NULL, > >> etnaviv_hw_jobs_limit, etnaviv_job_hang_limit, > >> msecs_to_jiffies(500), NULL, NULL, > >> - dev_name(gpu->dev), gpu->dev); > >> + dev_name(gpu->dev), DRM_SCHED_POLICY_DEFAULT, > >> + gpu->dev); > >> if (ret) > >> return ret; > >> > >> diff --git a/drivers/gpu/drm/lima/lima_sched.c b/drivers/gpu/drm/lima/lima_sched.c > >> index 8d858aed0e56..465d4bf3882b 100644 > >> --- a/drivers/gpu/drm/lima/lima_sched.c > >> +++ b/drivers/gpu/drm/lima/lima_sched.c > >> @@ -491,7 +491,8 @@ int lima_sched_pipe_init(struct lima_sched_pipe *pipe, const char *name) > >> return drm_sched_init(&pipe->base, &lima_sched_ops, NULL, 1, > >> lima_job_hang_limit, > >> msecs_to_jiffies(timeout), NULL, > >> - NULL, name, pipe->ldev->dev); > >> + NULL, name, DRM_SCHED_POLICY_DEFAULT, > >> + pipe->ldev->dev); > >> } > >> > >> void lima_sched_pipe_fini(struct lima_sched_pipe *pipe) > >> diff --git a/drivers/gpu/drm/msm/msm_ringbuffer.c b/drivers/gpu/drm/msm/msm_ringbuffer.c > >> index b8865e61b40f..f45e674a0aaf 100644 > >> --- a/drivers/gpu/drm/msm/msm_ringbuffer.c > >> +++ b/drivers/gpu/drm/msm/msm_ringbuffer.c > >> @@ -96,7 +96,8 @@ struct msm_ringbuffer *msm_ringbuffer_new(struct msm_gpu *gpu, int id, > >> > >> ret = drm_sched_init(&ring->sched, &msm_sched_ops, NULL, > >> num_hw_submissions, 0, sched_timeout, > >> - NULL, NULL, to_msm_bo(ring->bo)->name, gpu->dev->dev); > >> + NULL, NULL, to_msm_bo(ring->bo)->name, > >> + 
DRM_SCHED_POLICY_DEFAULT, gpu->dev->dev); > >> if (ret) { > >> goto fail; > >> } > >> diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.c b/drivers/gpu/drm/nouveau/nouveau_sched.c > >> index d458c2227d4f..70e497e40c70 100644 > >> --- a/drivers/gpu/drm/nouveau/nouveau_sched.c > >> +++ b/drivers/gpu/drm/nouveau/nouveau_sched.c > >> @@ -431,7 +431,8 @@ int nouveau_sched_init(struct nouveau_drm *drm) > >> > >> return drm_sched_init(sched, &nouveau_sched_ops, NULL, > >> NOUVEAU_SCHED_HW_SUBMISSIONS, 0, job_hang_limit, > >> - NULL, NULL, "nouveau_sched", drm->dev->dev); > >> + NULL, NULL, "nouveau_sched", > >> + DRM_SCHED_POLICY_DEFAULT, drm->dev->dev); > >> } > >> > >> void nouveau_sched_fini(struct nouveau_drm *drm) > >> diff --git a/drivers/gpu/drm/panfrost/panfrost_job.c b/drivers/gpu/drm/panfrost/panfrost_job.c > >> index 326ca1ddf1d7..ad36bf3a4699 100644 > >> --- a/drivers/gpu/drm/panfrost/panfrost_job.c > >> +++ b/drivers/gpu/drm/panfrost/panfrost_job.c > >> @@ -835,7 +835,8 @@ int panfrost_job_init(struct panfrost_device *pfdev) > >> nentries, 0, > >> msecs_to_jiffies(JOB_TIMEOUT_MS), > >> pfdev->reset.wq, > >> - NULL, "pan_js", pfdev->dev); > >> + NULL, "pan_js", DRM_SCHED_POLICY_DEFAULT, > >> + pfdev->dev); > >> if (ret) { > >> dev_err(pfdev->dev, "Failed to create scheduler: %d.", ret); > >> goto err_sched; > >> diff --git a/drivers/gpu/drm/scheduler/sched_entity.c b/drivers/gpu/drm/scheduler/sched_entity.c > >> index a42763e1429d..65a972b52eda 100644 > >> --- a/drivers/gpu/drm/scheduler/sched_entity.c > >> +++ b/drivers/gpu/drm/scheduler/sched_entity.c > >> @@ -33,6 +33,20 @@ > >> #define to_drm_sched_job(sched_job) \ > >> container_of((sched_job), struct drm_sched_job, queue_node) > >> > >> +static bool bad_policies(struct drm_gpu_scheduler **sched_list, > >> + unsigned int num_sched_list) > > > > Rename the function to the status quo, > > drm_sched_policy_mismatch(... > > Will do. > >> +{ > >> + enum drm_sched_policy sched_policy = sched_list[0]->sched_policy; > >> + unsigned int i; > >> + > >> + /* All schedule policies must match */ > >> + for (i = 1; i < num_sched_list; ++i) > >> + if (sched_policy != sched_list[i]->sched_policy) > >> + return true; > >> + > >> + return false; > >> +} > >> + > >> /** > >> * drm_sched_entity_init - Init a context entity used by scheduler when > >> * submit to HW ring. > >> @@ -62,7 +76,8 @@ int drm_sched_entity_init(struct drm_sched_entity *entity, > >> unsigned int num_sched_list, > >> atomic_t *guilty) > >> { > >> - if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0]))) > >> + if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0])) || > >> + bad_policies(sched_list, num_sched_list)) > >> return -EINVAL; > >> > >> memset(entity, 0, sizeof(struct drm_sched_entity)); > >> @@ -486,7 +501,7 @@ struct drm_sched_job *drm_sched_entity_pop_job(struct drm_sched_entity *entity) > >> * Update the entity's location in the min heap according to > >> * the timestamp of the next job, if any. 
> >> */ > >> - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) { > >> + if (entity->rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) { > >> struct drm_sched_job *next; > >> > >> next = to_drm_sched_job(spsc_queue_peek(&entity->job_queue)); > >> @@ -558,7 +573,8 @@ void drm_sched_entity_select_rq(struct drm_sched_entity *entity) > >> void drm_sched_entity_push_job(struct drm_sched_job *sched_job) > >> { > >> struct drm_sched_entity *entity = sched_job->entity; > >> - bool first; > >> + bool first, fifo = entity->rq->sched->sched_policy == > >> + DRM_SCHED_POLICY_FIFO; > >> ktime_t submit_ts; > >> > >> trace_drm_sched_job(sched_job, entity); > >> @@ -587,7 +603,7 @@ void drm_sched_entity_push_job(struct drm_sched_job *sched_job) > >> drm_sched_rq_add_entity(entity->rq, entity); > >> spin_unlock(&entity->rq_lock); > >> > >> - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) > >> + if (fifo) > >> drm_sched_rq_update_fifo(entity, submit_ts); > >> > >> drm_sched_wakeup_if_can_queue(entity->rq->sched); > >> diff --git a/drivers/gpu/drm/scheduler/sched_main.c b/drivers/gpu/drm/scheduler/sched_main.c > >> index 614e8c97a622..545d5298c086 100644 > >> --- a/drivers/gpu/drm/scheduler/sched_main.c > >> +++ b/drivers/gpu/drm/scheduler/sched_main.c > >> @@ -66,14 +66,14 @@ > >> #define to_drm_sched_job(sched_job) \ > >> container_of((sched_job), struct drm_sched_job, queue_node) > >> > >> -int drm_sched_policy = DRM_SCHED_POLICY_FIFO; > >> +int default_drm_sched_policy = DRM_SCHED_POLICY_FIFO; > >> > >> /** > >> * DOC: sched_policy (int) > >> * Used to override default entities scheduling policy in a run queue. > >> */ > >> -MODULE_PARM_DESC(sched_policy, "Specify the scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); > >> -module_param_named(sched_policy, drm_sched_policy, int, 0444); > >> +MODULE_PARM_DESC(sched_policy, "Specify the default scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); > > > > Note, that you don't need to add "default" in the text as it is already there at the very end "FIFO (default)." > > Else, it gets confusing what is meant by "default". Like this: > > > > Specify the default scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). > > > > See "default" appear twice and it creates confusion? We don't need our internal "default" play to get > > exported all the way to the casual user reading this. It is much clear, however, > > > > Specify the scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). > > > > To mean, if unset, the default one would be used. But this is all internal code stuff. > > > > So I'd say leave this one alone. > > Ok. > >> +module_param_named(sched_policy, default_drm_sched_policy, int, 0444); > > > > Put "default" as a postfix: > > default_drm_sched_policy --> drm_sched_policy_default > > Sure. 
> >> > >> static __always_inline bool drm_sched_entity_compare_before(struct rb_node *a, > >> const struct rb_node *b) > >> @@ -177,7 +177,7 @@ void drm_sched_rq_remove_entity(struct drm_sched_rq *rq, > >> if (rq->current_entity == entity) > >> rq->current_entity = NULL; > >> > >> - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) > >> + if (rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) > >> drm_sched_rq_remove_fifo_locked(entity); > >> > >> spin_unlock(&rq->lock); > >> @@ -898,7 +898,7 @@ drm_sched_select_entity(struct drm_gpu_scheduler *sched) > >> > >> /* Kernel run queue has higher priority than normal run queue*/ > >> for (i = DRM_SCHED_PRIORITY_COUNT - 1; i >= DRM_SCHED_PRIORITY_MIN; i--) { > >> - entity = drm_sched_policy == DRM_SCHED_POLICY_FIFO ? > >> + entity = sched->sched_policy == DRM_SCHED_POLICY_FIFO ? > >> drm_sched_rq_select_entity_fifo(&sched->sched_rq[i]) : > >> drm_sched_rq_select_entity_rr(&sched->sched_rq[i]); > >> if (entity) > >> @@ -1071,6 +1071,7 @@ static void drm_sched_main(struct work_struct *w) > >> * used > >> * @score: optional score atomic shared with other schedulers > >> * @name: name used for debugging > >> + * @sched_policy: schedule policy > >> * @dev: target &struct device > >> * > >> * Return 0 on success, otherwise error code. > >> @@ -1080,9 +1081,15 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > >> struct workqueue_struct *submit_wq, > >> unsigned hw_submission, unsigned hang_limit, > >> long timeout, struct workqueue_struct *timeout_wq, > >> - atomic_t *score, const char *name, struct device *dev) > >> + atomic_t *score, const char *name, > >> + enum drm_sched_policy sched_policy, > >> + struct device *dev) > >> { > >> int i; > >> + > >> + if (sched_policy >= DRM_SCHED_POLICY_COUNT) > >> + return -EINVAL; > >> + > >> sched->ops = ops; > >> sched->hw_submission_limit = hw_submission; > >> sched->name = name; > >> @@ -1092,6 +1099,10 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > >> sched->hang_limit = hang_limit; > >> sched->score = score ? score : &sched->_score; > >> sched->dev = dev; > >> + if (sched_policy == DRM_SCHED_POLICY_DEFAULT) > >> + sched->sched_policy = default_drm_sched_policy; > >> + else > >> + sched->sched_policy = sched_policy; > > Note also that here you can use a ternary operator as opposed to an if-control. > > sched->sched_policy = sched_policy == DRM_SCHED_POLICY_UNSET ? > drm_sched_policy_default : sched_policy; Sure, will fix in next rev. Matt > > -- > Regards, > Luben >
On Thu, Sep 14, 2023 at 12:18:11AM -0400, Luben Tuikov wrote: > On 2023-09-11 22:16, Matthew Brost wrote: > > Rather than a global modparam for scheduling policy, move the scheduling > > policy to scheduler / entity so user can control each scheduler / entity > > policy. > > > > v2: > > - s/DRM_SCHED_POLICY_MAX/DRM_SCHED_POLICY_COUNT (Luben) > > - Only include policy in scheduler (Luben) > > > > Signed-off-by: Matthew Brost <matthew.brost@intel.com> > > --- > > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + > > drivers/gpu/drm/etnaviv/etnaviv_sched.c | 3 ++- > > drivers/gpu/drm/lima/lima_sched.c | 3 ++- > > drivers/gpu/drm/msm/msm_ringbuffer.c | 3 ++- > > drivers/gpu/drm/nouveau/nouveau_sched.c | 3 ++- > > drivers/gpu/drm/panfrost/panfrost_job.c | 3 ++- > > drivers/gpu/drm/scheduler/sched_entity.c | 24 ++++++++++++++++++---- > > drivers/gpu/drm/scheduler/sched_main.c | 23 +++++++++++++++------ > > drivers/gpu/drm/v3d/v3d_sched.c | 15 +++++++++----- > > include/drm/gpu_scheduler.h | 20 ++++++++++++------ > > 10 files changed, 72 insertions(+), 26 deletions(-) > > > > diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > > index c83a76bccc1d..ecb00991dd51 100644 > > --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > > +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c > > @@ -2309,6 +2309,7 @@ static int amdgpu_device_init_schedulers(struct amdgpu_device *adev) > > ring->num_hw_submission, 0, > > timeout, adev->reset_domain->wq, > > ring->sched_score, ring->name, > > + DRM_SCHED_POLICY_DEFAULT, > > adev->dev); > > if (r) { > > DRM_ERROR("Failed to create scheduler on ring %s.\n", > > diff --git a/drivers/gpu/drm/etnaviv/etnaviv_sched.c b/drivers/gpu/drm/etnaviv/etnaviv_sched.c > > index 618a804ddc34..3646f995ca94 100644 > > --- a/drivers/gpu/drm/etnaviv/etnaviv_sched.c > > +++ b/drivers/gpu/drm/etnaviv/etnaviv_sched.c > > @@ -137,7 +137,8 @@ int etnaviv_sched_init(struct etnaviv_gpu *gpu) > > ret = drm_sched_init(&gpu->sched, &etnaviv_sched_ops, NULL, > > etnaviv_hw_jobs_limit, etnaviv_job_hang_limit, > > msecs_to_jiffies(500), NULL, NULL, > > - dev_name(gpu->dev), gpu->dev); > > + dev_name(gpu->dev), DRM_SCHED_POLICY_DEFAULT, > > + gpu->dev); > > if (ret) > > return ret; > > > > diff --git a/drivers/gpu/drm/lima/lima_sched.c b/drivers/gpu/drm/lima/lima_sched.c > > index 8d858aed0e56..465d4bf3882b 100644 > > --- a/drivers/gpu/drm/lima/lima_sched.c > > +++ b/drivers/gpu/drm/lima/lima_sched.c > > @@ -491,7 +491,8 @@ int lima_sched_pipe_init(struct lima_sched_pipe *pipe, const char *name) > > return drm_sched_init(&pipe->base, &lima_sched_ops, NULL, 1, > > lima_job_hang_limit, > > msecs_to_jiffies(timeout), NULL, > > - NULL, name, pipe->ldev->dev); > > + NULL, name, DRM_SCHED_POLICY_DEFAULT, > > + pipe->ldev->dev); > > } > > > > void lima_sched_pipe_fini(struct lima_sched_pipe *pipe) > > diff --git a/drivers/gpu/drm/msm/msm_ringbuffer.c b/drivers/gpu/drm/msm/msm_ringbuffer.c > > index b8865e61b40f..f45e674a0aaf 100644 > > --- a/drivers/gpu/drm/msm/msm_ringbuffer.c > > +++ b/drivers/gpu/drm/msm/msm_ringbuffer.c > > @@ -96,7 +96,8 @@ struct msm_ringbuffer *msm_ringbuffer_new(struct msm_gpu *gpu, int id, > > > > ret = drm_sched_init(&ring->sched, &msm_sched_ops, NULL, > > num_hw_submissions, 0, sched_timeout, > > - NULL, NULL, to_msm_bo(ring->bo)->name, gpu->dev->dev); > > + NULL, NULL, to_msm_bo(ring->bo)->name, > > + DRM_SCHED_POLICY_DEFAULT, gpu->dev->dev); > > if (ret) { > > goto fail; > > } > > diff --git 
a/drivers/gpu/drm/nouveau/nouveau_sched.c b/drivers/gpu/drm/nouveau/nouveau_sched.c > > index d458c2227d4f..70e497e40c70 100644 > > --- a/drivers/gpu/drm/nouveau/nouveau_sched.c > > +++ b/drivers/gpu/drm/nouveau/nouveau_sched.c > > @@ -431,7 +431,8 @@ int nouveau_sched_init(struct nouveau_drm *drm) > > > > return drm_sched_init(sched, &nouveau_sched_ops, NULL, > > NOUVEAU_SCHED_HW_SUBMISSIONS, 0, job_hang_limit, > > - NULL, NULL, "nouveau_sched", drm->dev->dev); > > + NULL, NULL, "nouveau_sched", > > + DRM_SCHED_POLICY_DEFAULT, drm->dev->dev); > > } > > > > void nouveau_sched_fini(struct nouveau_drm *drm) > > diff --git a/drivers/gpu/drm/panfrost/panfrost_job.c b/drivers/gpu/drm/panfrost/panfrost_job.c > > index 326ca1ddf1d7..ad36bf3a4699 100644 > > --- a/drivers/gpu/drm/panfrost/panfrost_job.c > > +++ b/drivers/gpu/drm/panfrost/panfrost_job.c > > @@ -835,7 +835,8 @@ int panfrost_job_init(struct panfrost_device *pfdev) > > nentries, 0, > > msecs_to_jiffies(JOB_TIMEOUT_MS), > > pfdev->reset.wq, > > - NULL, "pan_js", pfdev->dev); > > + NULL, "pan_js", DRM_SCHED_POLICY_DEFAULT, > > + pfdev->dev); > > if (ret) { > > dev_err(pfdev->dev, "Failed to create scheduler: %d.", ret); > > goto err_sched; > > diff --git a/drivers/gpu/drm/scheduler/sched_entity.c b/drivers/gpu/drm/scheduler/sched_entity.c > > index a42763e1429d..65a972b52eda 100644 > > --- a/drivers/gpu/drm/scheduler/sched_entity.c > > +++ b/drivers/gpu/drm/scheduler/sched_entity.c > > @@ -33,6 +33,20 @@ > > #define to_drm_sched_job(sched_job) \ > > container_of((sched_job), struct drm_sched_job, queue_node) > > > > +static bool bad_policies(struct drm_gpu_scheduler **sched_list, > > + unsigned int num_sched_list) > > Rename the function to the status quo, > drm_sched_policy_mismatch(... > > > +{ > > + enum drm_sched_policy sched_policy = sched_list[0]->sched_policy; > > + unsigned int i; > > + > > + /* All schedule policies must match */ > > + for (i = 1; i < num_sched_list; ++i) > > + if (sched_policy != sched_list[i]->sched_policy) > > + return true; > > + > > + return false; > > +} > > + > > /** > > * drm_sched_entity_init - Init a context entity used by scheduler when > > * submit to HW ring. > > @@ -62,7 +76,8 @@ int drm_sched_entity_init(struct drm_sched_entity *entity, > > unsigned int num_sched_list, > > atomic_t *guilty) > > { > > - if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0]))) > > + if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0])) || > > + bad_policies(sched_list, num_sched_list)) > > return -EINVAL; > > > > memset(entity, 0, sizeof(struct drm_sched_entity)); > > @@ -486,7 +501,7 @@ struct drm_sched_job *drm_sched_entity_pop_job(struct drm_sched_entity *entity) > > * Update the entity's location in the min heap according to > > * the timestamp of the next job, if any. 
> > */ > > - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) { > > + if (entity->rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) { > > struct drm_sched_job *next; > > > > next = to_drm_sched_job(spsc_queue_peek(&entity->job_queue)); > > @@ -558,7 +573,8 @@ void drm_sched_entity_select_rq(struct drm_sched_entity *entity) > > void drm_sched_entity_push_job(struct drm_sched_job *sched_job) > > { > > struct drm_sched_entity *entity = sched_job->entity; > > - bool first; > > + bool first, fifo = entity->rq->sched->sched_policy == > > + DRM_SCHED_POLICY_FIFO; > > ktime_t submit_ts; > > > > trace_drm_sched_job(sched_job, entity); > > @@ -587,7 +603,7 @@ void drm_sched_entity_push_job(struct drm_sched_job *sched_job) > > drm_sched_rq_add_entity(entity->rq, entity); > > spin_unlock(&entity->rq_lock); > > > > - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) > > + if (fifo) > > drm_sched_rq_update_fifo(entity, submit_ts); > > > > drm_sched_wakeup_if_can_queue(entity->rq->sched); > > diff --git a/drivers/gpu/drm/scheduler/sched_main.c b/drivers/gpu/drm/scheduler/sched_main.c > > index 614e8c97a622..545d5298c086 100644 > > --- a/drivers/gpu/drm/scheduler/sched_main.c > > +++ b/drivers/gpu/drm/scheduler/sched_main.c > > @@ -66,14 +66,14 @@ > > #define to_drm_sched_job(sched_job) \ > > container_of((sched_job), struct drm_sched_job, queue_node) > > > > -int drm_sched_policy = DRM_SCHED_POLICY_FIFO; > > +int default_drm_sched_policy = DRM_SCHED_POLICY_FIFO; > > > > /** > > * DOC: sched_policy (int) > > * Used to override default entities scheduling policy in a run queue. > > */ > > -MODULE_PARM_DESC(sched_policy, "Specify the scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); > > -module_param_named(sched_policy, drm_sched_policy, int, 0444); > > +MODULE_PARM_DESC(sched_policy, "Specify the default scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); > > Note, that you don't need to add "default" in the text as it is already there at the very end "FIFO (default)." > Else, it gets confusing what is meant by "default". Like this: > > Specify the default scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). > > See "default" appear twice and it creates confusion? We don't need our internal "default" play to get > exported all the way to the casual user reading this. It is much clear, however, > > Specify the scheduling policy for entities on a run-queue, 1 = Round Robin, 2 = FIFO (default). > > To mean, if unset, the default one would be used. But this is all internal code stuff. > > So I'd say leave this one alone. 
> > > +module_param_named(sched_policy, default_drm_sched_policy, int, 0444); > > Put "default" as a postfix: > default_drm_sched_policy --> drm_sched_policy_default > > > > > static __always_inline bool drm_sched_entity_compare_before(struct rb_node *a, > > const struct rb_node *b) > > @@ -177,7 +177,7 @@ void drm_sched_rq_remove_entity(struct drm_sched_rq *rq, > > if (rq->current_entity == entity) > > rq->current_entity = NULL; > > > > - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) > > + if (rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) > > drm_sched_rq_remove_fifo_locked(entity); > > > > spin_unlock(&rq->lock); > > @@ -898,7 +898,7 @@ drm_sched_select_entity(struct drm_gpu_scheduler *sched) > > > > /* Kernel run queue has higher priority than normal run queue*/ > > for (i = DRM_SCHED_PRIORITY_COUNT - 1; i >= DRM_SCHED_PRIORITY_MIN; i--) { > > - entity = drm_sched_policy == DRM_SCHED_POLICY_FIFO ? > > + entity = sched->sched_policy == DRM_SCHED_POLICY_FIFO ? > > drm_sched_rq_select_entity_fifo(&sched->sched_rq[i]) : > > drm_sched_rq_select_entity_rr(&sched->sched_rq[i]); > > if (entity) > > @@ -1071,6 +1071,7 @@ static void drm_sched_main(struct work_struct *w) > > * used > > * @score: optional score atomic shared with other schedulers > > * @name: name used for debugging > > + * @sched_policy: schedule policy > > * @dev: target &struct device > > * > > * Return 0 on success, otherwise error code. > > @@ -1080,9 +1081,15 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > > struct workqueue_struct *submit_wq, > > unsigned hw_submission, unsigned hang_limit, > > long timeout, struct workqueue_struct *timeout_wq, > > - atomic_t *score, const char *name, struct device *dev) > > + atomic_t *score, const char *name, > > + enum drm_sched_policy sched_policy, > > + struct device *dev) > > { > > int i; > > + > > + if (sched_policy >= DRM_SCHED_POLICY_COUNT) > > + return -EINVAL; > > + > > sched->ops = ops; > > sched->hw_submission_limit = hw_submission; > > sched->name = name; > > @@ -1092,6 +1099,10 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > > sched->hang_limit = hang_limit; > > sched->score = score ? 
score : &sched->_score; > > sched->dev = dev; > > + if (sched_policy == DRM_SCHED_POLICY_DEFAULT) > > + sched->sched_policy = default_drm_sched_policy; > > + else > > + sched->sched_policy = sched_policy; > > for (i = DRM_SCHED_PRIORITY_MIN; i < DRM_SCHED_PRIORITY_COUNT; i++) > > drm_sched_rq_init(sched, &sched->sched_rq[i]); > > > > diff --git a/drivers/gpu/drm/v3d/v3d_sched.c b/drivers/gpu/drm/v3d/v3d_sched.c > > index 38e092ea41e6..5e3fe77fa991 100644 > > --- a/drivers/gpu/drm/v3d/v3d_sched.c > > +++ b/drivers/gpu/drm/v3d/v3d_sched.c > > @@ -391,7 +391,8 @@ v3d_sched_init(struct v3d_dev *v3d) > > &v3d_bin_sched_ops, NULL, > > hw_jobs_limit, job_hang_limit, > > msecs_to_jiffies(hang_limit_ms), NULL, > > - NULL, "v3d_bin", v3d->drm.dev); > > + NULL, "v3d_bin", DRM_SCHED_POLICY_DEFAULT, > > + v3d->drm.dev); > > if (ret) > > return ret; > > > > @@ -399,7 +400,8 @@ v3d_sched_init(struct v3d_dev *v3d) > > &v3d_render_sched_ops, NULL, > > hw_jobs_limit, job_hang_limit, > > msecs_to_jiffies(hang_limit_ms), NULL, > > - NULL, "v3d_render", v3d->drm.dev); > > + ULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT, > > + v3d->drm.dev); > > if (ret) > > goto fail; > > > > @@ -407,7 +409,8 @@ v3d_sched_init(struct v3d_dev *v3d) > > &v3d_tfu_sched_ops, NULL, > > hw_jobs_limit, job_hang_limit, > > msecs_to_jiffies(hang_limit_ms), NULL, > > - NULL, "v3d_tfu", v3d->drm.dev); > > + NULL, "v3d_tfu", DRM_SCHED_POLICY_DEFAULT, > > + v3d->drm.dev); > > if (ret) > > goto fail; > > > > @@ -416,7 +419,8 @@ v3d_sched_init(struct v3d_dev *v3d) > > &v3d_csd_sched_ops, NULL, > > hw_jobs_limit, job_hang_limit, > > msecs_to_jiffies(hang_limit_ms), NULL, > > - NULL, "v3d_csd", v3d->drm.dev); > > + NULL, "v3d_csd", DRM_SCHED_POLICY_DEFAULT, > > + v3d->drm.dev); > > if (ret) > > goto fail; > > > > @@ -424,7 +428,8 @@ v3d_sched_init(struct v3d_dev *v3d) > > &v3d_cache_clean_sched_ops, NULL, > > hw_jobs_limit, job_hang_limit, > > msecs_to_jiffies(hang_limit_ms), NULL, > > - NULL, "v3d_cache_clean", v3d->drm.dev); > > + NULL, "v3d_cache_clean", > > + DRM_SCHED_POLICY_DEFAULT, v3d->drm.dev); > > if (ret) > > goto fail; > > } > > diff --git a/include/drm/gpu_scheduler.h b/include/drm/gpu_scheduler.h > > index 278106e358a8..897d52a4ff4f 100644 > > --- a/include/drm/gpu_scheduler.h > > +++ b/include/drm/gpu_scheduler.h > > @@ -72,11 +72,15 @@ enum drm_sched_priority { > > DRM_SCHED_PRIORITY_UNSET = -2 > > }; > > > > -/* Used to chose between FIFO and RR jobs scheduling */ > > -extern int drm_sched_policy; > > - > > -#define DRM_SCHED_POLICY_RR 0 > > -#define DRM_SCHED_POLICY_FIFO 1 > > +/* Used to chose default scheduling policy*/ > > +extern int default_drm_sched_policy; > > + > > +enum drm_sched_policy { > > + DRM_SCHED_POLICY_DEFAULT, > > + DRM_SCHED_POLICY_RR, > > + DRM_SCHED_POLICY_FIFO, > > + DRM_SCHED_POLICY_COUNT, > > +}; > > No. Use as the first (0th) element name "DRM_SCHED_POLICY_UNSET". > The DRM scheduling policies are, > * unset, meaning no preference, whatever the default is, (but that's NOT the "default"), > * Round-Robin, and > * FIFO. > "Default" is a _result_ of the policy being _unset_. "Default" is not a policy. > IOW, we want to say, > "If you don't set the policy (i.e. it's unset), we'll set it to the default one, > which could be either Round-Robin, or FIFO." > > It may look a bit strange in function calls up there, "What do you mean `unset'? What is it?" 
> but it needs to be understood that the _policy_ is "unset", "rr", or "fifo", and if it is "unset", > we'll set it to whatever the default one was set to, at boot/compile time, RR or FIFO. > > Note that "unset" is equivalent to a function not having the policy parameter altogether (as right now). > Now that you're adding it, you can extend that, as opposed to renaming the enum > to "DEFAULT" to tell the caller that it will be set to the default one. But we don't need > to tell function behaviour in the name of a function parameter/enum element. > s/DRM_SCHED_POLICY_DEFAULT/DRM_SCHED_POLICY_UNSET/ it is. Matt > > > > /** > > * struct drm_sched_entity - A wrapper around a job queue (typically > > @@ -489,6 +493,7 @@ struct drm_sched_backend_ops { > > * guilty and it will no longer be considered for scheduling. > > * @score: score to help loadbalancer pick a idle sched > > * @_score: score used when the driver doesn't provide one > > + * @sched_policy: Schedule policy for scheduler > > * @ready: marks if the underlying HW is ready to work > > * @free_guilty: A hit to time out handler to free the guilty job. > > * @pause_submit: pause queuing of @work_submit on @submit_wq > > @@ -514,6 +519,7 @@ struct drm_gpu_scheduler { > > int hang_limit; > > atomic_t *score; > > atomic_t _score; > > + enum drm_sched_policy sched_policy; > > bool ready; > > bool free_guilty; > > bool pause_submit; > > @@ -525,7 +531,9 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, > > struct workqueue_struct *submit_wq, > > uint32_t hw_submission, unsigned hang_limit, > > long timeout, struct workqueue_struct *timeout_wq, > > - atomic_t *score, const char *name, struct device *dev); > > + atomic_t *score, const char *name, > > + enum drm_sched_policy sched_policy, > > + struct device *dev); > > > > void drm_sched_fini(struct drm_gpu_scheduler *sched); > > int drm_sched_job_init(struct drm_sched_job *job, > > -- > Regards, > Luben >
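For illustration, a minimal sketch of how the v3 hunks would read with the suggested s/DEFAULT/UNSET/ rename applied; apart from the name of the 0th enumerator, everything here is taken from the patch as posted:

    enum drm_sched_policy {
            DRM_SCHED_POLICY_UNSET,    /* no preference, fall back to the default */
            DRM_SCHED_POLICY_RR,
            DRM_SCHED_POLICY_FIFO,
            DRM_SCHED_POLICY_COUNT,
    };

and in drm_sched_init() the unset policy resolves to the module-wide default:

    if (sched_policy == DRM_SCHED_POLICY_UNSET)
            sched->sched_policy = default_drm_sched_policy;
    else
            sched->sched_policy = sched_policy;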
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c index c83a76bccc1d..ecb00991dd51 100644 --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c @@ -2309,6 +2309,7 @@ static int amdgpu_device_init_schedulers(struct amdgpu_device *adev) ring->num_hw_submission, 0, timeout, adev->reset_domain->wq, ring->sched_score, ring->name, + DRM_SCHED_POLICY_DEFAULT, adev->dev); if (r) { DRM_ERROR("Failed to create scheduler on ring %s.\n", diff --git a/drivers/gpu/drm/etnaviv/etnaviv_sched.c b/drivers/gpu/drm/etnaviv/etnaviv_sched.c index 618a804ddc34..3646f995ca94 100644 --- a/drivers/gpu/drm/etnaviv/etnaviv_sched.c +++ b/drivers/gpu/drm/etnaviv/etnaviv_sched.c @@ -137,7 +137,8 @@ int etnaviv_sched_init(struct etnaviv_gpu *gpu) ret = drm_sched_init(&gpu->sched, &etnaviv_sched_ops, NULL, etnaviv_hw_jobs_limit, etnaviv_job_hang_limit, msecs_to_jiffies(500), NULL, NULL, - dev_name(gpu->dev), gpu->dev); + dev_name(gpu->dev), DRM_SCHED_POLICY_DEFAULT, + gpu->dev); if (ret) return ret; diff --git a/drivers/gpu/drm/lima/lima_sched.c b/drivers/gpu/drm/lima/lima_sched.c index 8d858aed0e56..465d4bf3882b 100644 --- a/drivers/gpu/drm/lima/lima_sched.c +++ b/drivers/gpu/drm/lima/lima_sched.c @@ -491,7 +491,8 @@ int lima_sched_pipe_init(struct lima_sched_pipe *pipe, const char *name) return drm_sched_init(&pipe->base, &lima_sched_ops, NULL, 1, lima_job_hang_limit, msecs_to_jiffies(timeout), NULL, - NULL, name, pipe->ldev->dev); + NULL, name, DRM_SCHED_POLICY_DEFAULT, + pipe->ldev->dev); } void lima_sched_pipe_fini(struct lima_sched_pipe *pipe) diff --git a/drivers/gpu/drm/msm/msm_ringbuffer.c b/drivers/gpu/drm/msm/msm_ringbuffer.c index b8865e61b40f..f45e674a0aaf 100644 --- a/drivers/gpu/drm/msm/msm_ringbuffer.c +++ b/drivers/gpu/drm/msm/msm_ringbuffer.c @@ -96,7 +96,8 @@ struct msm_ringbuffer *msm_ringbuffer_new(struct msm_gpu *gpu, int id, ret = drm_sched_init(&ring->sched, &msm_sched_ops, NULL, num_hw_submissions, 0, sched_timeout, - NULL, NULL, to_msm_bo(ring->bo)->name, gpu->dev->dev); + NULL, NULL, to_msm_bo(ring->bo)->name, + DRM_SCHED_POLICY_DEFAULT, gpu->dev->dev); if (ret) { goto fail; } diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.c b/drivers/gpu/drm/nouveau/nouveau_sched.c index d458c2227d4f..70e497e40c70 100644 --- a/drivers/gpu/drm/nouveau/nouveau_sched.c +++ b/drivers/gpu/drm/nouveau/nouveau_sched.c @@ -431,7 +431,8 @@ int nouveau_sched_init(struct nouveau_drm *drm) return drm_sched_init(sched, &nouveau_sched_ops, NULL, NOUVEAU_SCHED_HW_SUBMISSIONS, 0, job_hang_limit, - NULL, NULL, "nouveau_sched", drm->dev->dev); + NULL, NULL, "nouveau_sched", + DRM_SCHED_POLICY_DEFAULT, drm->dev->dev); } void nouveau_sched_fini(struct nouveau_drm *drm) diff --git a/drivers/gpu/drm/panfrost/panfrost_job.c b/drivers/gpu/drm/panfrost/panfrost_job.c index 326ca1ddf1d7..ad36bf3a4699 100644 --- a/drivers/gpu/drm/panfrost/panfrost_job.c +++ b/drivers/gpu/drm/panfrost/panfrost_job.c @@ -835,7 +835,8 @@ int panfrost_job_init(struct panfrost_device *pfdev) nentries, 0, msecs_to_jiffies(JOB_TIMEOUT_MS), pfdev->reset.wq, - NULL, "pan_js", pfdev->dev); + NULL, "pan_js", DRM_SCHED_POLICY_DEFAULT, + pfdev->dev); if (ret) { dev_err(pfdev->dev, "Failed to create scheduler: %d.", ret); goto err_sched; diff --git a/drivers/gpu/drm/scheduler/sched_entity.c b/drivers/gpu/drm/scheduler/sched_entity.c index a42763e1429d..65a972b52eda 100644 --- a/drivers/gpu/drm/scheduler/sched_entity.c +++ b/drivers/gpu/drm/scheduler/sched_entity.c @@ 
-33,6 +33,20 @@ #define to_drm_sched_job(sched_job) \ container_of((sched_job), struct drm_sched_job, queue_node) +static bool bad_policies(struct drm_gpu_scheduler **sched_list, + unsigned int num_sched_list) +{ + enum drm_sched_policy sched_policy = sched_list[0]->sched_policy; + unsigned int i; + + /* All schedule policies must match */ + for (i = 1; i < num_sched_list; ++i) + if (sched_policy != sched_list[i]->sched_policy) + return true; + + return false; +} + /** * drm_sched_entity_init - Init a context entity used by scheduler when * submit to HW ring. @@ -62,7 +76,8 @@ int drm_sched_entity_init(struct drm_sched_entity *entity, unsigned int num_sched_list, atomic_t *guilty) { - if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0]))) + if (!(entity && sched_list && (num_sched_list == 0 || sched_list[0])) || + bad_policies(sched_list, num_sched_list)) return -EINVAL; memset(entity, 0, sizeof(struct drm_sched_entity)); @@ -486,7 +501,7 @@ struct drm_sched_job *drm_sched_entity_pop_job(struct drm_sched_entity *entity) * Update the entity's location in the min heap according to * the timestamp of the next job, if any. */ - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) { + if (entity->rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) { struct drm_sched_job *next; next = to_drm_sched_job(spsc_queue_peek(&entity->job_queue)); @@ -558,7 +573,8 @@ void drm_sched_entity_select_rq(struct drm_sched_entity *entity) void drm_sched_entity_push_job(struct drm_sched_job *sched_job) { struct drm_sched_entity *entity = sched_job->entity; - bool first; + bool first, fifo = entity->rq->sched->sched_policy == + DRM_SCHED_POLICY_FIFO; ktime_t submit_ts; trace_drm_sched_job(sched_job, entity); @@ -587,7 +603,7 @@ void drm_sched_entity_push_job(struct drm_sched_job *sched_job) drm_sched_rq_add_entity(entity->rq, entity); spin_unlock(&entity->rq_lock); - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) + if (fifo) drm_sched_rq_update_fifo(entity, submit_ts); drm_sched_wakeup_if_can_queue(entity->rq->sched); diff --git a/drivers/gpu/drm/scheduler/sched_main.c b/drivers/gpu/drm/scheduler/sched_main.c index 614e8c97a622..545d5298c086 100644 --- a/drivers/gpu/drm/scheduler/sched_main.c +++ b/drivers/gpu/drm/scheduler/sched_main.c @@ -66,14 +66,14 @@ #define to_drm_sched_job(sched_job) \ container_of((sched_job), struct drm_sched_job, queue_node) -int drm_sched_policy = DRM_SCHED_POLICY_FIFO; +int default_drm_sched_policy = DRM_SCHED_POLICY_FIFO; /** * DOC: sched_policy (int) * Used to override default entities scheduling policy in a run queue. 
*/ -MODULE_PARM_DESC(sched_policy, "Specify the scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); -module_param_named(sched_policy, drm_sched_policy, int, 0444); +MODULE_PARM_DESC(sched_policy, "Specify the default scheduling policy for entities on a run-queue, " __stringify(DRM_SCHED_POLICY_RR) " = Round Robin, " __stringify(DRM_SCHED_POLICY_FIFO) " = FIFO (default)."); +module_param_named(sched_policy, default_drm_sched_policy, int, 0444); static __always_inline bool drm_sched_entity_compare_before(struct rb_node *a, const struct rb_node *b) @@ -177,7 +177,7 @@ void drm_sched_rq_remove_entity(struct drm_sched_rq *rq, if (rq->current_entity == entity) rq->current_entity = NULL; - if (drm_sched_policy == DRM_SCHED_POLICY_FIFO) + if (rq->sched->sched_policy == DRM_SCHED_POLICY_FIFO) drm_sched_rq_remove_fifo_locked(entity); spin_unlock(&rq->lock); @@ -898,7 +898,7 @@ drm_sched_select_entity(struct drm_gpu_scheduler *sched) /* Kernel run queue has higher priority than normal run queue*/ for (i = DRM_SCHED_PRIORITY_COUNT - 1; i >= DRM_SCHED_PRIORITY_MIN; i--) { - entity = drm_sched_policy == DRM_SCHED_POLICY_FIFO ? + entity = sched->sched_policy == DRM_SCHED_POLICY_FIFO ? drm_sched_rq_select_entity_fifo(&sched->sched_rq[i]) : drm_sched_rq_select_entity_rr(&sched->sched_rq[i]); if (entity) @@ -1071,6 +1071,7 @@ static void drm_sched_main(struct work_struct *w) * used * @score: optional score atomic shared with other schedulers * @name: name used for debugging + * @sched_policy: schedule policy * @dev: target &struct device * * Return 0 on success, otherwise error code. @@ -1080,9 +1081,15 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, struct workqueue_struct *submit_wq, unsigned hw_submission, unsigned hang_limit, long timeout, struct workqueue_struct *timeout_wq, - atomic_t *score, const char *name, struct device *dev) + atomic_t *score, const char *name, + enum drm_sched_policy sched_policy, + struct device *dev) { int i; + + if (sched_policy >= DRM_SCHED_POLICY_COUNT) + return -EINVAL; + sched->ops = ops; sched->hw_submission_limit = hw_submission; sched->name = name; @@ -1092,6 +1099,10 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, sched->hang_limit = hang_limit; sched->score = score ? 
score : &sched->_score; sched->dev = dev; + if (sched_policy == DRM_SCHED_POLICY_DEFAULT) + sched->sched_policy = default_drm_sched_policy; + else + sched->sched_policy = sched_policy; for (i = DRM_SCHED_PRIORITY_MIN; i < DRM_SCHED_PRIORITY_COUNT; i++) drm_sched_rq_init(sched, &sched->sched_rq[i]); diff --git a/drivers/gpu/drm/v3d/v3d_sched.c b/drivers/gpu/drm/v3d/v3d_sched.c index 38e092ea41e6..5e3fe77fa991 100644 --- a/drivers/gpu/drm/v3d/v3d_sched.c +++ b/drivers/gpu/drm/v3d/v3d_sched.c @@ -391,7 +391,8 @@ v3d_sched_init(struct v3d_dev *v3d) &v3d_bin_sched_ops, NULL, hw_jobs_limit, job_hang_limit, msecs_to_jiffies(hang_limit_ms), NULL, - NULL, "v3d_bin", v3d->drm.dev); + NULL, "v3d_bin", DRM_SCHED_POLICY_DEFAULT, + v3d->drm.dev); if (ret) return ret; @@ -399,7 +400,8 @@ v3d_sched_init(struct v3d_dev *v3d) &v3d_render_sched_ops, NULL, hw_jobs_limit, job_hang_limit, msecs_to_jiffies(hang_limit_ms), NULL, - NULL, "v3d_render", v3d->drm.dev); + ULL, "v3d_render", DRM_SCHED_POLICY_DEFAULT, + v3d->drm.dev); if (ret) goto fail; @@ -407,7 +409,8 @@ v3d_sched_init(struct v3d_dev *v3d) &v3d_tfu_sched_ops, NULL, hw_jobs_limit, job_hang_limit, msecs_to_jiffies(hang_limit_ms), NULL, - NULL, "v3d_tfu", v3d->drm.dev); + NULL, "v3d_tfu", DRM_SCHED_POLICY_DEFAULT, + v3d->drm.dev); if (ret) goto fail; @@ -416,7 +419,8 @@ v3d_sched_init(struct v3d_dev *v3d) &v3d_csd_sched_ops, NULL, hw_jobs_limit, job_hang_limit, msecs_to_jiffies(hang_limit_ms), NULL, - NULL, "v3d_csd", v3d->drm.dev); + NULL, "v3d_csd", DRM_SCHED_POLICY_DEFAULT, + v3d->drm.dev); if (ret) goto fail; @@ -424,7 +428,8 @@ v3d_sched_init(struct v3d_dev *v3d) &v3d_cache_clean_sched_ops, NULL, hw_jobs_limit, job_hang_limit, msecs_to_jiffies(hang_limit_ms), NULL, - NULL, "v3d_cache_clean", v3d->drm.dev); + NULL, "v3d_cache_clean", + DRM_SCHED_POLICY_DEFAULT, v3d->drm.dev); if (ret) goto fail; } diff --git a/include/drm/gpu_scheduler.h b/include/drm/gpu_scheduler.h index 278106e358a8..897d52a4ff4f 100644 --- a/include/drm/gpu_scheduler.h +++ b/include/drm/gpu_scheduler.h @@ -72,11 +72,15 @@ enum drm_sched_priority { DRM_SCHED_PRIORITY_UNSET = -2 }; -/* Used to chose between FIFO and RR jobs scheduling */ -extern int drm_sched_policy; - -#define DRM_SCHED_POLICY_RR 0 -#define DRM_SCHED_POLICY_FIFO 1 +/* Used to chose default scheduling policy*/ +extern int default_drm_sched_policy; + +enum drm_sched_policy { + DRM_SCHED_POLICY_DEFAULT, + DRM_SCHED_POLICY_RR, + DRM_SCHED_POLICY_FIFO, + DRM_SCHED_POLICY_COUNT, +}; /** * struct drm_sched_entity - A wrapper around a job queue (typically @@ -489,6 +493,7 @@ struct drm_sched_backend_ops { * guilty and it will no longer be considered for scheduling. * @score: score to help loadbalancer pick a idle sched * @_score: score used when the driver doesn't provide one + * @sched_policy: Schedule policy for scheduler * @ready: marks if the underlying HW is ready to work * @free_guilty: A hit to time out handler to free the guilty job. 
* @pause_submit: pause queuing of @work_submit on @submit_wq @@ -514,6 +519,7 @@ struct drm_gpu_scheduler { int hang_limit; atomic_t *score; atomic_t _score; + enum drm_sched_policy sched_policy; bool ready; bool free_guilty; bool pause_submit; @@ -525,7 +531,9 @@ int drm_sched_init(struct drm_gpu_scheduler *sched, struct workqueue_struct *submit_wq, uint32_t hw_submission, unsigned hang_limit, long timeout, struct workqueue_struct *timeout_wq, - atomic_t *score, const char *name, struct device *dev); + atomic_t *score, const char *name, + enum drm_sched_policy sched_policy, + struct device *dev); void drm_sched_fini(struct drm_gpu_scheduler *sched); int drm_sched_job_init(struct drm_sched_job *job,
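As a usage illustration of the widened drm_sched_init() signature above (the ring, ops, and limit names are hypothetical, not from any driver in this series), a driver that wants a fixed policy for one queue would pass it explicitly instead of DRM_SCHED_POLICY_DEFAULT:

    /* Force FIFO for this queue regardless of the modparam default. */
    ret = drm_sched_init(&ring->sched, &my_sched_ops, NULL,
                         hw_submission, hang_limit,
                         msecs_to_jiffies(timeout_ms), NULL,
                         NULL, "my_ring", DRM_SCHED_POLICY_FIFO,
                         dev);
    if (ret)
            return ret;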
Rather than a global modparam for scheduling policy, move the scheduling
policy to scheduler / entity so user can control each scheduler / entity
policy.

v2:
  - s/DRM_SCHED_POLICY_MAX/DRM_SCHED_POLICY_COUNT (Luben)
  - Only include policy in scheduler (Luben)

Signed-off-by: Matthew Brost <matthew.brost@intel.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_device.c |  1 +
 drivers/gpu/drm/etnaviv/etnaviv_sched.c    |  3 ++-
 drivers/gpu/drm/lima/lima_sched.c          |  3 ++-
 drivers/gpu/drm/msm/msm_ringbuffer.c       |  3 ++-
 drivers/gpu/drm/nouveau/nouveau_sched.c    |  3 ++-
 drivers/gpu/drm/panfrost/panfrost_job.c    |  3 ++-
 drivers/gpu/drm/scheduler/sched_entity.c   | 24 ++++++++++++++++++----
 drivers/gpu/drm/scheduler/sched_main.c     | 23 +++++++++++++++------
 drivers/gpu/drm/v3d/v3d_sched.c            | 15 +++++++++-----
 include/drm/gpu_scheduler.h                | 20 ++++++++++++------
 10 files changed, 72 insertions(+), 26 deletions(-)
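A related caveat that follows from the bad_policies() check in sched_entity.c: every scheduler handed to a single entity must have been initialized with the same policy. A rough sketch, with hypothetical scheduler/entity names (the priority argument is the existing drm_sched_entity_init() parameter, not part of this patch):

    struct drm_gpu_scheduler *sched_list[] = { &fifo_sched, &rr_sched };

    /* If fifo_sched was created with DRM_SCHED_POLICY_FIFO and rr_sched
     * with DRM_SCHED_POLICY_RR, the policies differ, so the entity init
     * is rejected with -EINVAL.
     */
    ret = drm_sched_entity_init(&entity, DRM_SCHED_PRIORITY_NORMAL,
                                sched_list, ARRAY_SIZE(sched_list), NULL);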