From patchwork Thu Mar 10 15:27:00 2016 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Peter Antoine X-Patchwork-Id: 8557851 Return-Path: X-Original-To: patchwork-intel-gfx@patchwork.kernel.org Delivered-To: patchwork-parsemail@patchwork2.web.kernel.org Received: from mail.kernel.org (mail.kernel.org [198.145.29.136]) by patchwork2.web.kernel.org (Postfix) with ESMTP id 36BE0C0553 for ; Thu, 10 Mar 2016 15:27:14 +0000 (UTC) Received: from mail.kernel.org (localhost [127.0.0.1]) by mail.kernel.org (Postfix) with ESMTP id 21E7B20211 for ; Thu, 10 Mar 2016 15:27:13 +0000 (UTC) Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) by mail.kernel.org (Postfix) with ESMTP id DE5DE201ED for ; Thu, 10 Mar 2016 15:27:10 +0000 (UTC) Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id A36406E9A3; Thu, 10 Mar 2016 15:27:09 +0000 (UTC) X-Original-To: intel-gfx@lists.freedesktop.org Delivered-To: intel-gfx@lists.freedesktop.org Received: from mga09.intel.com (mga09.intel.com [134.134.136.24]) by gabe.freedesktop.org (Postfix) with ESMTP id A86CD6E9A6 for ; Thu, 10 Mar 2016 15:27:07 +0000 (UTC) Received: from fmsmga003.fm.intel.com ([10.253.24.29]) by orsmga102.jf.intel.com with ESMTP; 10 Mar 2016 07:27:07 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.24,316,1455004800"; d="scan'208";a="667097731" Received: from peterant-linux2.isw.intel.com ([10.102.226.179]) by FMSMGA003.fm.intel.com with ESMTP; 10 Mar 2016 07:27:05 -0800 From: Peter Antoine To: intel-gfx@lists.freedesktop.org Date: Thu, 10 Mar 2016 15:27:00 +0000 Message-Id: <1457623620-17505-1-git-send-email-peter.antoine@intel.com> X-Mailer: git-send-email 1.9.1 Subject: [Intel-gfx] [PATCH] drm/i915/mocs: Program MOCS for all engines on init X-BeenThere: intel-gfx@lists.freedesktop.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: Intel graphics driver community testing & development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , MIME-Version: 1.0 Errors-To: intel-gfx-bounces@lists.freedesktop.org Sender: "Intel-gfx" X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00, RCVD_IN_DNSWL_MED, RP_MATCHES_RCVD, UNPARSEABLE_RELAY autolearn=unavailable version=3.3.1 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on mail.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP Allow for the MOCS to be programmed for all engines. Currently we program the MOCS when the first render batch goes through. This works on most platforms but fails on platforms that do not run a render batch early, i.e. headless servers. The patch now programs all initialised engines on init and the RCS is programmed again within the initial batch. This is done for predictable consistency with regards to the hardware context. Hardware context loading sets the values of the MOCS for RCS and L3CC. Programming them from within the batch makes sure that the render context is valid, no matter what the previous state of the saved-context was. Signed-off-by: Peter Antoine --- drivers/gpu/drm/i915/i915_gem.c | 7 ++ drivers/gpu/drm/i915/intel_mocs.c | 137 ++++++++++++++++++++++++++++++++++---- drivers/gpu/drm/i915/intel_mocs.h | 1 + 3 files changed, 133 insertions(+), 12 deletions(-) diff --git a/drivers/gpu/drm/i915/i915_gem.c b/drivers/gpu/drm/i915/i915_gem.c index b854af2..0bb6c60 100644 --- a/drivers/gpu/drm/i915/i915_gem.c +++ b/drivers/gpu/drm/i915/i915_gem.c @@ -32,6 +32,7 @@ #include "i915_vgpu.h" #include "i915_trace.h" #include "intel_drv.h" +#include "intel_mocs.h" #include #include #include @@ -4882,6 +4883,12 @@ i915_gem_init_hw(struct drm_device *dev) goto out; } + /* + * lets program the mocs values for all engines the RCS will + * send a batch on context start, but that will have to happen + */ + intel_program_mocs_all_engines(dev); + /* We can't enable contexts until all firmware is loaded */ if (HAS_GUC_UCODE(dev)) { ret = intel_guc_ucode_load(dev); diff --git a/drivers/gpu/drm/i915/intel_mocs.c b/drivers/gpu/drm/i915/intel_mocs.c index fed7bea..3939a50 100644 --- a/drivers/gpu/drm/i915/intel_mocs.c +++ b/drivers/gpu/drm/i915/intel_mocs.c @@ -128,9 +128,9 @@ static const struct drm_i915_mocs_entry broxton_mocs_table[] = { /** * get_mocs_settings() - * @dev: DRM device. + * @dev: DRM device. * @table: Output table that will be made to point at appropriate - * MOCS values for the device. + * MOCS values for the device. * * This function will return the values of the MOCS table that needs to * be programmed for the platform. It will return the values that need @@ -179,6 +179,48 @@ static i915_reg_t mocs_register(enum intel_ring_id ring, int index) } /** + * program_mocs_control_table() - emit the mocs control table + * @req: Request to set up the MOCS table for. + * @table: The values to program into the control regs. + * @ring: The engine for whom to emit the registers. + * + * This function simply emits a MI_LOAD_REGISTER_IMM command for the + * given table starting at the given address. + * + * Return: 0 on success, otherwise the error status. + */ +static int program_mocs_control_table(struct drm_device *dev, + const struct drm_i915_mocs_table *table, + enum intel_ring_id ring) +{ + struct drm_i915_private *dev_priv = dev->dev_private; + unsigned int index; + + if (WARN_ON(table->size > GEN9_NUM_MOCS_ENTRIES)) + return -ENODEV; + + for (index = 0; index < table->size; index++) { + I915_WRITE(mocs_register(ring, index), + table->table[index].control_value); + } + + /* + * Ok, now set the unused entries to uncached. These entries + * are officially undefined and no contract for the contents + * and settings is given for these entries. + * + * Entry 0 in the table is uncached - so we are just writing + * that value to all the used entries. + */ + for (; index < GEN9_NUM_MOCS_ENTRIES; index++) { + I915_WRITE(mocs_register(ring, index), + table->table[0].control_value); + } + + return 0; +} + +/** * emit_mocs_control_table() - emit the mocs control table * @req: Request to set up the MOCS table for. * @table: The values to program into the control regs. @@ -302,6 +344,83 @@ static int emit_mocs_l3cc_table(struct drm_i915_gem_request *req, } /** + * program_mocs_l3cc_table() - program the mocs control table + * @dev: The the device to be programmed. + * @table: The values to program into the control regs. + * + * This function simply programs the mocs registers for the given table + * starting at the given address. This register set is programmed in pairs. + * + * These registers may get programmed more than once, it is simpler to + * re-program 32 registers than maintain the state of when they were programmed. + * We are always reprogramming with the same values and this only on context + * start. + * + * Return: Nothing. + */ +static void program_mocs_l3cc_table(struct drm_device *dev, + struct drm_i915_mocs_table *table) +{ + unsigned int count; + unsigned int i; + u32 value; + struct drm_i915_private *dev_priv = dev->dev_private; + u32 filler = (table->table[0].l3cc_value & 0xffff) | + ((table->table[0].l3cc_value & 0xffff) << 16); + + for (i = 0, count = 0; i < table->size / 2; i++, count += 2) { + value = (table->table[count].l3cc_value & 0xffff) | + ((table->table[count + 1].l3cc_value & 0xffff) << 16); + + I915_WRITE(GEN9_LNCFCMOCS(i), value); + } + + if (table->size & 0x01) { + /* Odd table size - 1 left over */ + value = (table->table[count].l3cc_value & 0xffff) | + ((table->table[0].l3cc_value & 0xffff) << 16); + } else + value = filler; + + /* + * Now set the rest of the table to uncached - use entry 0 as this + * will be uncached. Leave the last pair as initialised as they are + * reserved by the hardware. + */ + for (; i < (GEN9_NUM_MOCS_ENTRIES / 2); i++) { + I915_WRITE(GEN9_LNCFCMOCS(i), value); + value = filler; + } +} + +/* + * intel_program_mocs_all_engines() - program all the MOCS register. + * + * This function will program the MOCS registers with the correct values. + * For all engines. + * + * Return: Nothing. + */ +void intel_program_mocs_all_engines(struct drm_device *dev) +{ + struct drm_i915_mocs_table t; + + if (get_mocs_settings(dev, &t)) { + struct drm_i915_private *dev_priv = dev->dev_private; + struct intel_engine_cs *ring; + enum intel_ring_id ring_id; + + /* Program the control registers */ + for_each_ring(ring, dev_priv, ring_id) { + program_mocs_control_table(dev, &t, ring_id); + } + + /* Now program the l3cc registers */ + program_mocs_l3cc_table(dev, &t); + } +} + +/** * intel_rcs_context_init_mocs() - program the MOCS register. * @req: Request to set up the MOCS tables for. * @@ -323,16 +442,10 @@ int intel_rcs_context_init_mocs(struct drm_i915_gem_request *req) int ret; if (get_mocs_settings(req->ring->dev, &t)) { - struct drm_i915_private *dev_priv = req->i915; - struct intel_engine_cs *ring; - enum intel_ring_id ring_id; - - /* Program the control registers */ - for_each_ring(ring, dev_priv, ring_id) { - ret = emit_mocs_control_table(req, &t, ring_id); - if (ret) - return ret; - } + /* Program the RCS control registers */ + ret = emit_mocs_control_table(req, &t, RCS); + if (ret) + return ret; /* Now program the l3cc registers */ ret = emit_mocs_l3cc_table(req, &t); diff --git a/drivers/gpu/drm/i915/intel_mocs.h b/drivers/gpu/drm/i915/intel_mocs.h index 76e45b1..6869317 100644 --- a/drivers/gpu/drm/i915/intel_mocs.h +++ b/drivers/gpu/drm/i915/intel_mocs.h @@ -53,5 +53,6 @@ #include "i915_drv.h" int intel_rcs_context_init_mocs(struct drm_i915_gem_request *req); +void intel_program_mocs_all_engines(struct drm_device *dev); #endif