diff mbox series

[RFC] ccu-sun50i-h6: Bump-up DDR0 PLL to 1800MHz

Message ID 20220710175214.544748-1-r.stratiienko@gmail.com (mailing list archive)
State New, archived
Headers show
Series [RFC] ccu-sun50i-h6: Bump-up DDR0 PLL to 1800MHz | expand

Commit Message

Roman Stratiienko July 10, 2022, 5:52 p.m. UTC
While debugging complex animated composition cases  I noticed a glitch
which as it turned-out eventually was caused by lack of memory bandwidth.

I can't find a DRAMC manual to check what input frequency must be
supplied to avoid exceeding the 800MHz LPDDR3 which is installed on
my orangepi3 board. But the system is running stable so far.

Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com>
---
 drivers/clk/sunxi-ng/ccu-sun50i-h6.c | 11 +++++++++++
 1 file changed, 11 insertions(+)

Comments

Jernej Škrabec July 10, 2022, 6:47 p.m. UTC | #1
Hi Roman,

Dne nedelja, 10. julij 2022 ob 19:52:14 CEST je Roman Stratiienko napisal(a):
> While debugging complex animated composition cases  I noticed a glitch
> which as it turned-out eventually was caused by lack of memory bandwidth.
> 
> I can't find a DRAMC manual to check what input frequency must be
> supplied to avoid exceeding the 800MHz LPDDR3 which is installed on
> my orangepi3 board. But the system is running stable so far.

this is hard topic. Generally, max. supported DRAM frequency is board specific 
and is set by SPL (part of U-Boot), so we can't assume every board supports 
max. frequency for its memory type. In fact, this assumption was proven wrong. 
Some boards were unstable until DRAM frequency was lowered. So in short, Linux 
clock driver certainly shouldn't set DRAM frequency. There is, however, Linux 
MBUS driver for H5 and A64, which can throttle DRAM frequency when system is 
idle, but max. frequency is still that one which was programmed by SPL.

There is no definitive DRAM documentation, because Allwinner usually tries to 
hide all details. DRAM drivers are, except in rare cases, all reverse 
engineered by community. Register values (mostly) match those produced by 
vendor driver. Fortunately, at least H6 DRAM controller matches pretty good 
with one other memory controller (one in Zynq UltraScale+ MPSoC), which is 
fully documented (see comments in U-Boot driver). MBUS is also pretty standard 
and it's not a mystery.

Note that it's hard to convince maintainers to either lower or increase DRAM 
frequency as it was common issue in the past. You have to do a lot of memory 
intensive tests, both CPU and GPU, running at the same time for extended 
period of time. Initial board bring ups usually just use same frequency as in 
BSP based images. If I understand procedure correctly, boards are sent to 
Allwinner, so their engineers determine max. DRAM frequency and timings, so 
these values are pretty trustwothy.

There is other possibility. Along with DRAM controller and PHY, there is MBUS 
core, which is memory arbiter and responsible for bandwidth allocation. You 
can change DE priority and bandwith there (again, in U-Boot DRAM driver).

I hope this explains why things are the way they are.

Best regards,
Jernej

> 
> Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com>
> ---
>  drivers/clk/sunxi-ng/ccu-sun50i-h6.c | 11 +++++++++++
>  1 file changed, 11 insertions(+)
> 
> diff --git a/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
> b/drivers/clk/sunxi-ng/ccu-sun50i-h6.c index 750368a86b8b6..abdde80307993
> 100644
> --- a/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
> +++ b/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
> @@ -1203,6 +1203,17 @@ static int sun50i_h6_ccu_probe(struct platform_device
> *pdev) val &= ~GENMASK(3, 0);
>  	writel(val, reg + gpu_clk.common.reg);
> 
> +	/*
> +	 * Increase DDR0 PLL from 1488(default) to 1800MHz.
> +	 * (DE3.0 require higher memory bandwidth while displaying
> +	 * complex composition at 1920x1080@60FPS)
> +	 */
> +
> +	val = readl(reg + SUN50I_H6_PLL_DDR0_REG);
> +	val &= ~GENMASK(15, 0);
> +	val |= 74 << 8;
> +	writel(val, reg + SUN50I_H6_PLL_DDR0_REG);
> +
>  	/* Enable the lock bits on all PLLs */
>  	for (i = 0; i < ARRAY_SIZE(pll_regs); i++) {
>  		val = readl(reg + pll_regs[i]);
> --
> 2.34.1
Andre Przywara July 10, 2022, 10:54 p.m. UTC | #2
On Sun, 10 Jul 2022 20:47:55 +0200
Jernej Škrabec <jernej.skrabec@gmail.com> wrote:

Hi Roman,

> Dne nedelja, 10. julij 2022 ob 19:52:14 CEST je Roman Stratiienko napisal(a):
> > While debugging complex animated composition cases  I noticed a glitch
> > which as it turned-out eventually was caused by lack of memory bandwidth.
> > 
> > I can't find a DRAMC manual to check what input frequency must be
> > supplied to avoid exceeding the 800MHz LPDDR3 which is installed on
> > my orangepi3 board. But the system is running stable so far.  

Well, most recent Allwinner boards indeed have capable LPDDR3 (800 MHz)
or DDR3-1600 (or even higher) chips, but we never got them running at the
full clock speed, and lowering the DRAM frequency is a common kludge to
improve system stability (as in: actually reaching the login prompt).
And we figured that those "more aggressive" timings are quite board
specific, so while 800 MHz might work on your particular board, your
friend's OPi3 might not be as lucky.

> this is hard topic. Generally, max. supported DRAM frequency is board specific 
> and is set by SPL (part of U-Boot), so we can't assume every board supports 
> max. frequency for its memory type.

Yes, while we run most H6 boards at 744 MHz, the Tanix TX6 requires no
more than 648 MHz, for instance, partly because it has DDR3 (not LPDDR3)
DRAM. Another H6 TV box I have needs an even lower frequency (at least
with the current timing settings).

> In fact, this assumption was proven wrong. 
> Some boards were unstable until DRAM frequency was lowered. So in short, Linux 
> clock driver certainly shouldn't set DRAM frequency.

Yes, and on top of that, also keep in mind that many DRAM controller
registers take timing parameters in terms of clock cycles, but the
actual DDR chips are specified in terms of pico- or nanoseconds. This
means the register values must be calculated and programmed based on
the clock frequency, see [1] for examples in all those ns_to_t()
calls. And just increasing the PLL frequency might now lead to exceeded
specifications, and thus subtle instabilities.

[1] https://source.denx.de/u-boot/u-boot/-/blob/master/arch/arm/mach-sunxi/dram_timings/h6_ddr3_1333.c

> There is, however, Linux 
> MBUS driver for H5 and A64, which can throttle DRAM frequency when system is 
> idle, but max. frequency is still that one which was programmed by SPL.
> 
> There is no definitive DRAM documentation, because Allwinner usually tries to 
> hide all details. DRAM drivers are, except in rare cases, all reverse 
> engineered by community. Register values (mostly) match those produced by 
> vendor driver. Fortunately, at least H6 DRAM controller matches pretty good 
> with one other memory controller (one in Zynq UltraScale+ MPSoC), which is 
> fully documented (see comments in U-Boot driver). MBUS is also pretty standard 
> and it's not a mystery.
> 
> Note that it's hard to convince maintainers to either lower or increase DRAM 
> frequency as it was common issue in the past. You have to do a lot of memory 
> intensive tests, both CPU and GPU, running at the same time for extended 
> period of time. Initial board bring ups usually just use same frequency as in 
> BSP based images.

Yes, this DDR3 controller logic is mostly undocumented rocket science,
see [2] in our wiki for just *some* (old!) pieces of information and
the complexity it involved in the past to come to a set of timing
parameters (including the clock frequency) that works stably across
all boards.

[2] https://linux-sunxi.org/A10_DRAM_Controller_Calibration

> If I understand procedure correctly, boards are sent to 
> Allwinner, so their engineers determine max. DRAM frequency and timings, so 
> these values are pretty trustwothy.
> There is other possibility. Along with DRAM controller and PHY, there is MBUS 
> core, which is memory arbiter and responsible for bandwidth allocation. You 
> can change DE priority and bandwith there (again, in U-Boot DRAM driver).

That's indeed an interesting topic to explore, though the DE seems to
be already in the top tier in our settings:
https://source.denx.de/u-boot/u-boot/-/blob/master/arch/arm/mach-sunxi/dram_sun50i_h6.c#L135-151

Cheers,
Andre

> I hope this explains why things are the way they are.
> 
> Best regards,
> Jernej
> 
> > 
> > Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com>
> > ---
> >  drivers/clk/sunxi-ng/ccu-sun50i-h6.c | 11 +++++++++++
> >  1 file changed, 11 insertions(+)
> > 
> > diff --git a/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
> > b/drivers/clk/sunxi-ng/ccu-sun50i-h6.c index 750368a86b8b6..abdde80307993
> > 100644
> > --- a/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
> > +++ b/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
> > @@ -1203,6 +1203,17 @@ static int sun50i_h6_ccu_probe(struct platform_device
> > *pdev) val &= ~GENMASK(3, 0);
> >  	writel(val, reg + gpu_clk.common.reg);
> > 
> > +	/*
> > +	 * Increase DDR0 PLL from 1488(default) to 1800MHz.
> > +	 * (DE3.0 require higher memory bandwidth while displaying
> > +	 * complex composition at 1920x1080@60FPS)
> > +	 */
> > +
> > +	val = readl(reg + SUN50I_H6_PLL_DDR0_REG);
> > +	val &= ~GENMASK(15, 0);
> > +	val |= 74 << 8;
> > +	writel(val, reg + SUN50I_H6_PLL_DDR0_REG);
> > +
> >  	/* Enable the lock bits on all PLLs */
> >  	for (i = 0; i < ARRAY_SIZE(pll_regs); i++) {
> >  		val = readl(reg + pll_regs[i]);
> > --
> > 2.34.1  
> 
> 
>
diff mbox series

Patch

diff --git a/drivers/clk/sunxi-ng/ccu-sun50i-h6.c b/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
index 750368a86b8b6..abdde80307993 100644
--- a/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
+++ b/drivers/clk/sunxi-ng/ccu-sun50i-h6.c
@@ -1203,6 +1203,17 @@  static int sun50i_h6_ccu_probe(struct platform_device *pdev)
 	val &= ~GENMASK(3, 0);
 	writel(val, reg + gpu_clk.common.reg);
 
+	/*
+	 * Increase DDR0 PLL from 1488(default) to 1800MHz.
+	 * (DE3.0 require higher memory bandwidth while displaying
+	 * complex composition at 1920x1080@60FPS)
+	 */
+
+	val = readl(reg + SUN50I_H6_PLL_DDR0_REG);
+	val &= ~GENMASK(15, 0);
+	val |= 74 << 8;
+	writel(val, reg + SUN50I_H6_PLL_DDR0_REG);
+
 	/* Enable the lock bits on all PLLs */
 	for (i = 0; i < ARRAY_SIZE(pll_regs); i++) {
 		val = readl(reg + pll_regs[i]);