Message ID | 1626861198-6133-13-git-send-email-gaosong@loongson.cn (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | Add LoongArch linux-user emulation support | expand |
On 7/20/21 11:53 PM, Song Gao wrote: > +target_ulong helper_cpucfg(CPULoongArchState *env, target_ulong rj) > +{ > + target_ulong r = 0; > + > + switch (rj) { > + case 0: > + r = env->CSR_MCSR0 & 0xffffffff; > + break; > + case 1: > + r = (env->CSR_MCSR0 & 0xffffffff00000000) >> 32; > + break; Why do you represent all of these as high and low portions of a 64-bit internal value, when the manual describes them as 32-bit values? > +/* Fixed point extra instruction translation */ > +static bool trans_crc_w_b_w(DisasContext *ctx, arg_crc_w_b_w *a) > +{ > + TCGv t0, t1; > + TCGv Rd = cpu_gpr[a->rd]; > + TCGv_i32 tsz = tcg_const_i32(1 << 1); This size is wrong. It should be 1, not 1 << 1 (2). > +static bool trans_crc_w_w_w(DisasContext *ctx, arg_crc_w_w_w *a) > +{ > + TCGv t0, t1; > + TCGv Rd = cpu_gpr[a->rd]; > + TCGv_i32 tsz = tcg_const_i32(1 << 4); Because this size most certainly should not be 16... > +static bool trans_crc_w_d_w(DisasContext *ctx, arg_crc_w_d_w *a) > +{ > + TCGv t0, t1; > + TCGv Rd = cpu_gpr[a->rd]; > + TCGv_i32 tsz = tcg_const_i32(1 << 8); ... and this size should not be 256. Both well larger than the 8 byte buffer that you've allocated. Also, you need a helper so that you don't have 8 copies of this code. > +static bool trans_asrtle_d(DisasContext *ctx, arg_asrtle_d * a) > +{ > + TCGv t0, t1; > + > + t0 = get_gpr(a->rj); > + t1 = get_gpr(a->rk); > + > + gen_helper_asrtle_d(cpu_env, t0, t1); > + > + return true; > +} > + > +static bool trans_asrtgt_d(DisasContext *ctx, arg_asrtgt_d * a) > +{ > + TCGv t0, t1; > + > + t0 = get_gpr(a->rj); > + t1 = get_gpr(a->rk); > + > + gen_helper_asrtgt_d(cpu_env, t0, t1); > + > + return true; > +} I'm not sure why both of these instructions are in the ISA, since ASRTLE X,Y <-> ASRTGT Y,X but we certainly don't need two different helpers. Just swap the arguments for one of them. > +static bool trans_rdtimel_w(DisasContext *ctx, arg_rdtimel_w *a) > +{ > + /* Nop */ > + return true; > +} > + > +static bool trans_rdtimeh_w(DisasContext *ctx, arg_rdtimeh_w *a) > +{ > + /* Nop */ > + return true; > +} > + > +static bool trans_rdtime_d(DisasContext *ctx, arg_rdtime_d *a) > +{ > + /* Nop */ > + return true; > +} If you don't want to implement these right now, you should at least initialize the destination register to 0, or something. r~
Hi, Richard. On 07/23/2021 01:12 PM, Richard Henderson wrote: > On 7/20/21 11:53 PM, Song Gao wrote: >> +target_ulong helper_cpucfg(CPULoongArchState *env, target_ulong rj) >> +{ >> + target_ulong r = 0; >> + >> + switch (rj) { >> + case 0: >> + r = env->CSR_MCSR0 & 0xffffffff; >> + break; >> + case 1: >> + r = (env->CSR_MCSR0 & 0xffffffff00000000) >> 32; >> + break; > > Why do you represent all of these as high and low portions of a 64-bit internal value, when the manual describes them as 32-bit values? > This method can reduce variables on env. > >> +/* Fixed point extra instruction translation */ >> +static bool trans_crc_w_b_w(DisasContext *ctx, arg_crc_w_b_w *a) >> +{ >> + TCGv t0, t1; >> + TCGv Rd = cpu_gpr[a->rd]; >> + TCGv_i32 tsz = tcg_const_i32(1 << 1); > > This size is wrong. It should be 1, not 1 << 1 (2). > > >> +static bool trans_crc_w_w_w(DisasContext *ctx, arg_crc_w_w_w *a) >> +{ >> + TCGv t0, t1; >> + TCGv Rd = cpu_gpr[a->rd]; >> + TCGv_i32 tsz = tcg_const_i32(1 << 4); > > Because this size most certainly should not be 16... >>> +static bool trans_crc_w_d_w(DisasContext *ctx, arg_crc_w_d_w *a) >> +{ >> + TCGv t0, t1; >> + TCGv Rd = cpu_gpr[a->rd]; >> + TCGv_i32 tsz = tcg_const_i32(1 << 8); > > ... and this size should not be 256. Both well larger than the 8 byte buffer that you've allocated. > I'm not sure about that. > Also, you need a helper so that you don't have 8 copies of this code. > OK. >> +static bool trans_asrtle_d(DisasContext *ctx, arg_asrtle_d * a) >> +{ >> + TCGv t0, t1; >> + >> + t0 = get_gpr(a->rj); >> + t1 = get_gpr(a->rk); >> + >> + gen_helper_asrtle_d(cpu_env, t0, t1); >> + >> + return true; >> +} >> + >> +static bool trans_asrtgt_d(DisasContext *ctx, arg_asrtgt_d * a) >> +{ >> + TCGv t0, t1; >> + >> + t0 = get_gpr(a->rj); >> + t1 = get_gpr(a->rk); >> + >> + gen_helper_asrtgt_d(cpu_env, t0, t1); >> + >> + return true; >> +} > > I'm not sure why both of these instructions are in the ISA, since > > ASRTLE X,Y <-> ASRTGT Y,X > > but we certainly don't need two different helpers. > Just swap the arguments for one of them. > OK. >> +static bool trans_rdtimel_w(DisasContext *ctx, arg_rdtimel_w *a) >> +{ >> + /* Nop */ >> + return true; >> +} >> + >> +static bool trans_rdtimeh_w(DisasContext *ctx, arg_rdtimeh_w *a) >> +{ >> + /* Nop */ >> + return true; >> +} >> + >> +static bool trans_rdtime_d(DisasContext *ctx, arg_rdtime_d *a) >> +{ >> + /* Nop */ >> + return true; >> +} > > If you don't want to implement these right now, you should at least initialize the destination register to 0, or something. > OK. > > r~ Again ,thanks you kindly help. Thanks Song Gao.
On 7/26/21 2:57 AM, Song Gao wrote: > > Hi, Richard. > > On 07/23/2021 01:12 PM, Richard Henderson wrote: >> On 7/20/21 11:53 PM, Song Gao wrote: >>> +target_ulong helper_cpucfg(CPULoongArchState *env, target_ulong rj) >>> +{ >>> + target_ulong r = 0; >>> + >>> + switch (rj) { >>> + case 0: >>> + r = env->CSR_MCSR0 & 0xffffffff; >>> + break; >>> + case 1: >>> + r = (env->CSR_MCSR0 & 0xffffffff00000000) >> 32; >>> + break; >> >> Why do you represent all of these as high and low portions of a 64-bit internal value, when the manual describes them as 32-bit values? >> > This method can reduce variables on env. The number of variables may increase, but the memory consumed -- which is the metric that is more important -- is still the same. Also, it is much less confusing to match the description in the manual. r~
Hi, Richard. On 07/27/2021 12:42 AM, Richard Henderson wrote: > On 7/26/21 2:57 AM, Song Gao wrote: >> >> Hi, Richard. >> >> On 07/23/2021 01:12 PM, Richard Henderson wrote: >>> On 7/20/21 11:53 PM, Song Gao wrote: >>>> +target_ulong helper_cpucfg(CPULoongArchState *env, target_ulong rj) >>>> +{ >>>> + target_ulong r = 0; >>>> + >>>> + switch (rj) { >>>> + case 0: >>>> + r = env->CSR_MCSR0 & 0xffffffff; >>>> + break; >>>> + case 1: >>>> + r = (env->CSR_MCSR0 & 0xffffffff00000000) >> 32; >>>> + break; >>> >>> Why do you represent all of these as high and low portions of a 64-bit internal value, when the manual describes them as 32-bit values? >>> >> This method can reduce variables on env. > > The number of variables may increase, but the memory consumed -- which is the metric that is more important -- is still the same. > > Also, it is much less confusing to match the description in the manual. > OK. Thanks Song Gao. > > r~
On 07/23/2021 01:12 PM, Richard Henderson wrote: >> +static bool trans_asrtle_d(DisasContext *ctx, arg_asrtle_d * a) >> +{ >> + TCGv t0, t1; >> + >> + t0 = get_gpr(a->rj); >> + t1 = get_gpr(a->rk); >> + >> + gen_helper_asrtle_d(cpu_env, t0, t1); >> + >> + return true; >> +} >> + >> +static bool trans_asrtgt_d(DisasContext *ctx, arg_asrtgt_d * a) >> +{ >> + TCGv t0, t1; >> + >> + t0 = get_gpr(a->rj); >> + t1 = get_gpr(a->rk); >> + >> + gen_helper_asrtgt_d(cpu_env, t0, t1); >> + >> + return true; >> +} > > I'm not sure why both of these instructions are in the ISA, since > > ASRTLE X,Y <-> ASRTGT Y,X > > but we certainly don't need two different helpers. > Just swap the arguments for one of them. Hi, Richard. I find 'ASRTLE X,Y <-> ASRTGT Y,X ' is not right, ASRTLE X, Y is X <= Y, raise a exception. ASRTGT Y, X is X < Y, raise a exception, lose X=Y. Thanks Song Gao
On 08/04/2021 03:40 PM, Song Gao wrote: > > On 07/23/2021 01:12 PM, Richard Henderson wrote: >>> +static bool trans_asrtle_d(DisasContext *ctx, arg_asrtle_d * a) >>> +{ >>> + TCGv t0, t1; >>> + >>> + t0 = get_gpr(a->rj); >>> + t1 = get_gpr(a->rk); >>> + >>> + gen_helper_asrtle_d(cpu_env, t0, t1); >>> + >>> + return true; >>> +} >>> + >>> +static bool trans_asrtgt_d(DisasContext *ctx, arg_asrtgt_d * a) >>> +{ >>> + TCGv t0, t1; >>> + >>> + t0 = get_gpr(a->rj); >>> + t1 = get_gpr(a->rk); >>> + >>> + gen_helper_asrtgt_d(cpu_env, t0, t1); >>> + >>> + return true; >>> +} >> >> I'm not sure why both of these instructions are in the ISA, since >> >> ASRTLE X,Y <-> ASRTGT Y,X >> >> but we certainly don't need two different helpers. >> Just swap the arguments for one of them. > > Hi, Richard. > > I find 'ASRTLE X,Y <-> ASRTGT Y,X ' is not right, > > ASRTLE X, Y is X <= Y, raise a exception. > ASRTGT Y, X is X < Y, raise a exception, lose X=Y. > sorry, I said it wrong。 ASRTLE X, Y is X > Y, raise a exception. ASRTGT Y, X is X >= Y raise a exception. more X=Y. > Thanks > Song Gao >
diff --git a/target/loongarch/helper.h b/target/loongarch/helper.h index 5cd38c8..a60f293 100644 --- a/target/loongarch/helper.h +++ b/target/loongarch/helper.h @@ -21,3 +21,7 @@ DEF_HELPER_FLAGS_1(loongarch_dbitswap, TCG_CALL_NO_RWG_SE, tl, tl) DEF_HELPER_3(asrtle_d, void, env, tl, tl) DEF_HELPER_3(asrtgt_d, void, env, tl, tl) + +DEF_HELPER_3(crc32, tl, tl, tl, i32) +DEF_HELPER_3(crc32c, tl, tl, tl, i32) +DEF_HELPER_2(cpucfg, tl, env, tl) diff --git a/target/loongarch/insns.decode b/target/loongarch/insns.decode index 574c055..66bc314 100644 --- a/target/loongarch/insns.decode +++ b/target/loongarch/insns.decode @@ -27,6 +27,7 @@ %si14 10:s14 %hint 0:5 %whint 0:15 +%code 0:15 # # Argument sets @@ -46,6 +47,8 @@ &fmt_rdrjsi14 rd rj si14 &fmt_hintrjsi12 hint rj si12 &fmt_whint whint +&fmt_rjrk rj rk +&fmt_code code # # Formats @@ -65,6 +68,8 @@ @fmt_hintrjsi12 .... ...... ............ ..... ..... &fmt_hintrjsi12 %hint %rj %si12 @fmt_whint .... ........ ..... ............... &fmt_whint %whint @fmt_rdrjsi14 .... .... .............. ..... ..... &fmt_rdrjsi14 %rd %rj %si14 +@fmt_rjrk .... ........ ..... ..... ..... ..... &fmt_rjrk %rj %rk +@fmt_code .... ........ ..... ............... &fmt_code %code # # Fixed point arithmetic operation instruction @@ -260,3 +265,23 @@ ammax_db_wu 0011 10000111 00000 ..... ..... ..... @fmt_rdrjrk ammax_db_du 0011 10000111 00001 ..... ..... ..... @fmt_rdrjrk ammin_db_wu 0011 10000111 00010 ..... ..... ..... @fmt_rdrjrk ammin_db_du 0011 10000111 00011 ..... ..... ..... @fmt_rdrjrk + +# +# Fixed point extra instruction +# +crc_w_b_w 0000 00000010 01000 ..... ..... ..... @fmt_rdrjrk +crc_w_h_w 0000 00000010 01001 ..... ..... ..... @fmt_rdrjrk +crc_w_w_w 0000 00000010 01010 ..... ..... ..... @fmt_rdrjrk +crc_w_d_w 0000 00000010 01011 ..... ..... ..... @fmt_rdrjrk +crcc_w_b_w 0000 00000010 01100 ..... ..... ..... @fmt_rdrjrk +crcc_w_h_w 0000 00000010 01101 ..... ..... ..... @fmt_rdrjrk +crcc_w_w_w 0000 00000010 01110 ..... ..... ..... @fmt_rdrjrk +crcc_w_d_w 0000 00000010 01111 ..... ..... ..... @fmt_rdrjrk +break 0000 00000010 10100 ............... @fmt_code +syscall 0000 00000010 10110 ............... @fmt_code +asrtle_d 0000 00000000 00010 ..... ..... 00000 @fmt_rjrk +asrtgt_d 0000 00000000 00011 ..... ..... 00000 @fmt_rjrk +rdtimel_w 0000 00000000 00000 11000 ..... ..... @fmt_rdrj +rdtimeh_w 0000 00000000 00000 11001 ..... ..... @fmt_rdrj +rdtime_d 0000 00000000 00000 11010 ..... ..... @fmt_rdrj +cpucfg 0000 00000000 00000 11011 ..... ..... @fmt_rdrj diff --git a/target/loongarch/op_helper.c b/target/loongarch/op_helper.c index 738e067..5bf2806 100644 --- a/target/loongarch/op_helper.c +++ b/target/loongarch/op_helper.c @@ -13,6 +13,8 @@ #include "exec/helper-proto.h" #include "exec/exec-all.h" #include "exec/cpu_ldst.h" +#include "qemu/crc32c.h" +#include <zlib.h> /* Exceptions helpers */ void helper_raise_exception_err(CPULoongArchState *env, uint32_t exception, @@ -159,3 +161,70 @@ void helper_asrtgt_d(CPULoongArchState *env, target_ulong rj, target_ulong rk) do_raise_exception(env, EXCP_ADE, GETPC()); } } + +target_ulong helper_crc32(target_ulong val, target_ulong m, uint32_t sz) +{ + uint8_t buf[8]; + target_ulong mask = ((sz * 8) == 64) ? -1ULL : ((1ULL << (sz * 8)) - 1); + + m &= mask; + stq_le_p(buf, m); + return (int32_t) (crc32(val ^ 0xffffffff, buf, sz) ^ 0xffffffff); +} + +target_ulong helper_crc32c(target_ulong val, target_ulong m, uint32_t sz) +{ + uint8_t buf[8]; + target_ulong mask = ((sz * 8) == 64) ? -1ULL : ((1ULL << (sz * 8)) - 1); + m &= mask; + stq_le_p(buf, m); + return (int32_t) (crc32c(val, buf, sz) ^ 0xffffffff); +} + +target_ulong helper_cpucfg(CPULoongArchState *env, target_ulong rj) +{ + target_ulong r = 0; + + switch (rj) { + case 0: + r = env->CSR_MCSR0 & 0xffffffff; + break; + case 1: + r = (env->CSR_MCSR0 & 0xffffffff00000000) >> 32; + break; + case 2: + r = env->CSR_MCSR1 & 0xffffffff; + break; + case 3: + r = (env->CSR_MCSR1 & 0xffffffff00000000) >> 32; + break; + case 4: + r = env->CSR_MCSR2 & 0xffffffff; + break; + case 5: + r = (env->CSR_MCSR2 & 0xffffffff00000000) >> 32; + break; + case 6: + r = env->CSR_MCSR3 & 0xffffffff; + break; + case 10: + r = env->CSR_MCSR8 & 0xffffffff; + break; + case 11: + r = (env->CSR_MCSR8 & 0xffffffff00000000) >> 32; + break; + case 12: + r = env->CSR_MCSR9 & 0xffffffff; + break; + case 13: + r = (env->CSR_MCSR9 & 0xffffffff00000000) >> 32; + break; + case 14: + r = env->CSR_MCSR10 & 0xffffffff; + break; + case 30: + r = env->CSR_MCSR24 & 0xffffffff; + break; + } + return r; +} diff --git a/target/loongarch/trans.inc.c b/target/loongarch/trans.inc.c index a87da4a..366877e 100644 --- a/target/loongarch/trans.inc.c +++ b/target/loongarch/trans.inc.c @@ -3084,3 +3084,238 @@ TRANS_AM_DB_D(ammin_db_du, fetch_umin) /* trans_ammin_db_du */ #undef TRANS_AM_DB #undef TRANS_AM_DB_W #undef TRANS_AM_DB_D + +/* Fixed point extra instruction translation */ +static bool trans_crc_w_b_w(DisasContext *ctx, arg_crc_w_b_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 1); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_crc_w_h_w(DisasContext *ctx, arg_crc_w_h_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 2); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_crc_w_w_w(DisasContext *ctx, arg_crc_w_w_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 4); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_crc_w_d_w(DisasContext *ctx, arg_crc_w_d_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 8); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_crcc_w_b_w(DisasContext *ctx, arg_crcc_w_b_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 1); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32c(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_crcc_w_h_w(DisasContext *ctx, arg_crcc_w_h_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 2); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32c(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_crcc_w_w_w(DisasContext *ctx, arg_crcc_w_w_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 4); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32c(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_crcc_w_d_w(DisasContext *ctx, arg_crcc_w_d_w *a) +{ + TCGv t0, t1; + TCGv Rd = cpu_gpr[a->rd]; + TCGv_i32 tsz = tcg_const_i32(1 << 8); + + if (a->rd == 0) { + /* Nop */ + return true; + } + + t0 = get_gpr(a->rk); + t1 = get_gpr(a->rj); + + gen_helper_crc32c(Rd, t0, t1, tsz); + + tcg_temp_free_i32(tsz); + + return true; +} + +static bool trans_break(DisasContext *ctx, arg_break *a) +{ + generate_exception_end(ctx, EXCP_BREAK); + return true; +} + +static bool trans_syscall(DisasContext *ctx, arg_syscall *a) +{ + generate_exception_end(ctx, EXCP_SYSCALL); + return true; +} + +static bool trans_asrtle_d(DisasContext *ctx, arg_asrtle_d * a) +{ + TCGv t0, t1; + + t0 = get_gpr(a->rj); + t1 = get_gpr(a->rk); + + gen_helper_asrtle_d(cpu_env, t0, t1); + + return true; +} + +static bool trans_asrtgt_d(DisasContext *ctx, arg_asrtgt_d * a) +{ + TCGv t0, t1; + + t0 = get_gpr(a->rj); + t1 = get_gpr(a->rk); + + gen_helper_asrtgt_d(cpu_env, t0, t1); + + return true; +} + +static bool trans_rdtimel_w(DisasContext *ctx, arg_rdtimel_w *a) +{ + /* Nop */ + return true; +} + +static bool trans_rdtimeh_w(DisasContext *ctx, arg_rdtimeh_w *a) +{ + /* Nop */ + return true; +} + +static bool trans_rdtime_d(DisasContext *ctx, arg_rdtime_d *a) +{ + /* Nop */ + return true; +} + +static bool trans_cpucfg(DisasContext *ctx, arg_cpucfg *a) +{ + TCGv t0; + TCGv Rd = cpu_gpr[a->rd]; + + t0 = get_gpr(a->rj); + + gen_helper_cpucfg(Rd, cpu_env, t0); + + return true; +}
This patch implement fixed point extra instruction translation. This includes: - CRC[C].W.{B/H/W/D}.W - SYSCALL - BREAK - ASRT{LE/GT}.D - RDTIME{L/H}.W, RDTIME.D - CPUCFG Signed-off-by: Song Gao <gaosong@loongson.cn> --- target/loongarch/helper.h | 4 + target/loongarch/insns.decode | 25 +++++ target/loongarch/op_helper.c | 69 +++++++++++++ target/loongarch/trans.inc.c | 235 ++++++++++++++++++++++++++++++++++++++++++ 4 files changed, 333 insertions(+)