1. 14 Aug, 2019 1 commit
  2. 13 Aug, 2019 1 commit
    • avutil/mips: refine msa macros CLIP_*. · a3e572d9
      gxw authored
      Details of the changes:
      1. Remove the local variable 'out_m' in 'CLIP_SH' and store the result in
         the source vector.
      2. Refine the implementation of the macros 'CLIP_SH_0_255' and 'CLIP_SW_0_255'.
         VP8 decoding is about 1.1% faster (from 7.03x to 7.11x).
         H264 decoding is about 0.5% faster (from 4.35x to 4.37x).
         Theora decoding is about 0.7% faster (from 5.79x to 5.83x).
      3. Remove the redundant macros 'CLIP_SH/Wn_0_255_MAX_SATU' and use 'CLIP_SH/Wn_0_255'
         instead, because the two macros have the same effect.
      Reviewed-by: Shiyou Yin <yinshiyou-hf@loongson.cn>
      Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
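The clipping behaviour described in the commit above can be modelled in portable scalar C. This is only a sketch, not the MSA macros themselves: the function names are hypothetical, a plain array of eight `int16_t` lanes stands in for a 128-bit vector, and the "max with 0, then unsigned saturate to 8 bits" variant is an assumption about what the `_MAX_SATU` form did. It illustrates writing the result back into the source vector (item 1) and why two such formulations can be interchangeable (item 3).

```c
#include <assert.h>
#include <stdint.h>

#define LANES 8  /* eight signed 16-bit lanes model one 128-bit vector */

/* Variant A (hypothetical name): clamp each lane to [0, 255] in place. */
static void clip_0_255_minmax(int16_t v[LANES])
{
    for (int i = 0; i < LANES; i++) {
        if (v[i] < 0)   v[i] = 0;
        if (v[i] > 255) v[i] = 255;
    }
}

/* Variant B (hypothetical name): signed max with 0, then treat the lane
 * as unsigned and saturate it to 8 bits. Also works in place. */
static void clip_0_255_max_satu(int16_t v[LANES])
{
    for (int i = 0; i < LANES; i++) {
        int16_t  t = v[i] < 0 ? 0 : v[i];
        uint16_t u = (uint16_t)t;
        v[i] = (int16_t)(u > 0xFF ? 0xFF : u);
    }
}

/* Returns 1 if both variants agree on every possible 16-bit input. */
static int clip_variants_agree(void)
{
    assert(LANES * sizeof(int16_t) == 16);  /* 8 lanes x 2 bytes = 128 bits */
    for (int x = INT16_MIN; x <= INT16_MAX; x++) {
        int16_t a[LANES], b[LANES];
        for (int i = 0; i < LANES; i++)
            a[i] = b[i] = (int16_t)x;
        clip_0_255_minmax(a);
        clip_0_255_max_satu(b);
        for (int i = 0; i < LANES; i++)
            if (a[i] != b[i])
                return 0;
    }
    return 1;
}
```

Because both functions overwrite their argument, no temporary output vector is needed, which mirrors the removal of 'out_m' in spirit.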
  3. 18 Jul, 2019 1 commit
    • avutil/mips: refactor msa load and store macros. · 153c6075
      Shiyou Yin authored
      Replace STnxm_UB and LDnxm_SH with new macros ST_{H/W/D}{1/2/4/8}.
      The old macros are difficult to use because they do not follow consistent parameter-passing rules.
      Details of the changes:
      1. Remove LD4x4_SH.
      2. Replace ST2x4_UB with ST_H4.
      3. Replace ST4x2_UB with ST_W2.
      4. Replace ST4x4_UB with ST_W4.
      5. Replace ST4x8_UB with ST_W8.
      6. Replace ST6x4_UB with ST_W2 and ST_H2.
      7. Replace ST8x1_UB with ST_D1.
      8. Replace ST8x2_UB with ST_D2.
      9. Replace ST8x4_UB with ST_D4.
      10. Replace ST8x8_UB with ST_D8.
      11. Replace ST12x4_UB with ST_D4 and ST_W4.
      
      Example of a new macro: ST_H4(in, idx0, idx1, idx2, idx3, pdst, stride)
      ST_H4 stores four half-word elements of vector 'in' to pdst with the given stride.
      About the macro name:
      1) 'ST' means a store operation.
      2) 'H/W/D' means the vector element type is 'half-word/word/double-word'.
      3) The number '1/2/4/8' is how many elements will be stored.
      About the macro parameters:
      1) 'in0, in1...': 128-bit vectors.
      2) 'idx0, idx1...': element indices.
      3) 'pdst': destination pointer to store to.
      4) 'stride': stride of each store operation.
      Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
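The parameter convention described for ST_H4 can be illustrated with a portable scalar model. This is a sketch under stated assumptions, not the MSA macro: `vec128_h`, `store_h1_model` and `store_h4_model` are hypothetical names, the vector is modelled as a struct of eight half-words, and the stride is assumed to be in bytes and applied once per stored element.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a 128-bit vector viewed as eight half-words. */
typedef struct {
    int16_t h[8];
} vec128_h;

/* Store one half-word element, selected by idx, to pdst. */
static void store_h1_model(const vec128_h *in, int idx, uint8_t *pdst)
{
    assert(idx >= 0 && idx < 8);
    memcpy(pdst, &in->h[idx], sizeof(int16_t));  /* unaligned-safe store */
}

/* Store four half-word elements; each store advances pdst by stride bytes
 * (assumed unit). The argument order follows the commit message:
 * vector, element indices, destination pointer, stride. */
static void store_h4_model(const vec128_h *in,
                           int idx0, int idx1, int idx2, int idx3,
                           uint8_t *pdst, ptrdiff_t stride)
{
    store_h1_model(in, idx0, pdst);
    store_h1_model(in, idx1, pdst + stride);
    store_h1_model(in, idx2, pdst + 2 * stride);
    store_h1_model(in, idx3, pdst + 3 * stride);
}
```

With this shape, a call such as `store_h4_model(&in, 0, 2, 4, 6, dst, 8)` would write elements 0, 2, 4 and 6 of `in` to the first two bytes of four consecutive 8-byte rows, leaving the rest of each row untouched.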
  4. 10 Jul, 2019 1 commit
  5. 25 Oct, 2017 1 commit
  6. 10 Oct, 2017 1 commit
  7. 27 Sep, 2017 1 commit
  8. 24 Sep, 2017 1 commit
  9. 15 Sep, 2017 1 commit
  10. 08 Sep, 2017 1 commit
  11. 25 Jul, 2017 1 commit
  12. 21 Jul, 2017 1 commit
  13. 05 Oct, 2016 1 commit
  14. 23 Jul, 2015 1 commit
  15. 07 Jul, 2015 1 commit
  16. 06 Jul, 2015 2 commits
  17. 01 Jul, 2015 1 commit
  18. 29 Jun, 2015 1 commit
  19. 19 Jun, 2015 1 commit
  20. 18 Jun, 2015 1 commit
  21. 13 Jun, 2015 1 commit
  22. 11 Jun, 2015 3 commits
  23. 10 Jun, 2015 2 commits
  24. 04 Jun, 2015 1 commit
  25. 03 Jun, 2015 2 commits
  26. 01 Jun, 2015 1 commit
  27. 28 May, 2015 2 commits
    • avcodec/mips: Split uni mc optimizations to new file · 10b77fbf
      Shivraj Patil authored
      This patch moves the HEVC code for the uni mc cases to a new file, hevc_mc_uni_msa.c.
      (There are 5 sub-modules of HEVC mc functions in total. Adding all of them to a single file would make it huge (~750k) and difficult to maintain, so they are split across multiple files.)
      This patch also adds a new HEVC header file, libavcodec/mips/hevc_macros_msa.h.
      Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
      Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
    • avutil/mips: Restructure of generic macros · 02a49912
      Shivraj Patil authored
      This patch restructures existing macros and adds more generic macros.
      
      This change was necessary to avoid repeated review comments on the remaining patches that we were about to submit.
      This patch also reduces the number of code lines through maximum use of generic macros and allows better code alignment and readability.
      
      These modifications to the commonly used libavutil/mips/generic_macros_msa.h impact already accepted code, which is therefore re-submitted in 2/4, 3/4 and 4/4.
      Overall, this patch set only upgrades the code with styling changes and brings it in sync with the latest MIPS-SIMD optimized codebase at our end.
      Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
      Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
  28. 07 May, 2015 1 commit
  29. 01 May, 2015 1 commit
  30. 24 Apr, 2015 1 commit
  31. 20 Apr, 2015 1 commit
  32. 17 Apr, 2015 1 commit