Page MenuHome GnuPG
Feed Advanced Search

Oct 26 2022

jukivili committed rC14f39993d632: sm4: accelerate ECB (for benchmarking) (authored by jukivili).
sm4: accelerate ECB (for benchmarking)
Oct 26 2022, 9:05 PM
jukivili committed rC6475d0915ffe: camellia: accelerate ECB (for benchmarking) (authored by jukivili).
camellia: accelerate ECB (for benchmarking)
Oct 26 2022, 9:05 PM
jukivili committed rCa43e03ef842b: sm4: fix lookup-table prefetching (authored by jukivili).
sm4: fix lookup-table prefetching
Oct 26 2022, 9:05 PM
jukivili committed rC7c1aa4c9452a: rijndael-vaes: align asm functions (authored by jukivili).
rijndael-vaes: align asm functions
Oct 26 2022, 9:05 PM
jukivili committed rC84f3d41acb23: rijndael: add ECB acceleration (for benchmarking purposes) (authored by jukivili).
rijndael: add ECB acceleration (for benchmarking purposes)
Oct 26 2022, 9:05 PM
jukivili committed rCbf5ec001dfcb: mpi/longlong: update powerpc macros from GCC (authored by jukivili).
mpi/longlong: update powerpc macros from GCC
Oct 26 2022, 9:05 PM
jukivili committed rC4b1cb76e3587: hwf-x86: enable VPGATHER usage for AMD CPUs with AVX512 (authored by jukivili).
hwf-x86: enable VPGATHER usage for AMD CPUs with AVX512
Oct 26 2022, 9:05 PM
jukivili committed rCc0f85e0c8657: sha512-avx512: enable only on Intel CPUs for now (authored by jukivili).
sha512-avx512: enable only on Intel CPUs for now
Oct 26 2022, 9:05 PM

Oct 16 2022

jukivili created T6242: libgcrypt: optimize ECB? (as it may be used to estimate library crypto performance).
Oct 16 2022, 4:57 PM · libgcrypt, Feature Request

Oct 8 2022

jukivili closed T6232: Documentation refers to wrong the RFC for GCM-SIV cipher mode (GCRY_CIPHER_MODE_GCM_SIV) as Resolved.

Thanks. Fix has been pushed to master.

Oct 8 2022, 2:51 PM · Documentation
jukivili committed rC94dcd2cf5cd1: doc: fix RFC reference for GCM-SIV (authored by jukivili).
doc: fix RFC reference for GCM-SIV
Oct 8 2022, 2:36 PM
jukivili committed rCf01d4b7a601f: mpi/longlong.h: i386: use tzcnt instruction for trailing zeros (authored by jukivili).
mpi/longlong.h: i386: use tzcnt instruction for trailing zeros
Oct 8 2022, 2:36 PM
jukivili committed rC38bd31bc46b9: mpi/longlong.h: x86-64: use tzcnt instruction for trailing zeros (authored by jukivili).
mpi/longlong.h: x86-64: use tzcnt instruction for trailing zeros
Oct 8 2022, 2:36 PM
jukivili committed rCd67f453d1b9d: mpi/longlong: fix generic smul_ppmm ifdef (authored by jukivili).
mpi/longlong: fix generic smul_ppmm ifdef
Oct 8 2022, 2:36 PM
jukivili committed rCaef0ddf4d37a: mpi/longlong: provide generic implementation using double word type (authored by jukivili).
mpi/longlong: provide generic implementation using double word type
Oct 8 2022, 2:36 PM
jukivili claimed T6232: Documentation refers to wrong the RFC for GCM-SIV cipher mode (GCRY_CIPHER_MODE_GCM_SIV).
Oct 8 2022, 2:11 PM · Documentation

Oct 2 2022

jukivili committed rC567bc62e1c30: tests: Avoid memory leak (authored by Jakuje).
tests: Avoid memory leak
Oct 2 2022, 4:02 PM
jukivili committed rC0909186b9e66: t-rsa-testparm: fix 'function declaration isn’t a prototype' warning (authored by jukivili).
t-rsa-testparm: fix 'function declaration isn’t a prototype' warning
Oct 2 2022, 4:02 PM
jukivili committed rC6419fbb1d3dd: tests/benchmark: remove VLA usage (authored by jukivili).
tests/benchmark: remove VLA usage
Oct 2 2022, 4:02 PM
jukivili committed rC335b8eb1211b: tests/bench-slope: remove VLA usage (authored by jukivili).
tests/bench-slope: remove VLA usage
Oct 2 2022, 4:02 PM
jukivili committed rCce60a68a1172: cipher-ccm: remove VLA usage (authored by jukivili).
cipher-ccm: remove VLA usage
Oct 2 2022, 4:02 PM
jukivili committed rC9978fc22045c: mpi/ec: remove VLA usage (authored by jukivili).
mpi/ec: remove VLA usage
Oct 2 2022, 4:02 PM
jukivili added a comment to T6217: sha3: wrong results for large inputs.

Patch applied to master, thanks.

Oct 2 2022, 3:55 PM · libgcrypt, FIPS, Bug Report

Sep 27 2022

jukivili committed rC0cb29a5736cf: tests/hashtest: add hugeblock & disable-hwf options and 6 gig test vectors (authored by jukivili).
tests/hashtest: add hugeblock & disable-hwf options and 6 gig test vectors
Sep 27 2022, 7:34 AM
jukivili committed rC9c828129b205: keccak: Use size_t to avoid integer overflow (authored by Jakuje).
keccak: Use size_t to avoid integer overflow
Sep 27 2022, 7:34 AM
jukivili added a comment to T6217: sha3: wrong results for large inputs.

I've tested the different hw implementations (amd64, arm64, s390x) and they are all ok.

Sep 27 2022, 7:27 AM · libgcrypt, FIPS, Bug Report

Sep 25 2022

jukivili added a comment to T6217: sha3: wrong results for large inputs.

Fix looks good to me. This could be tested with new long running test (tests/hashtest) that would allocate 4GiB+ pattern block for inputting to gcry_md_write.

Sep 25 2022, 9:26 PM · libgcrypt, FIPS, Bug Report

Aug 18 2022

jukivili committed rC8a1f50e66364: kdf: Restructure KDF test vectors (authored by gmazyland).
kdf: Restructure KDF test vectors
Aug 18 2022, 7:03 PM
jukivili committed rCa20700c55f0e: kdf: Allow empty password for Argon2 (authored by gmazyland).
kdf: Allow empty password for Argon2
Aug 18 2022, 7:03 PM

Aug 11 2022

jukivili updated the task description for T4460: libgcrypt performance TODOs.
Aug 11 2022, 6:21 PM · libgcrypt
jukivili updated the task description for T4460: libgcrypt performance TODOs.
Aug 11 2022, 6:19 PM · libgcrypt
jukivili updated the task description for T4460: libgcrypt performance TODOs.
Aug 11 2022, 6:19 PM · libgcrypt
jukivili updated the task description for T4460: libgcrypt performance TODOs.
Aug 11 2022, 6:19 PM · libgcrypt

Aug 2 2022

jukivili committed rC97e2c237f15d: tests/basic: Add ifdefs for SM4 and CAMELLIA tests (authored by neverpanic).
tests/basic: Add ifdefs for SM4 and CAMELLIA tests
Aug 2 2022, 6:04 PM

Aug 1 2022

jukivili committed rCa8f66410ca0d: basic: gcm-siv: add fips checks for SM4 and CAMELLIA128 (authored by jukivili).
basic: gcm-siv: add fips checks for SM4 and CAMELLIA128
Aug 1 2022, 7:29 PM

Jul 31 2022

jukivili committed rC8287dea8379f: sm4: add ARMv8 CE accelerated implementation for XTS mode (authored by Tianjia Zhang <tianjia.zhang@linux.alibaba.com>).
sm4: add ARMv8 CE accelerated implementation for XTS mode
Jul 31 2022, 6:02 PM
jukivili committed rCa0c933f6fed0: sm4: fix unused parameter compiler warning (authored by jukivili).
sm4: fix unused parameter compiler warning
Jul 31 2022, 6:02 PM
jukivili committed rCdb5136c4d933: Simplify AES key schedule implementation (authored by jukivili).
Simplify AES key schedule implementation
Jul 31 2022, 6:02 PM
jukivili committed rC2ac6c24aa530: rijndael-ppc: small speed-up for CBC and CFB encryption (authored by jukivili).
rijndael-ppc: small speed-up for CBC and CFB encryption
Jul 31 2022, 6:02 PM

Jul 28 2022

jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 28 2022, 12:19 PM · libgcrypt
jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 28 2022, 12:15 PM · libgcrypt
jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 28 2022, 12:15 PM · libgcrypt

Jul 27 2022

jukivili added a comment to T5826: Improve detached signing and verification speed.

@werner Could these two patches could be backported to 2.2? These changes give same level of performance increase in 2.2 as seen in 2.3.

Jul 27 2022, 4:48 PM · gnupg

Jul 25 2022

jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 25 2022, 11:00 PM · libgcrypt
jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 25 2022, 11:00 PM · libgcrypt
jukivili committed rC909daa700e4b: blake2: add AVX512 accelerated implementations (authored by jukivili).
blake2: add AVX512 accelerated implementations
Jul 25 2022, 3:34 PM
jukivili committed rCe51d3b8330a1: sha512: add AArch64 crypto/SHA512 extension implementation (authored by jukivili).
sha512: add AArch64 crypto/SHA512 extension implementation
Jul 25 2022, 3:34 PM
jukivili committed rCdca0bd133dd0: sm4-arm-sve-ce: use 32 parallel blocks for XTS and CTR32LE (authored by jukivili).
sm4-arm-sve-ce: use 32 parallel blocks for XTS and CTR32LE
Jul 25 2022, 3:34 PM
jukivili committed rCcf956793afc2: sm4 & camellia: add generic bulk acceleration for CTR32LE mode (GCM-SIV) (authored by jukivili).
sm4 & camellia: add generic bulk acceleration for CTR32LE mode (GCM-SIV)
Jul 25 2022, 3:34 PM
jukivili committed rCbeaad75f4655: sha3: Add x86-64 AVX512 accelerated implementation (authored by jukivili).
sha3: Add x86-64 AVX512 accelerated implementation
Jul 25 2022, 3:34 PM
jukivili committed rCeaed633c1662: sm4: add amd64 GFNI/AVX512 implementation (authored by jukivili).
sm4: add amd64 GFNI/AVX512 implementation
Jul 25 2022, 3:34 PM

Jul 21 2022

jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 21 2022, 4:07 PM · libgcrypt
jukivili committed rCea5e71f9c8eb: Add ARMv9 SVE2 and optional Crypto Extension HW features (authored by Tianjia Zhang <tianjia.zhang@linux.alibaba.com>).
Add ARMv9 SVE2 and optional Crypto Extension HW features
Jul 21 2022, 10:04 AM
jukivili committed rC8921b5221e33: Add detection for HW feature "ARMv8 SVE" (authored by Tianjia Zhang <tianjia.zhang@linux.alibaba.com>).
Add detection for HW feature "ARMv8 SVE"
Jul 21 2022, 10:04 AM
jukivili committed rC2dc265400674: Add SM4 ARMv9 SVE CE assembly implementation (authored by Tianjia Zhang <tianjia.zhang@linux.alibaba.com>).
Add SM4 ARMv9 SVE CE assembly implementation
Jul 21 2022, 10:04 AM

Jul 20 2022

jukivili committed rC434a58d0e784: visibility: add missing fips_is_operational check for gcry_md_extract (authored by jukivili).
visibility: add missing fips_is_operational check for gcry_md_extract
Jul 20 2022, 9:55 PM
jukivili committed rC9636c88262fc: hwf-x86: fix UBSAN warning (authored by jukivili).
hwf-x86: fix UBSAN warning
Jul 20 2022, 9:55 PM
jukivili committed rC4abcedcea820: hwf-arm: add ARM HW feature detection support for MacOS (authored by jukivili).
hwf-arm: add ARM HW feature detection support for MacOS
Jul 20 2022, 9:55 PM

Jul 7 2022

jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 7 2022, 7:03 PM · libgcrypt
jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 7 2022, 10:36 AM · libgcrypt

Jul 6 2022

jukivili updated the task description for T4460: libgcrypt performance TODOs.
Jul 6 2022, 8:19 PM · libgcrypt
jukivili committed rC66ef99bb1804: sm4: fix wrong macro used for GFNI/AVX2 code-path (authored by jukivili).
sm4: fix wrong macro used for GFNI/AVX2 code-path
Jul 6 2022, 12:17 PM
jukivili committed rCfd3ed68754eb: tests/basic: enable IV checks for CBC/CFB/CTR bulk tests (authored by jukivili).
tests/basic: enable IV checks for CBC/CFB/CTR bulk tests
Jul 6 2022, 12:17 PM
jukivili committed rC935e211af145: sm4: fix use of GFNI/AVX2 accelerated key expansion (authored by jukivili).
sm4: fix use of GFNI/AVX2 accelerated key expansion
Jul 6 2022, 12:17 PM
jukivili committed rC99b7375bd616: camellia-gfni-avx512: remove copy-paste / leftover extra instructions (authored by jukivili).
camellia-gfni-avx512: remove copy-paste / leftover extra instructions
Jul 6 2022, 12:17 PM
jukivili committed rCac14d9ee7a09: camellia-gfni-avx512: add missing register clearing on function exits (authored by jukivili).
camellia-gfni-avx512: add missing register clearing on function exits
Jul 6 2022, 12:17 PM

Jun 12 2022

jukivili closed T6006: Optimize Chacha20 and Poly1305 for PPC P10 LE as Resolved.
Jun 12 2022, 9:58 PM · patch, ppc, Feature Request, libgcrypt
jukivili added a comment to T6006: Optimize Chacha20 and Poly1305 for PPC P10 LE.

Patch applied to master with small changes.

Jun 12 2022, 9:58 PM · patch, ppc, Feature Request, libgcrypt
jukivili committed rC88fe7ac33eb4: Chacha20/poly1305 - Optimized chacha20/poly1305 for P10 operation (authored by dannytsen).
Chacha20/poly1305 - Optimized chacha20/poly1305 for P10 operation
Jun 12 2022, 9:14 PM
jukivili committed rC2c5e5ab6843d: ppc: enable P10 assembly with ENABLE_FORCE_SOFT_HWFEATURES on arch-3.00 (authored by jukivili).
ppc: enable P10 assembly with ENABLE_FORCE_SOFT_HWFEATURES on arch-3.00
Jun 12 2022, 9:14 PM

Jun 3 2022

jukivili added a comment to T6006: Optimize Chacha20 and Poly1305 for PPC P10 LE.

Thanks for updated patch. I'm travelling next week and have time to check it closely only after I'm back. On quick glance, it looks good. What is also needed is the changelog for git commit log.

Jun 3 2022, 10:30 AM · patch, ppc, Feature Request, libgcrypt

Jun 1 2022

jukivili added a comment to T6006: Optimize Chacha20 and Poly1305 for PPC P10 LE.

I meant interleaving integer register based 1xPoly1305 with 8xChacha20 as is done for 4xChacha20 in cipher/chacha20-ppc.c (interleaved so that for each 4xChaCha20 processed, 4 blocks of 1xPoly1305 is executed). Quite often microarchitectures have separate execution units for integer registers and vector registers and then it makes sense to interleave integer-poly1305 with vector-chacha20 as algorithms do not end up competing for same execution resources. Interleaving vector-poly1305 and vector-chacha20 is not likely to give performance increase (and likely to run problems with running out of vector registers).

Jun 1 2022, 5:37 PM · patch, ppc, Feature Request, libgcrypt

May 28 2022

jukivili added a comment to T6006: Optimize Chacha20 and Poly1305 for PPC P10 LE.

Problem is that new assembly is using VSX registers vs14-vs31 which overlap with floating-point registers f14-f31. f14-f31 are ABI callee saved, so those need to be stored and restored.

May 28 2022, 9:04 PM · patch, ppc, Feature Request, libgcrypt
jukivili added a comment to T6006: Optimize Chacha20 and Poly1305 for PPC P10 LE.

Tested patch with small change so that HWF_PPC_ARCH_3_00 is used instead of HWF_PPC_ARCH_3_10. Building bench-slope with "-O3 -flto" makes bug in new implementation visible. Without new implementations bench-slope is ok (testing with QEMU):

$ tests/bench-slope --disable-hwf ppc-arch_3_00 cipher chacha20
Cipher:
 CHACHA20       |  nanosecs/byte   mebibytes/sec   cycles/byte
     STREAM enc |      2.35 ns/B     405.0 MiB/s         - c/B
     STREAM dec |      2.32 ns/B     410.7 MiB/s         - c/B
   POLY1305 enc |      2.46 ns/B     388.0 MiB/s         - c/B
   POLY1305 dec |      2.34 ns/B     408.1 MiB/s         - c/B
  POLY1305 auth |     0.238 ns/B      4003 MiB/s         - c/B
May 28 2022, 6:49 PM · patch, ppc, Feature Request, libgcrypt

May 27 2022

jukivili added a comment to T6006: Optimize Chacha20 and Poly1305 for PPC P10 LE.

-O2 problem with bench-slope seems strange. Does problem appear after this patch is applied?

May 27 2022, 7:15 PM · patch, ppc, Feature Request, libgcrypt

May 15 2022

jukivili committed rCfd02e8e78470: aarch64-asm: use ADR for getting pointers for local labels (authored by jukivili).
aarch64-asm: use ADR for getting pointers for local labels
May 15 2022, 10:27 AM

May 11 2022

jukivili committed rC9ab61ba24b72: camellia: add amd64 GFNI/AVX512 implementation (authored by jukivili).
camellia: add amd64 GFNI/AVX512 implementation
May 11 2022, 7:37 PM
jukivili committed rCa9700956361d: cipher: move CBC/CFB/CTR self-tests to tests/basic (authored by jukivili).
cipher: move CBC/CFB/CTR self-tests to tests/basic
May 11 2022, 7:37 PM

May 9 2022

jukivili created T5970: gcry_mpi_invm producing wrong result.
May 9 2022, 8:30 PM · backport, libgcrypt, Bug Report
jukivili updated the task description for T4460: libgcrypt performance TODOs.
May 9 2022, 8:19 PM · libgcrypt

Apr 30 2022

jukivili committed rC9ba1f0091ff4: tests/basic: add testing for partial bulk processing code paths (authored by jukivili).
tests/basic: add testing for partial bulk processing code paths
Apr 30 2022, 12:37 PM
jukivili committed rCaad3381e9384: sm4: add XTS bulk processing (authored by jukivili).
sm4: add XTS bulk processing
Apr 30 2022, 12:37 PM
jukivili committed rCe239738b4af2: sm4-aesni-avx2: add generic 1 to 16 block bulk processing function (authored by jukivili).
sm4-aesni-avx2: add generic 1 to 16 block bulk processing function
Apr 30 2022, 12:37 PM
jukivili committed rC32b18cdb87b7: camellia-avx2: add bulk processing for XTS mode (authored by jukivili).
camellia-avx2: add bulk processing for XTS mode
Apr 30 2022, 12:37 PM
jukivili committed rC5095d60af42d: Add SM4 x86-64/GFNI/AVX2 implementation (authored by jukivili).
Add SM4 x86-64/GFNI/AVX2 implementation
Apr 30 2022, 12:37 PM
jukivili committed rCe1c5f950838b: sm4: deduplicate bulk processing function selection (authored by jukivili).
sm4: deduplicate bulk processing function selection
Apr 30 2022, 12:37 PM
jukivili committed rC9388279803ff: Move bulk OCB L pointer array setup code to common header (authored by jukivili).
Move bulk OCB L pointer array setup code to common header
Apr 30 2022, 12:37 PM
jukivili committed rC754055ccd043: cipher/bulkhelp: add functions for CTR/CBC/CFB/OCB bulk processing (authored by jukivili).
cipher/bulkhelp: add functions for CTR/CBC/CFB/OCB bulk processing
Apr 30 2022, 12:37 PM
jukivili committed rCbacdc1de3f4f: camellia-avx2: add partial parallel block processing (authored by jukivili).
camellia-avx2: add partial parallel block processing
Apr 30 2022, 12:37 PM
jukivili committed rC3410d40996d8: Add detection for HW feature "intel-gfni" (authored by jukivili).
Add detection for HW feature "intel-gfni"
Apr 30 2022, 12:37 PM
jukivili committed rC4e6896eb9fce: Add GFNI/AVX2 implementation of Camellia (authored by jukivili).
Add GFNI/AVX2 implementation of Camellia
Apr 30 2022, 12:37 PM

Apr 19 2022

jukivili closed T5913: libgcrypt: bug fix for PPC bulk AES-GCM acceleratieration, missing HWF_PPC_ARCH_3_10 in HW feature as Resolved.
Apr 19 2022, 5:59 PM · ppc, libgcrypt

Apr 6 2022

jukivili committed rCa7c3e0b9b0ff: doc: Fix missing ARM hardware features (authored by Tianjia Zhang <tianjia.zhang@linux.alibaba.com>).
doc: Fix missing ARM hardware features
Apr 6 2022, 9:34 PM
jukivili committed rC9a63cfd61753: chacha20: add AVX512 implementation (authored by jukivili).
chacha20: add AVX512 implementation
Apr 6 2022, 9:34 PM
jukivili committed rC972aae9fc337: build: Fix for arm crypto support (authored by Tianjia Zhang <tianjia.zhang@linux.alibaba.com>).
build: Fix for arm crypto support
Apr 6 2022, 9:34 PM
jukivili committed rCcd3ed4977076: poly1305: add AVX512 implementation (authored by jukivili).
poly1305: add AVX512 implementation
Apr 6 2022, 9:34 PM

Apr 4 2022

jukivili committed rCfe891ff4a3cd: Add SM3 ARMv8/AArch64/CE assembly implementation (authored by Tianjia Zhang <tianjia.zhang@linux.alibaba.com>).
Add SM3 ARMv8/AArch64/CE assembly implementation
Apr 4 2022, 6:12 PM

Apr 1 2022

jukivili added a comment to T5913: libgcrypt: bug fix for PPC bulk AES-GCM acceleratieration, missing HWF_PPC_ARCH_3_10 in HW feature.

Fixed in master. I rechecked that bulk implementation passes tests with qemu-ppc64le.

Apr 1 2022, 8:55 AM · ppc, libgcrypt
jukivili committed rC29bfb3ebbc63: hwf-ppc: fix missing HWF_PPC_ARCH_3_10 in HW feature (authored by jukivili).
hwf-ppc: fix missing HWF_PPC_ARCH_3_10 in HW feature
Apr 1 2022, 8:54 AM
jukivili added a comment to T5913: libgcrypt: bug fix for PPC bulk AES-GCM acceleratieration, missing HWF_PPC_ARCH_3_10 in HW feature.

Looks like that line went missing in third/final version of AES-GCM patch at https://dev.gnupg.org/T5700

Apr 1 2022, 8:51 AM · ppc, libgcrypt