Bill Allombert on Sat, 12 Oct 2019 13:51:54 +0200 |
[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]
Re: Faster digits and fromdigits in base 2^k, and FlxqM_mul_Kronecker |
On Thu, Oct 08, 2015 at 05:51:07PM +0200, Peter Bruin wrote: > Hi Bill, > >> My motivation for this was mostly to speed up multiplication of matrices > >> over non-prime finite fields of small characteristic (FlxqM). I am > >> attaching a second patch (FlxqM_mul_Kronecker.patch) that enhances > >> Kronecker multiplication for FlxqM to use the above functions for > >> packing into and unpacking from integer matrices in cases where it > >> allows us to pack the entries in fewer words per entry than the > >> (half-)word-aligned packing that is used now. > > > > Hello Peter, > > > > Thanks for your patch! > > > > Is that method applicable to ZX_mul and Flx_mul ? > > (What about ZX_eval2BILspec/Z_mod2BIL_ZX (and > > Flx_mulspec_mulii_inflate/Z_mod2BIL_Flx)) ? > > In principle it is, but it is harder to make it pay off because the > overhead remains non-negligible. I spent some time trying to apply this > method to Flx_mul; so far I haven't managed to get a systematic speed-up > over the existing code, but I am still optimistic that some improvement > may be possible. As for ZX_mul, I haven't looked at that yet. I am working on it for Flx_mul/Flx_sqr. There still some tuning to do. Thanks again for this patch! Cheers, Bill