bn_sqr8x_internal function consumes a lot of CPU time

Patrick Steuer psteuer at
Mon Mar 4 08:45:34 UTC 2019

It's an x86_64 assembly subroutine of bn_mul_mont, which implements
Montgomery modular multiplication. It's heavily used in RSA computations.
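
To illustrate why this shows up so prominently in profiles, here is a
minimal single-word sketch of Montgomery multiplication and the
square-and-multiply loop built on it. This is not OpenSSL's code: the
function names (mont_neg_inv, mont_reduce, mont_mul, to_mont, mont_pow)
are made up for the example, and it assumes a 32-bit odd modulus with
R = 2^32, whereas the real routine works on multi-word bignums in
assembly.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: returns -n^{-1} mod 2^32 for odd n. */
uint32_t mont_neg_inv(uint32_t n)
{
    assert(n & 1u);            /* Montgomery reduction needs an odd modulus */
    uint32_t x = n;            /* n*n == 1 (mod 8), so x is right to 3 bits */
    for (int i = 0; i < 5; i++)
        x *= 2u - n * x;       /* Newton step: doubles the correct bits */
    return (uint32_t)0 - x;
}

/* Montgomery reduction (REDC): returns t * R^{-1} mod n, for t < n * R. */
uint32_t mont_reduce(uint64_t t, uint32_t n, uint32_t n0)
{
    uint32_t m = (uint32_t)t * n0;       /* makes t + m*n divisible by R */
    uint64_t mn = (uint64_t)m * n;
    uint64_t sum = t + mn;               /* may wrap past 2^64 */
    uint64_t carry = sum < t;            /* recover the lost top bit */
    uint64_t r = (sum >> 32) | (carry << 32);
    if (r >= n)
        r -= n;                          /* single conditional subtraction */
    return (uint32_t)r;
}

/* Product of two values that are already in Montgomery form (both < n). */
uint32_t mont_mul(uint32_t a, uint32_t b, uint32_t n, uint32_t n0)
{
    return mont_reduce((uint64_t)a * b, n, n0);
}

/* Convert a (< n) into Montgomery form: a * R mod n. */
uint32_t to_mont(uint32_t a, uint32_t n)
{
    return (uint32_t)(((uint64_t)a << 32) % n);
}

/* base^exp mod n by square-and-multiply; n must be odd and > 1. */
uint32_t mont_pow(uint32_t base, uint32_t exp, uint32_t n)
{
    uint32_t n0 = mont_neg_inv(n);
    uint32_t x = to_mont(base % n, n);
    uint32_t acc = to_mont(1u % n, n);
    while (exp) {
        if (exp & 1u)
            acc = mont_mul(acc, x, n, n0);
        x = mont_mul(x, x, n, n0);       /* one squaring per exponent bit */
        exp >>= 1;
    }
    return mont_reduce(acc, n, n0);      /* leave Montgomery form */
}
```

Calling mont_pow(base, exp, n) with an odd n performs one Montgomery
squaring for every bit of the exponent. With exponents thousands of bits
long, as in an RSA private-key operation, the squaring routine ends up
being where most of the time goes.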

Just set a breakpoint on bn_sqr8x_internal in the debugger and have a look
at the call stack to see where it is called from.

