Zhimin Chen, Patrick Schaumont, "A Parallel Implementation of Montgomery Multiplication on Multicore Systems: Algorithm, Analysis, and Prototype," IEEE Transactions on Computers, vol. 60, no. 12, pp. 1692–1703, December 2011.
