How much memory does your program use? 

My code finds (p-1)! mod p^2 in polynomial time; this is achieved by determining (p-1)! mod p^2 for lots of primes at once. But for p~10^10 we don't have enough memory, so there is a slowdown for (very) large primes. It doesn't really matter how much memory you have got, but with more memory it runs faster. I have chosen a parameter so that it will use approx. 1 GB of RAM for p~10^10, determining (p-1)! mod p^2 for 10 million primes in a single search.
Last fiddled with by R. Gerbicz on 2011-08-29 at 16:52
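To illustrate what "lots of primes at once" can mean, here is a minimal Python sketch. It is not the method used in the code discussed in this thread, only a toy version of the idea: keep one running factorial modulo the product of all p^2 in the batch and read off each residue when n reaches p-1. The function name `factorials_mod_p2` is made up for this example. It also shows the memory trade-off, since the shared modulus grows with the batch size.

```python
from math import prod

def factorials_mod_p2(primes):
    """Return {p: (p-1)! mod p^2} for each prime in `primes`, in one shared pass."""
    primes = sorted(primes)
    # Shared modulus: every p^2 divides it, so reducing mod p^2 later is valid.
    # Its size (and so the memory used) grows with the number of primes.
    modulus = prod(p * p for p in primes)
    result = {}
    running = 1  # holds n! mod modulus
    n = 0
    for p in primes:
        # Advance the shared factorial from n! to (p-1)!.
        while n < p - 1:
            n += 1
            running = running * n % modulus
        result[p] = running % (p * p)
    return result

if __name__ == "__main__":
    for p, r in factorials_mod_p2([3, 5, 7, 11, 13]).items():
        print(p, r)
```

This toy version still does about p multiplications in total, so it is nowhere near fast enough for p~10^10; it only shows how one pass can serve a whole batch of primes.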

Either your algorithm is vastly different from mine (although it sounds similar) or you have written better optimized code. 

Mark, you missed p=3542985241 as a near-Wilson prime: (p-1)! == -1-74p mod p^2. Now I am at 6 billion.
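For readers unfamiliar with the notation: by Wilson's theorem (p-1)! == -1 mod p, so one can write (p-1)! == -1 + B*p mod p^2, and a near-Wilson prime is one where |B| is small (B = 0 gives a Wilson prime). A naive Python sketch of that definition follows; the name `wilson_b` is hypothetical and this is not the code from the thread.

```python
def wilson_b(p):
    """Return B in (-p/2, p/2] with (p-1)! == -1 + B*p (mod p^2), for prime p."""
    p2 = p * p
    f = 1
    for k in range(2, p):
        f = f * k % p2  # f ends up as (p-1)! mod p^2
    # Wilson's theorem gives f == -1 (mod p), so f + 1 is divisible by p.
    b = (f + 1) // p % p
    # Centre B around zero so that small negative values are visible.
    return b - p if 2 * b > p else b

if __name__ == "__main__":
    for p in (5, 7, 11, 13, 563):
        print(p, wilson_b(p))
```

This takes about p multiplications per prime, so it is only usable for small p; checking p=3542985241 this way in pure Python is not practical.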

I'll have to take a look. Either it is a code bug or the person who ran that range made a mistake. There were a number of ranges above 3e9 that I did not run and that was in one of them.

As I promised, here you can download and use my code: https://sites.google.com/site/robertgerbicz/wilson
I've finished the search up to 1e10. There were two new near-Wilson primes.

I d/l'd your code, but haven't looked at it in detail. I did see that you use GMP, so your speed is rather surprising considering GMP overhead.

