Improve ops per second

Same problem here, I got maximum of 3000 sets/second using a single process, obviously using more threads can increase the performance, but that doesn’t suit my need.
I have much (MUCH) better results using SDK 1.4, do you want to give it a try?