On Tue, 22 Oct 2013 17:37:41 +0200, Sönke Ludwig wrote:

It's not a thread per task, but a fiber per task (and fibers are
reused). Context switches of fibers are very fast and shouldn't be the
cause of the bad performance. If you can upload a benchmark/testcase
somewhere, I'd like to take a look at what takes so long.

Ah of course I read a lot more the vibed code now and I get it, thanks for explaining things to me :)

I can't provide the exact code/test benchmark I'm using as this code isn't open source, but I'll try to distill the performance issue into a smaller test case so I can send it along.