I want to build a service that will be able to handle:
- a low volume of requests
- a high compute cost for each request
- but where the high compute cost can be parallelized.
My understanding of a pre-fork server is that something like the following happens:
- server starts
- Gunicorn creates multiple OS processes, also called workers, ready to accept requests
- request comes in. Nginx forwards to Gunicorn. Gunicorn sends to one of the workers.
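The setup described above can be sketched as a Gunicorn invocation; the module and app names (`app:app`) and the bind address are placeholders, not taken from the question:

```shell
# Nginx proxies to Gunicorn, which pre-forks 4 worker processes.
# "app:app" means module app.py exposing a Flask object named app.
gunicorn --workers 4 --bind 127.0.0.1:8000 app:app
```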
What I want to understand is what happens if, in my Flask code, when handling the request, I have this:
from multiprocessing import Pool as ProcessPool

with ProcessPool(4) as pool:
    pool.map(some_expensive_function, some_data)
In particular:
- Will additional OS processes be started? Will the speedup be what I expect? (I.e., similar to if I ran the ProcessPool outside of a Flask production context?) If Gunicorn created 4 web workers, will there now be 7 OS processes running? 9? Is there a risk of making too many? Does Gunicorn assume that each worker will not fork or does it not care?
- If a web-worker dies or is killed after starting the ProcessPool, will it be closed by the context manager properly?
- Is this a sane thing to do? What are the alternatives?
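On the first question, a standalone sketch (outside Flask entirely, with a hypothetical `worker_pid` helper) makes it visible that `Pool` does start additional OS processes, each with its own PID distinct from the parent's:

```python
import os
from multiprocessing import Pool

def worker_pid(_):
    # Each pool worker is a real OS process with its own PID.
    return os.getpid()

if __name__ == "__main__":
    with Pool(4) as pool:
        pids = set(pool.map(worker_pid, range(16)))
    # At most 4 distinct worker PIDs, and the parent is not among them.
    print(len(pids) <= 4 and os.getpid() not in pids)  # prints: True
```

Inside a Gunicorn worker the same thing happens, which is why 4 web workers each opening a `Pool(4)` could mean up to 20 OS processes in total (1 master + 4 workers + 4×4 pool children).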
from Python multiprocessing within Flask request with Gunicorn + Nginx