Most forums online are suggesting using Flask to create an endpoint to serve ML/Deep learning models into production. This requires to use Gunicorn and NGINX with loadbalancer. How are companies deploying ML models written in Pytorch/Tensorlow into production?
Do you find the current process tedious? Is there a good managed solution out there?