NLP Model Deployment using Amazon Sagemaker

JUNE, 2020
service


A startup in France uses linguistic NLP models to create disruptive applicaiton in fields such as education. It needed to run Machine Learning inferences at scale, anticipating servicing thousands of active, concurrent clients in production.

TECHNOLOGIES USED

Amazon Sagemaker, PyTorch, HuggingFace Transformers, AWS API Gateway, Amazon CloudFront, TorchServe

A startup in France uses linguistic NLP models to create disruptive applicaiton in fields such as education. It needed to run Machine Learning inferences at scale, anticipating servicing thousands of active, concurrent clients in production.

Minmini implemented a solution where the client's HuggingFace Transformer-based NLP models can be deployed at scale in an Amazon Sagemaker Endpoint using TorchServe. We also made the endpoint consumable from public, non-AWS clients via Amazon CloudFront and AWS API Gateway.