Skip to content

Deploy large models at high performance using FasterTransformer on Amazon SageMaker Dhawalkumar Patel AWS Machine Learning Blog

  • by

​ Sparked by the release of large AI models like AlexaTM, GPT, OpenChatKit, BLOOM, GPT-J, GPT-NeoX, FLAN-T5, OPT, Stable Diffusion, and ControlNet, the popularity of generative AI has seen a recent boom. Businesses are beginning to evaluate new cutting-edge applications of the technology in text,… Read More »Deploy large models at high performance using FasterTransformer on Amazon SageMaker Dhawalkumar Patel AWS Machine Learning Blog

Deploy large models at high performance using FasterTransformer on Amazon SageMaker Dhawalkumar Patel AWS Machine Learning Blog

  • by

​ Sparked by the release of large AI models like AlexaTM, GPT, OpenChatKit, BLOOM, GPT-J, GPT-NeoX, FLAN-T5, OPT, Stable Diffusion, and ControlNet, the popularity of generative AI has seen a recent boom. Businesses are beginning to evaluate new cutting-edge applications of the technology in text,… Read More »Deploy large models at high performance using FasterTransformer on Amazon SageMaker Dhawalkumar Patel AWS Machine Learning Blog