Skip to content

zetabyte

Cost-effective AI image generation with PixArt-Σ inference on AWS Trainium and AWS Inferentia Achintya Pinninti AWS Machine Learning Blog

​[[{“value”:” PixArt-Sigma is a diffusion transformer model that is capable of image generation at 4k resolution. This model shows significant improvements over previous generation PixArt models like Pixart-Alpha and other diffusion models through dataset and architectural improvements. AWS Trainium and AWS Inferentia are purpose-built AI… Read More »Cost-effective AI image generation with PixArt-Σ inference on AWS Trainium and AWS Inferentia Achintya Pinninti AWS Machine Learning Blog

Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2 Kanwaljit Khurmi AWS Machine Learning Blog

​[[{“value”:” This post is the second part of the DeepSeek series focusing on model customization with Amazon SageMaker HyperPod recipes (or recipes for brevity). In Part 1, we demonstrated the performance and ease of fine-tuning DeepSeek-R1 distilled models using these recipes. In this post, we… Read More »Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2 Kanwaljit Khurmi AWS Machine Learning Blog

Build a financial research assistant using Amazon Q Business and Amazon QuickSight for generative AI–powered insights Vishnu Elangovan AWS Machine Learning Blog

​[[{“value”:” According to a Gartner survey in 2024, 58% of finance functions have adopted generative AI, marking a significant rise in adoption. Among these, four primary use cases have emerged as especially prominent: intelligent process automation, anomaly detection, analytics, and operational assistance. In this post,… Read More »Build a financial research assistant using Amazon Q Business and Amazon QuickSight for generative AI–powered insights Vishnu Elangovan AWS Machine Learning Blog

AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms Google DeepMind Blog

​New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators  Read… Read More »AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms Google DeepMind Blog

Securing Amazon Bedrock Agents: A guide to safeguarding against indirect prompt injections Hina Chaudhry AWS Machine Learning Blog

​[[{“value”:” Generative AI tools have transformed how we work, create, and process information. At Amazon Web Services (AWS), security is our top priority. Therefore, Amazon Bedrock provides comprehensive security controls and best practices to help protect your applications and data. In this post, we explore… Read More »Securing Amazon Bedrock Agents: A guide to safeguarding against indirect prompt injections Hina Chaudhry AWS Machine Learning Blog

Build scalable containerized RAG based generative AI applications in AWS using Amazon EKS with Amazon Bedrock Kanishk Mahajan AWS Machine Learning Blog

​[[{“value”:” Generative artificial intelligence (AI) applications are commonly built using a technique called Retrieval Augmented Generation (RAG) that provides foundation models (FMs) access to additional data they didn’t have during training. This data is used to enrich the generative AI prompt to deliver more context-specific… Read More »Build scalable containerized RAG based generative AI applications in AWS using Amazon EKS with Amazon Bedrock Kanishk Mahajan AWS Machine Learning Blog

How Hexagon built an AI assistant using AWS generative AI services Julio P. Roque AWS Machine Learning Blog

​[[{“value”:” This post was co-written with Julio P. Roque Hexagon ALI. Recognizing the transformative benefits of generative AI for enterprises, we at Hexagon’s Asset Lifecycle Intelligence division sought to enhance how users interact with our Enterprise Asset Management (EAM) products. Understanding these advantages, we partnered… Read More »How Hexagon built an AI assistant using AWS generative AI services Julio P. Roque AWS Machine Learning Blog