Skip to content

zetabyte

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Apple Machine Learning Research

​Diffusion models have become the dominant approach for visual generation. They are trained by denoising a Markovian process which gradually adds noise to the input. We argue that the Markovian property limits the model’s ability to fully utilize the generation trajectory, leading to inefficiencies during… Read More »DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Apple Machine Learning Research

Scaling Diffusion Language Models via Adaptation from Autoregressive Models Apple Machine Learning Research

​Diffusion Language Models (DLMs) have emerged as a promising new paradigm for text generative modeling, potentially addressing limitations of autoregressive (AR) models. However, current DLMs have been studied at a smaller scale compared to their AR counterparts and lack fair comparison on language modeling benchmarks.… Read More »Scaling Diffusion Language Models via Adaptation from Autoregressive Models Apple Machine Learning Research

Clario enhances the quality of the clinical trial documentation process with Amazon Bedrock Kim Nguyen, Shyam Banuprakash AWS Machine Learning Blog

​[[{“value”:” This post is co-written with Kim Nguyen and Shyam Banuprakash from Clario. Clario is a leading provider of endpoint data solutions to the clinical trials industry, generating high-quality clinical evidence for life sciences companies seeking to bring new therapies to patients. Since Clario’s founding… Read More »Clario enhances the quality of the clinical trial documentation process with Amazon Bedrock Kim Nguyen, Shyam Banuprakash AWS Machine Learning Blog

Optimizing Mixtral 8x7B on Amazon SageMaker with AWS Inferentia2 Lior Sadan AWS Machine Learning Blog

​[[{“value”:” Organizations are constantly seeking ways to harness the power of advanced large language models (LLMs) to enable a wide range of applications such as text generation, summarizationquestion answering, and many others. As these models grow more powerful and capable, deploying them in production environments… Read More »Optimizing Mixtral 8x7B on Amazon SageMaker with AWS Inferentia2 Lior Sadan AWS Machine Learning Blog

Elevate business productivity with Amazon Q and Amazon Connect Sujatha Dantuluri AWS Machine Learning Blog

​[[{“value”:” Modern banking faces dual challenges: delivering rapid loan processing while maintaining robust security against sophisticated fraud. Amazon Q Business provides AI-driven analysis of regulatory requirements and lending patterns. Additionally, you can now report fraud from the same interface with a custom plugin capability that… Read More »Elevate business productivity with Amazon Q and Amazon Connect Sujatha Dantuluri AWS Machine Learning Blog

Generate videos in Gemini and Whisk with Veo 2 Google DeepMind Blog

​Transform text-based prompts into high-resolution eight-second videos in Gemini Advanced and use Whisk Animate to turn images into eight-second animated clips. Transform text-based prompts into high-resolution eight-second videos in Gemini Advanced and use Whisk Animate to turn images into eight-second animated clips.  Read More  

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing Apple Machine Learning Research

​Diffusion transformers have been widely adopted for text-to-image synthesis. While scaling these models up to billions of parameters shows promise, the effectiveness of scaling beyond current sizes remains underexplored and challenging. By explicitly exploiting the computational heterogeneity of image generations, we develop a new family… Read More »EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing Apple Machine Learning Research