zetabyte

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Apple Machine Learning Research

by zetabyte

Diffusion models have become the dominant approach for visual generation. They are trained by denoising a Markovian process which gradually adds noise to the input. We argue that the Markovian property limits the model’s ability to fully utilize the generation trajectory, leading to inefficiencies during… Read More »DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Apple Machine Learning Research

International Conference on Learning Representations (ICLR) 2025 Apple Machine Learning Research

by zetabyte

Post Content Read More

Scaling Diffusion Language Models via Adaptation from Autoregressive Models Apple Machine Learning Research

by zetabyte

Diffusion Language Models (DLMs) have emerged as a promising new paradigm for text generative modeling, potentially addressing limitations of autoregressive (AR) models. However, current DLMs have been studied at a smaller scale compared to their AR counterparts and lack fair comparison on language modeling benchmarks.… Read More »Scaling Diffusion Language Models via Adaptation from Autoregressive Models Apple Machine Learning Research

Clario enhances the quality of the clinical trial documentation process with Amazon Bedrock Kim Nguyen, Shyam Banuprakash AWS Machine Learning Blog

by zetabyte

[[{“value”:” This post is co-written with Kim Nguyen and Shyam Banuprakash from Clario. Clario is a leading provider of endpoint data solutions to the clinical trials industry, generating high-quality clinical evidence for life sciences companies seeking to bring new therapies to patients. Since Clario’s founding… Read More »Clario enhances the quality of the clinical trial documentation process with Amazon Bedrock Kim Nguyen, Shyam Banuprakash AWS Machine Learning Blog

Optimizing Mixtral 8x7B on Amazon SageMaker with AWS Inferentia2 Lior Sadan AWS Machine Learning Blog

by zetabyte

[[{“value”:” Organizations are constantly seeking ways to harness the power of advanced large language models (LLMs) to enable a wide range of applications such as text generation, summarizationquestion answering, and many others. As these models grow more powerful and capable, deploying them in production environments… Read More »Optimizing Mixtral 8x7B on Amazon SageMaker with AWS Inferentia2 Lior Sadan AWS Machine Learning Blog

Elevate business productivity with Amazon Q and Amazon Connect Sujatha Dantuluri AWS Machine Learning Blog

by zetabyte

[[{“value”:” Modern banking faces dual challenges: delivering rapid loan processing while maintaining robust security against sophisticated fraud. Amazon Q Business provides AI-driven analysis of regulatory requirements and lending patterns. Additionally, you can now report fraud from the same interface with a custom plugin capability that… Read More »Elevate business productivity with Amazon Q and Amazon Connect Sujatha Dantuluri AWS Machine Learning Blog

Generate videos in Gemini and Whisk with Veo 2 Google DeepMind Blog

by zetabyte

Transform text-based prompts into high-resolution eight-second videos in Gemini Advanced and use Whisk Animate to turn images into eight-second animated clips. Transform text-based prompts into high-resolution eight-second videos in Gemini Advanced and use Whisk Animate to turn images into eight-second animated clips. Read More

A Field Guide to Rapidly Improving AI Products Hamel Husain AI & ML – Radar

by zetabyte

[[{“value”:” Most AI teams focus on the wrong things. Here’s a common scene from my consulting work: AI TEAMHere’s our agent architecture—we’ve got RAG here, a router there, and we’re using this new framework for… ME[Holding up my hand to pause the enthusiastic tech lead]Can… Read More »A Field Guide to Rapidly Improving AI Products Hamel Husain AI & ML – Radar

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing Apple Machine Learning Research

by zetabyte

Diffusion transformers have been widely adopted for text-to-image synthesis. While scaling these models up to billions of parameters shows promise, the effectiveness of scaling beyond current sizes remains underexplored and challenging. By explicitly exploiting the computational heterogeneity of image generations, we develop a new family… Read More »EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing Apple Machine Learning Research

CoMotion: Concurrent Multi-Person 3D Motion Apple Machine Learning Research

by zetabyte

We introduce an approach for detecting and tracking detailed 3D poses of multiple people from a single monocular camera stream. Our system maintains temporally coherent predictions in crowded scenes filled with difficult poses and occlusions. Our model performs both strong per-frame detection and a learned… Read More »CoMotion: Concurrent Multi-Person 3D Motion Apple Machine Learning Research

« Previous
1
…
100
101
102
103
104
…
167
Next »