Skip to content

zetabyte

Optimizing document AI and structured outputs by fine-tuning Amazon Nova Models and on-demand inference Sharon Li Artificial Intelligence

​[[{“value”:” Multimodal fine-tuning represents a powerful approach for customizing vision large language models (LLMs) to excel at specific tasks that involve both visual and textual information. Although base multimodal models offer impressive general capabilities, they often fall short when faced with specialized visual tasks, domain-specific… Read More »Optimizing document AI and structured outputs by fine-tuning Amazon Nova Models and on-demand inference Sharon Li Artificial Intelligence

Training Software Engineering Agents and Verifiers with SWE-Gym Apple Machine Learning Research

​We present SWE-Gym, the first environment for training real-world software engineering (SWE) agents. SWE-Gym contains 2,438 real-world Python task instances, each comprising a codebase with an executable runtime environment, unit tests, and a task specified in natural language. We use SWE-Gym to train language model… Read More »Training Software Engineering Agents and Verifiers with SWE-Gym Apple Machine Learning Research

CPEP: Contrastive Pose-EMG Pre-training Enhances Gesture Generalization on EMG Signals Apple Machine Learning Research

​[[{“value”:”This paper was accepted at the Foundation Models for the Brain and Body Workshop at NeurIPS 2025. Hand gesture classification using high-quality structured data such as videos, images, and hand skeletons is a well-explored problem in computer vision. Leveraging low-power, cost-effective biosignals, e.g. surface electromyography… Read More »CPEP: Contrastive Pose-EMG Pre-training Enhances Gesture Generalization on EMG Signals Apple Machine Learning Research

Transforming enterprise operations: Four high-impact use cases with Amazon Nova Abhinav Bhargava Artificial Intelligence

​[[{“value”:” Since the launch of Amazon Nova at AWS re:Invent 2024, we have seen adoption trends across industries, with notable gains in operational efficiency, compliance, and customer satisfaction. With its capabilities in secure, multimodal AI and domain customization, Nova is enhancing workflows and enabling cost… Read More »Transforming enterprise operations: Four high-impact use cases with Amazon Nova Abhinav Bhargava Artificial Intelligence

Building smarter AI agents: AgentCore long-term memory deep dive Akarsha Sehwag Artificial Intelligence

​[[{“value”:” Building AI agents that remember user interactions requires more than just storing raw conversations. While Amazon Bedrock AgentCore short-term memory captures immediate context, the real challenge lies in transforming these interactions into persistent, actionable knowledge that spans across sessions. This is the information that… Read More »Building smarter AI agents: AgentCore long-term memory deep dive Akarsha Sehwag Artificial Intelligence

Configure and verify a distributed training cluster with AWS Deep Learning Containers on Amazon EKS Meryem Ozcelik Artificial Intelligence

​[[{“value”:” Training state-of-the-art large language models (LLMs) demands massive, distributed compute infrastructure. Meta’s Llama 3, for instance, ran on 16,000 NVIDIA H100 GPUs for over 30.84 million GPU hours. Amazon Elastic Kubernetes Service (Amazon EKS) is a managed service that simplifies the deployment, management, and… Read More »Configure and verify a distributed training cluster with AWS Deep Learning Containers on Amazon EKS Meryem Ozcelik Artificial Intelligence