Skip to content

zetabyte

Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs Apple Machine Learning Research

​Current Large Language Models (LLMs) are predominantly designed with English as the primary language, and even the few that are multilingual tend to exhibit strong English-centric biases. Much like speakers who might produce awkward expressions when learning a second language, LLMs often generate unnatural outputs… Read More »Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs Apple Machine Learning Research

How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod Tony Wong AWS Machine Learning Blog

​[[{“value”:” This post is co-written with Ken Tsui, Edward Tsoi and Mickey Yip from Apoidea Group. The banking industry has long struggled with the inefficiencies associated with repetitive processes such as information extraction, document review, and auditing. These tasks, which require significant human resources, slow… Read More »How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod Tony Wong AWS Machine Learning Blog

How Qualtrics built Socrates: An AI platform powered by Amazon SageMaker and Amazon Bedrock Jay Kshirsagar, Ronald Quan AWS Machine Learning Blog

​[[{“value”:” This post is co-authored by Jay Kshirsagar and Ronald Quan from Qualtrics. The content and opinions in this post are those of the third-party author and AWS is not responsible for the content or accuracy of this post. Qualtrics, founded in 2002, is a… Read More »How Qualtrics built Socrates: An AI platform powered by Amazon SageMaker and Amazon Bedrock Jay Kshirsagar, Ronald Quan AWS Machine Learning Blog

Vxceed secures transport operations with Amazon Bedrock Deepika Kumar AWS Machine Learning Blog

​[[{“value”:” Vxceed delivers SaaS solutions across industries such as consumer packaged goods (CPG), transportation, and logistics. Its modular environments include Lighthouse for CPG demand and supply chains, GroundCentric247 for airline and airport operations, and LimoConnect247 and FleetConnect247 for passenger transport. These solutions support a wide… Read More »Vxceed secures transport operations with Amazon Bedrock Deepika Kumar AWS Machine Learning Blog

Cost-effective AI image generation with PixArt-Σ inference on AWS Trainium and AWS Inferentia Achintya Pinninti AWS Machine Learning Blog

​[[{“value”:” PixArt-Sigma is a diffusion transformer model that is capable of image generation at 4k resolution. This model shows significant improvements over previous generation PixArt models like Pixart-Alpha and other diffusion models through dataset and architectural improvements. AWS Trainium and AWS Inferentia are purpose-built AI… Read More »Cost-effective AI image generation with PixArt-Σ inference on AWS Trainium and AWS Inferentia Achintya Pinninti AWS Machine Learning Blog

Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2 Kanwaljit Khurmi AWS Machine Learning Blog

​[[{“value”:” This post is the second part of the DeepSeek series focusing on model customization with Amazon SageMaker HyperPod recipes (or recipes for brevity). In Part 1, we demonstrated the performance and ease of fine-tuning DeepSeek-R1 distilled models using these recipes. In this post, we… Read More »Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2 Kanwaljit Khurmi AWS Machine Learning Blog

Build a financial research assistant using Amazon Q Business and Amazon QuickSight for generative AI–powered insights Vishnu Elangovan AWS Machine Learning Blog

​[[{“value”:” According to a Gartner survey in 2024, 58% of finance functions have adopted generative AI, marking a significant rise in adoption. Among these, four primary use cases have emerged as especially prominent: intelligent process automation, anomaly detection, analytics, and operational assistance. In this post,… Read More »Build a financial research assistant using Amazon Q Business and Amazon QuickSight for generative AI–powered insights Vishnu Elangovan AWS Machine Learning Blog

AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms Google DeepMind Blog

​New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators  Read… Read More »AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms Google DeepMind Blog