Skip to content

zetabyte

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization Apple Machine Learning Research

​Direct Preference Optimization (DPO) has been widely adopted for preference alignment of Large Language Models (LLMs) due to its simplicity and effectiveness. However, DPO is derived as a bandit problem in which the whole response is treated as a single arm, ignoring the importance differences… Read More »TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization Apple Machine Learning Research

Generating and Visualizing Context Vectors in Transformers Muhammad Asad Iqbal Khan MachineLearningMastery.com

​This post is divided into three parts; they are: • Understanding Context Vectors • Visualizing Context Vectors from Different Layers • Visualizing Attention Patterns Unlike traditional word embeddings (such as Word2Vec or GloVe), which assign a fixed vector to each word regardless of context, transformer… Read More »Generating and Visualizing Context Vectors in Transformers Muhammad Asad Iqbal Khan MachineLearningMastery.com

DolphinGemma: How Google AI is helping decode dolphin communication Google DeepMind Blog

​DolphinGemma, a large language model developed by Google, is helping scientists study how dolphins communicate — and hopefully find out what they’re saying, too. DolphinGemma, a large language model developed by Google, is helping scientists study how dolphins communicate — and hopefully find out what they’re… Read More »DolphinGemma: How Google AI is helping decode dolphin communication Google DeepMind Blog

Build multi-agent systems with LangGraph and Amazon Bedrock Jagdeep Singh Soni AWS Machine Learning Blog

​[[{“value”:” Large language models (LLMs) have raised the bar for human-computer interaction where the expectation from users is that they can communicate with their applications through natural language. Beyond simple language understanding, real-world applications require managing complex workflows, connecting to external data, and coordinating multiple… Read More »Build multi-agent systems with LangGraph and Amazon Bedrock Jagdeep Singh Soni AWS Machine Learning Blog

Dynamic text-to-SQL for enterprise workloads with Amazon Bedrock Agents Jiwon Yeom AWS Machine Learning Blog

​[[{“value”:” Generative AI enables us to accomplish more in less time. Text-to-SQL empowers people to explore data and draw insights using natural language, without requiring specialized database knowledge. Amazon Web Services (AWS) has helped many customers connect this text-to-SQL capability with their own data, which… Read More »Dynamic text-to-SQL for enterprise workloads with Amazon Bedrock Agents Jiwon Yeom AWS Machine Learning Blog

Understanding Aggregate Trends for Apple Intelligence Using Differential Privacy Apple Machine Learning Research

​[[{“value”:”At Apple, we believe privacy is a fundamental human right. And we believe in giving our users a great experience while protecting their privacy. For years, we’ve used techniques like differential privacy as part of our opt-in device analytics program. This lets us gain insights… Read More »Understanding Aggregate Trends for Apple Intelligence Using Differential Privacy Apple Machine Learning Research

FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations Apple Machine Learning Research

​[[{“value”:”This paper was accepted at the Workshop on Foundation Models in the Wild at ICLR 2025. Visual understanding is inherently contextual – what we focus on in an image depends on the task at hand. For instance, given an image of a person holding a… Read More »FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations Apple Machine Learning Research

Building an AIOps chatbot with Amazon Q Business custom plugins Upendra V AWS Machine Learning Blog

​[[{“value”:” Many organizations rely on multiple third-party applications and services for different aspects of their operations, such as scheduling, HR management, financial data, customer relationship management (CRM) systems, and more. However, these systems often exist in silos, requiring users to manually navigate different interfaces, switch… Read More »Building an AIOps chatbot with Amazon Q Business custom plugins Upendra V AWS Machine Learning Blog