Skip to content

zetabyte

Loss Functions Explained: Understand the Maths in Just 2 Minutes Each Kanwal Mehreen MachineLearningMastery.com

​I must say, with the ongoing hype around machine learning, a lot of people jump straight to the application side without really understanding how things work behind the scenes. I must say, with the ongoing hype around machine learning, a lot of people jump straight to… Read More »Loss Functions Explained: Understand the Maths in Just 2 Minutes Each Kanwal Mehreen MachineLearningMastery.com

Beyond Text Compression: Evaluating Tokenizers Across Scales Apple Machine Learning Research

​[[{“value”:”Tokenizer design significantly impacts language model performance, yet evaluating tokenizer quality remains challenging. While text compression has emerged as a common intrinsic metric, recent work questions its reliability as a quality indicator. We investigate whether evaluating tokenizers on smaller models (350M parameters) reliably predicts their… Read More »Beyond Text Compression: Evaluating Tokenizers Across Scales Apple Machine Learning Research

Proxy-FDA: Proxy-Based Feature Distribution Alignment for Fine-Tuning Vision Foundation Models Without Forgetting Apple Machine Learning Research

​Vision foundation models pre-trained on massive data encode rich representations of real-world concepts, which can be adapted to downstream tasks by fine-tuning. However, fine-tuning foundation models on one task often leads to the issue of concept forgetting on other tasks. Recent methods of robust fine-tuning… Read More »Proxy-FDA: Proxy-Based Feature Distribution Alignment for Fine-Tuning Vision Foundation Models Without Forgetting Apple Machine Learning Research

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Apple Machine Learning Research

​[[{“value”:”Recent generations of frontier language models have introduced Large Reasoning Models (LRMs) that generate detailed thinking processes before providing answers. While these models demonstrate improved performance on reasoning benchmarks, their fundamental capabilities, scal- ing properties, and limitations remain insufficiently understood. Current evaluations primarily fo- cus… Read More »The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Apple Machine Learning Research

Improve Vision Language Model Chain-of-thought Reasoning Apple Machine Learning Research

​[[{“value”:”Chain-of-thought (CoT) reasoning in vision language models (VLMs) is crucial for improving interpretability and trustworthiness. However, current training recipes often relying on datasets dominated by short annotations with minimal rationales. In this work, we show that training VLM on short answers leads to poor generalization… Read More »Improve Vision Language Model Chain-of-thought Reasoning Apple Machine Learning Research

Voice Quality Dimensions as Interpretable Primitives for Speaking Style for Atypical Speech and Affect Apple Machine Learning Research

​Perceptual voice quality dimensions describe key characteristics of atypical speech and other speech modulations. Here we develop and evaluate voice quality models for seven voice and speech dimensions (intelligibility, imprecise consonants, harsh voice, naturalness, monoloudness, monopitch, and breathiness). Probes were trained on the public Speech… Read More »Voice Quality Dimensions as Interpretable Primitives for Speaking Style for Atypical Speech and Affect Apple Machine Learning Research

Impel enhances automotive dealership customer experience with fine-tuned LLMs on Amazon SageMaker Nicholas Scozzafava AWS Machine Learning Blog

​[[{“value”:” This post is co-written with Tatia Tsmindashvili, Ana Kolkhidashvili, Guram Dentoshvili, Dachi Choladze from Impel. Impel transforms automotive retail through an AI-powered customer lifecycle management solution that drives dealership operations and customer interactions. Their core product, Sales AI, provides all-day personalized customer engagement, handling… Read More »Impel enhances automotive dealership customer experience with fine-tuned LLMs on Amazon SageMaker Nicholas Scozzafava AWS Machine Learning Blog

How climate tech startups are building foundation models with Amazon SageMaker HyperPod Ilan Gleiser AWS Machine Learning Blog

​[[{“value”:” Climate tech startups are companies that use technology and innovation to address the climate crisis, with a primary focus on either reducing greenhouse gas emissions or helping society adapt to climate change impacts. Their unifying mission is to create scalable solutions that accelerate the… Read More »How climate tech startups are building foundation models with Amazon SageMaker HyperPod Ilan Gleiser AWS Machine Learning Blog