Skip to content

Do LLMs Internally “Know” When They Follow Instructions? Apple Machine Learning Research

  • by

​[[{“value”:”This paper was accepted at the Foundation Model Interventions (MINT) Workshop at NeurIPS 2024. Instruction-following is crucial for building AI agents with large language models (LLMs), as these models must adhere strictly to user-provided guidelines. However, LLMs often fail to follow even simple instructions. To… Read More »Do LLMs Internally “Know” When They Follow Instructions? Apple Machine Learning Research

Faster Algorithms for User-Level Private Stochastic Convex Optimization Apple Machine Learning Research

  • by

​We study private stochastic convex optimization (SCO) under user-level differential privacy (DP) constraints. In this setting, there are nnn users, each possessing mmm data items, and we need to protect the privacy of each user’s entire collection of data items. Existing algorithms for user-level DP… Read More »Faster Algorithms for User-Level Private Stochastic Convex Optimization Apple Machine Learning Research

Transformation-Invariant Learning and Theoretical Guarantees for OOD Generalization Apple Machine Learning Research

  • by

​Learning with identical train and test distributions has been extensively investigated both practically and theoretically. Much remains to be understood, however, in statistical learning under distribution shifts. This paper focuses on a distribution shift setting where train and test distributions can be related by classes… Read More »Transformation-Invariant Learning and Theoretical Guarantees for OOD Generalization Apple Machine Learning Research

Do LLMs Estimate Uncertainty Well in Instruction-Following? Apple Machine Learning Research

  • by

​[[{“value”:”This paper was accepted at the Safe Generative AI Workshop (SGAIW) at NeurIPS 2024. Large language models (LLMs) could be valuable personal AI agents across various domains, provided they can precisely follow user instructions. However, recent studies have shown significant limitations in LLMs’ instruction-following capabilities,… Read More »Do LLMs Estimate Uncertainty Well in Instruction-Following? Apple Machine Learning Research

Private Stochastic Convex Optimization with Heavy Tails: Near-Optimality from Simple Reductions Apple Machine Learning Research

  • by

​We study the problem of differentially private stochastic convex optimization (DP-SCO) with heavy-tailed gradients, where we assume a kthk^{text{th}}kth-moment bound on the Lipschitz constants of sample functions, rather than a uniform bound. We propose a new reduction-based approach that enables us to obtain the first… Read More »Private Stochastic Convex Optimization with Heavy Tails: Near-Optimality from Simple Reductions Apple Machine Learning Research

Alibaba Research Introduces XiYan-SQL: A Multi-Generator Ensemble AI Framework for Text-to-SQL Asif Razzaq Artificial Intelligence Category – MarkTechPost

  • by

​[[{“value”:” Natural Language to SQL (NL2SQL) technology has emerged as a transformative aspect of natural language processing (NLP), enabling users to convert human language queries into Structured Query Language (SQL) statements. This development has made it easier for individuals who need more technical expertise to… Read More »Alibaba Research Introduces XiYan-SQL: A Multi-Generator Ensemble AI Framework for Text-to-SQL Asif Razzaq Artificial Intelligence Category – MarkTechPost

Racing into the future: How AWS DeepRacer fueled my AI and ML journey Matt Camp AWS Machine Learning Blog

  • by

​[[{“value”:” In 2018, I sat in the audience at AWS re:Invent as Andy Jassy announced AWS DeepRacer—a fully autonomous 1/18th scale race car driven by reinforcement learning. At the time, I knew little about AI or machine learning (ML). As an engineer transitioning from legacy… Read More »Racing into the future: How AWS DeepRacer fueled my AI and ML journey Matt Camp AWS Machine Learning Blog

Customize small language models on AWS with automotive terminology Bruno Pistone AWS Machine Learning Blog

  • by

​[[{“value”:” In the rapidly evolving world of AI, the ability to customize language models for specific industries has become more important. Although large language models (LLMs) are adept at handling a wide range of tasks with natural language, they excel at general purpose tasks as… Read More »Customize small language models on AWS with automotive terminology Bruno Pistone AWS Machine Learning Blog