Unraveling Transformer Optimization: A Hessian-Based Explanation for Adam's Superiority over SGD

By Sajjad Ansari — Artificial Intelligence Category, MarkTechPost
Large Language Models (LLMs) based on Transformer architectures have revolutionized AI development. However, the complexity of their training process remains poorly understood. A significant challenge in this domain is the inconsistency in optimizer performance. While the Adam optimizer has become the standard for training…
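As background for the Adam-vs-SGD comparison the article discusses, the sketch below contrasts the two update rules on a deliberately ill-conditioned quadratic, where curvature (the Hessian) is large. This is an illustrative toy example, not the paper's analysis: the function, learning rates, and hyperparameters are assumptions chosen for demonstration. SGD must use a tiny learning rate to stay stable, while Adam's per-parameter normalization by the second-moment estimate makes it far less sensitive to the scale of the curvature.

```python
import math

def sgd_step(w, grad, lr=0.001):
    # Plain SGD: scale the raw gradient by a fixed learning rate.
    return w - lr * grad

def adam_step(w, grad, state, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
    # Adam: maintain exponential moving averages of the gradient (m)
    # and squared gradient (v), apply bias correction, and normalize
    # the step per parameter. (Standard Adam; hyperparameters assumed.)
    state["t"] += 1
    state["m"] = b1 * state["m"] + (1 - b1) * grad
    state["v"] = b2 * state["v"] + (1 - b2) * grad ** 2
    m_hat = state["m"] / (1 - b1 ** state["t"])
    v_hat = state["v"] / (1 - b2 ** state["t"])
    return w - lr * m_hat / (math.sqrt(v_hat) + eps)

# Toy objective f(w) = 0.5 * c * w^2 with large curvature c,
# so the gradient is c * w and the (scalar) Hessian is c.
c = 100.0
w_sgd = w_adam = 1.0
state = {"m": 0.0, "v": 0.0, "t": 0}
for _ in range(50):
    w_sgd = sgd_step(w_sgd, c * w_sgd)          # needs lr ~ 1/c to stay stable
    w_adam = adam_step(w_adam, c * w_adam, state)  # tolerates a much larger lr
```

After 50 steps both iterates shrink toward the minimum at 0, but SGD's usable learning rate is capped at roughly the inverse of the curvature, which is the kind of Hessian-dependent behavior the article's framing points at.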