News Feed - Page 89 of 954 - PhD Studio January 13, 2025

Researchers at KAUST Use Anderson Exploitation to Maximize GPU Efficiency with Greater Model Accuracy and Generalizability Adeeba Alam Ansari Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Escalation in AI implies an increased infrastructure expenditure. The massive and multidisciplinary research exerts economic pressure on institutions as high-performance computing (HPC) costs an arm and a leg. HPC is financially draining and critically impacts energy consumption and the environment. By 2030, AI is… Read More »Researchers at KAUST Use Anderson Exploitation to Maximize GPU Efficiency with Greater Model Accuracy and Generalizability Adeeba Alam Ansari Artificial Intelligence Category – MarkTechPost

KVSharer: A Plug-and-Play Machine Learning Method that Shares the KV Cache between Layers to Achieve Layer-Wise Compression Divyesh Vitthal Jawkhede Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” In recent times, large language models (LLMs) built on the Transformer architecture have shown remarkable abilities across a wide range of tasks. However, these impressive capabilities usually come with a significant increase in model size, resulting in substantial GPU memory costs during inference. The… Read More »KVSharer: A Plug-and-Play Machine Learning Method that Shares the KV Cache between Layers to Achieve Layer-Wise Compression Divyesh Vitthal Jawkhede Artificial Intelligence Category – MarkTechPost

iP-VAE: A Spiking Neural Network for Iterative Bayesian Inference and ELBO Maximization Sana Hassan Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” The Evidence Lower Bound (ELBO) is a key objective for training generative models like Variational Autoencoders (VAEs). It parallels neuroscience, aligning with the Free Energy Principle (FEP) for brain function. This shared objective hints at a potential unified machine learning and neuroscience theory. However,… Read More »iP-VAE: A Spiking Neural Network for Iterative Bayesian Inference and ELBO Maximization Sana Hassan Artificial Intelligence Category – MarkTechPost

Enhancing Artificial Intelligence Reasoning by Addressing Softmax Limitations in Sharp Decision-Making with Adaptive Temperature Techniques Tanya Malhotra Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” The ability to generate accurate conclusions based on data inputs is essential for strong reasoning and dependable performance in Artificial Intelligence (AI) systems. The softmax function is a crucial element that supports this functionality in modern AI models. A major component of differentiable query-key… Read More »Enhancing Artificial Intelligence Reasoning by Addressing Softmax Limitations in Sharp Decision-Making with Adaptive Temperature Techniques Tanya Malhotra Artificial Intelligence Category – MarkTechPost

This AI Paper Explores New Ways to Utilize and Optimize Multimodal RAG System for Industrial Applications Nikhil Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Multimodal Retrieval Augmented Generation (RAG) technology has opened new possibilities for artificial intelligence (AI) applications in manufacturing, engineering, and maintenance industries. These fields rely heavily on documents that combine complex text and images, including manuals, technical diagrams, and schematics. AI systems capable of interpreting… Read More »This AI Paper Explores New Ways to Utilize and Optimize Multimodal RAG System for Industrial Applications Nikhil Artificial Intelligence Category – MarkTechPost

Promptfoo: An AI Tool For Testing, Evaluating and Red-Teaming LLM apps Sajjad Ansari Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Promptfoo is a command-line interface (CLI) and library designed to enhance the evaluation and security of large language model (LLM) applications. It enables users to create robust prompts, model configurations, and retrieval-augmented generation (RAG) systems through use-case-specific benchmarks. This tool supports automated red teaming… Read More »Promptfoo: An AI Tool For Testing, Evaluating and Red-Teaming LLM apps Sajjad Ansari Artificial Intelligence Category – MarkTechPost

Llama-3-Nanda-10B-Chat: A 10B-Parameter Open Generative Large Language Model for Hindi with Cutting-Edge NLP Capabilities and Optimized Tokenization Asif Razzaq Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Natural Language Processing (NLP) focuses on building computational models to interpret and generate human language. With advancements in transformer-based models, large language models (LLMs) have shown impressive English NLP capabilities, enabling applications ranging from text summarization and sentiment analysis to complex reasoning tasks. However,… Read More »Llama-3-Nanda-10B-Chat: A 10B-Parameter Open Generative Large Language Model for Hindi with Cutting-Edge NLP Capabilities and Optimized Tokenization Asif Razzaq Artificial Intelligence Category – MarkTechPost

AMD Open Sources AMD OLMo: A Fully Open-Source 1B Language Model Series that is Trained from Scratch by AMD on AMD Instinct™ MI250 GPUs Asif Razzaq Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” In the rapidly evolving world of artificial intelligence and machine learning, the demand for powerful, flexible, and open-access solutions has grown immensely. Developers, researchers, and tech enthusiasts frequently face challenges when it comes to leveraging cutting-edge technology without being constrained by closed ecosystems. Many… Read More »AMD Open Sources AMD OLMo: A Fully Open-Source 1B Language Model Series that is Trained from Scratch by AMD on AMD Instinct™ MI250 GPUs Asif Razzaq Artificial Intelligence Category – MarkTechPost

Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock Yanyan Zhang AWS Machine Learning Blog

by

[[{“value”:” Fine-tuning is a powerful approach in natural language processing (NLP) and generative AI, allowing businesses to tailor pre-trained large language models (LLMs) for specific tasks. This process involves updating the model’s weights to improve its performance on targeted applications. By fine-tuning, the LLM can… Read More »Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock Yanyan Zhang AWS Machine Learning Blog

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock Kyle Blocksom AWS Machine Learning Blog

by

[[{“value”:” As enterprises increasingly embrace generative AI , they face challenges in managing the associated costs. With demand for generative AI applications surging across projects and multiple lines of business, accurately allocating and tracking spend becomes more complex. Organizations need to prioritize their generative AI… Read More »Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock Kyle Blocksom AWS Machine Learning Blog

« Previous
1
…
87
88
89
90
91
…
954
Next »