News Feed - Page 112 of 957 - PhD Studio January 15, 2025

Scaling Diffusion transformers (DiT): An AI Framework for Optimizing Text-to-Image Models Across Compute Budgets Mohammad Asjad Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Large language models (LLMs) have demonstrated consistent scaling laws, revealing a power-law relationship between pretraining performance and computational resources. This relationship, expressed as C = 6ND (where C is compute, N is model size, and D is data quantity), has proven invaluable for optimizing… Read More »Scaling Diffusion transformers (DiT): An AI Framework for Optimizing Text-to-Image Models Across Compute Budgets Mohammad Asjad Artificial Intelligence Category – MarkTechPost

SecCodePLT: A Unified Platform for Evaluating Security Risks in Code GenAI Nikhil Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Code generation AI models (Code GenAI) are becoming pivotal in developing automated software demonstrating capabilities in writing, debugging, and reasoning about code. However, their ability to autonomously generate code raises concerns about security vulnerabilities. These models may inadvertently introduce insecure code, which could be… Read More »SecCodePLT: A Unified Platform for Evaluating Security Risks in Code GenAI Nikhil Artificial Intelligence Category – MarkTechPost

Google Unveils ‘Sample What You Can’t Compress’ in AI—A Game-Changer in High-Fidelity Image Compression Aswin Ak Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” The key challenge in the image autoencoding process is to create high-quality reconstructions that can retain fine details, especially when the image data has undergone compression. Traditional autoencoders, which rely on pixel-level losses such as mean squared error (MSE), tend to produce blurry outputs… Read More »Google Unveils ‘Sample What You Can’t Compress’ in AI—A Game-Changer in High-Fidelity Image Compression Aswin Ak Artificial Intelligence Category – MarkTechPost

SimLayerKV: An Efficient Solution to KV Cache Challenges in Large Language Models Asif Razzaq Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Recent advancements in large language models (LLMs) have significantly enhanced their ability to handle long contexts, making them highly effective in various tasks, from answering questions to complex reasoning. However, a critical bottleneck has emerged: the memory requirements for storing key-value (KV) caches escalate… Read More »SimLayerKV: An Efficient Solution to KV Cache Challenges in Large Language Models Asif Razzaq Artificial Intelligence Category – MarkTechPost

Graph-Constrained Reasoning (GCR): A Novel AI Framework that Bridges Structured Knowledge in Knowledge Graphs with Unstructured Reasoning in LLMs Asif Razzaq Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Large language models (LLMs) have demonstrated significant reasoning capabilities, yet they face issues like hallucinations and the inability to conduct faithful reasoning. These challenges stem from knowledge gaps, leading to factual errors during complex tasks. While knowledge graphs (KGs) are increasingly used to bolster… Read More »Graph-Constrained Reasoning (GCR): A Novel AI Framework that Bridges Structured Knowledge in Knowledge Graphs with Unstructured Reasoning in LLMs Asif Razzaq Artificial Intelligence Category – MarkTechPost

Meta AI Releases Meta Lingua: A Minimal and Fast LLM Training and Inference Library for Research Asif Razzaq Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Training and deploying large-scale language models (LLMs) is complex, requiring significant computational resources, technical expertise, and access to high-performance infrastructure. These barriers limit reproducibility, increase development time, and make experimentation challenging, particularly for academia and smaller research institutions. Addressing these issues requires a lightweight,… Read More »Meta AI Releases Meta Lingua: A Minimal and Fast LLM Training and Inference Library for Research Asif Razzaq Artificial Intelligence Category – MarkTechPost

Understanding Local Rank and Information Compression in Deep Neural Networks Shobha Kakkar Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Deep neural networks are powerful tools that excel in learning complex patterns, but understanding how they efficiently compress input data into meaningful representations remains a challenging research problem. Researchers from the University of California, Los Angeles, and New York University propose a new metric,… Read More »Understanding Local Rank and Information Compression in Deep Neural Networks Shobha Kakkar Artificial Intelligence Category – MarkTechPost

Baichuan-Omni: An Open-Source 7B Multimodal Large Language Model for Image, Video, Audio, and Text Processing Divyesh Vitthal Jawkhede Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Recent advancements in Large Language Models (LLMs) have reshaped the Artificial intelligence (AI)landscape, paving the way for the creation of Multimodal Large Language Models (MLLMs). These advanced models expand AI capabilities beyond text, allowing understanding and generation of content like images, audio, and video,… Read More »Baichuan-Omni: An Open-Source 7B Multimodal Large Language Model for Image, Video, Audio, and Text Processing Divyesh Vitthal Jawkhede Artificial Intelligence Category – MarkTechPost

Agent-as-a-Judge: An Advanced AI Framework for Scalable and Accurate Evaluation of AI Systems Through Continuous Feedback and Human-level Judgments Sana Hassan Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” Agentic systems have evolved rapidly in recent years, showing potential to solve complex tasks that mimic human-like decision-making processes. These systems are designed to act step-by-step, analyzing intermediate stages in tasks like humans do. However, one of the biggest challenges in this field is… Read More »Agent-as-a-Judge: An Advanced AI Framework for Scalable and Accurate Evaluation of AI Systems Through Continuous Feedback and Human-level Judgments Sana Hassan Artificial Intelligence Category – MarkTechPost

Meta AI Releases Meta Spirit LM: An Open Source Multimodal Language Model Mixing Text and Speech Asif Razzaq Artificial Intelligence Category – MarkTechPost

by

[[{“value”:” One of the primary challenges in developing advanced text-to-speech (TTS) systems is the lack of expressivity when transcribing and generating speech. Traditionally, large language models (LLMs) used for building TTS pipelines convert speech to text using automatic speech recognition (ASR), process it using an… Read More »Meta AI Releases Meta Spirit LM: An Open Source Multimodal Language Model Mixing Text and Speech Asif Razzaq Artificial Intelligence Category – MarkTechPost

« Previous
1
…
110
111
112
113
114
…
957
Next »