Skip to content

zetabyte

Liquid AI Released LFM2-Audio-1.5B: An End-to-End Audio Foundation Model with Sub-100 ms Response Latency Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” Liquid AI has released LFM2-Audio-1.5B, a compact audio–language foundation model that both understands and generates speech and text through a single end-to-end stack. It positions itself for low-latency, real-time assistants on resource-constrained devices, extending the LFM2 family into audio while retaining a small footprint.… Read More »Liquid AI Released LFM2-Audio-1.5B: An End-to-End Audio Foundation Model with Sub-100 ms Response Latency Asif Razzaq Artificial Intelligence Category – MarkTechPost

MLPerf Inference v5.1 (2025): Results Explained for GPUs, CPUs, and AI Accelerators Michal Sutter Artificial Intelligence Category – MarkTechPost

​[[{“value”:” What MLPerf Inference Actually Measures? MLPerf Inference quantifies how fast a complete system (hardware + runtime + serving stack) executes fixed, pre-trained models under strict latency and accuracy constraints. Results are reported for the Datacenter and Edge suites with standardized request patterns (“scenarios”) generated… Read More »MLPerf Inference v5.1 (2025): Results Explained for GPUs, CPUs, and AI Accelerators Michal Sutter Artificial Intelligence Category – MarkTechPost

How to Build an Advanced Agentic Retrieval-Augmented Generation (RAG) System with Dynamic Strategy and Smart Retrieval? Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” In this tutorial, we walk through the implementation of an Agentic Retrieval-Augmented Generation (RAG) system. We design it so that the agent does more than just retrieve documents; it actively decides when retrieval is needed, selects the best retrieval strategy, and synthesizes responses with… Read More »How to Build an Advanced Agentic Retrieval-Augmented Generation (RAG) System with Dynamic Strategy and Smart Retrieval? Asif Razzaq Artificial Intelligence Category – MarkTechPost

Compute-Optimal Quantization-Aware Training Apple Machine Learning Research

​[[{“value”:”Quantization-aware training (QAT) is a leading technique for improving the accuracy of quantized neural networks. Previ- ous work has shown that decomposing training into a full-precision (FP) phase followed by a QAT phase yields superior accuracy compared to QAT alone. However, the optimal allocation of… Read More »Compute-Optimal Quantization-Aware Training Apple Machine Learning Research

Zhipu AI Releases GLM-4.6: Achieving Enhancements in Real-World Coding, Long-Context Processing, Reasoning, Searching and Agentic AI Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” Zhipu AI has released GLM-4.6, a major update to its GLM series focused on agentic workflows, long-context reasoning, and practical coding tasks. The model raises the input window to 200K tokens with a 128K max output, targets lower token consumption in applied tasks, and… Read More »Zhipu AI Releases GLM-4.6: Achieving Enhancements in Real-World Coding, Long-Context Processing, Reasoning, Searching and Agentic AI Asif Razzaq Artificial Intelligence Category – MarkTechPost

Modernize fraud prevention: GraphStorm v0.5 for real-time inference Jian Zhang Artificial Intelligence

​[[{“value”:” Fraud continues to cause significant financial damage globally, with U.S. consumers alone losing $12.5 billion in 2024—a 25% increase from the previous year according to the Federal Trade Commission. This surge stems not from more frequent attacks, but from fraudsters’ increasing sophistication. As fraudulent activities… Read More »Modernize fraud prevention: GraphStorm v0.5 for real-time inference Jian Zhang Artificial Intelligence

OpenAI Launches Sora 2 and a Consent-Gated Sora iOS App Michal Sutter Artificial Intelligence Category – MarkTechPost

​[[{“value”:” OpenAI released Sora 2, a text-to-video-and-audio model focused on physical plausibility, multi-shot controllability, and synchronized dialogue/SFX. The OpenAI team has also launched a new invite-only Sora iOS app (U.S. and Canada first) that enables social creation, remixing, and consent-controlled “cameos” for inserting a verified… Read More »OpenAI Launches Sora 2 and a Consent-Gated Sora iOS App Michal Sutter Artificial Intelligence Category – MarkTechPost