Skip to content

Google AI Releases VaultGemma: The Largest and Most Capable Open Model (1B-parameters) Trained from Scratch with Differential Privacy Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” Google AI Research and DeepMind have released VaultGemma 1B, the largest open-weight large language model trained entirely with differential privacy (DP). This development is a major step toward building AI models that are both powerful and privacy-preserving. Why Do We Need Differential Privacy in… Read More »Google AI Releases VaultGemma: The Largest and Most Capable Open Model (1B-parameters) Trained from Scratch with Differential Privacy Asif Razzaq Artificial Intelligence Category – MarkTechPost

IBM AI Research Releases Two English Granite Embedding Models, Both Based on the ModernBERT Architecture Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” IBM has quietly built a strong presence in the open-source AI ecosystem, and its latest release shows why it shouldn’t be overlooked. The company has introduced two new embedding models—granite-embedding-english-r2 and granite-embedding-small-english-r2—designed specifically for high-performance retrieval and RAG (retrieval-augmented generation) systems. These models are… Read More »IBM AI Research Releases Two English Granite Embedding Models, Both Based on the ModernBERT Architecture Asif Razzaq Artificial Intelligence Category – MarkTechPost

How to Build a Multilingual OCR AI Agent in Python with EasyOCR and OpenCV Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” In this tutorial, we build an Advanced OCR AI Agent in Google Colab using EasyOCR, OpenCV, and Pillow, running fully offline with GPU acceleration. The agent includes a preprocessing pipeline with contrast enhancement (CLAHE), denoising, sharpening, and adaptive thresholding to improve recognition accuracy. Beyond… Read More »How to Build a Multilingual OCR AI Agent in Python with EasyOCR and OpenCV Asif Razzaq Artificial Intelligence Category – MarkTechPost

Automate advanced agentic RAG pipeline with Amazon SageMaker AI Sandeep Raveesh-Babu Artificial Intelligence

​[[{“value”:” Retrieval Augmented Generation (RAG) is a fundamental approach for building advanced generative AI applications that connect large language models (LLMs) to enterprise knowledge. However, crafting a reliable RAG pipeline is rarely a one-shot process. Teams often need to test dozens of configurations (varying chunking… Read More »Automate advanced agentic RAG pipeline with Amazon SageMaker AI Sandeep Raveesh-Babu Artificial Intelligence

Unlock model insights with log probability support for Amazon Bedrock Custom Model Import Manoj Selvakumar Artificial Intelligence

​[[{“value”:” You can use Amazon Bedrock Custom Model Import to seamlessly integrate your customized models—such as Llama, Mistral, and Qwen—that you have fine-tuned elsewhere into Amazon Bedrock. The experience is completely serverless, minimizing infrastructure management while providing your imported models with the same unified API… Read More »Unlock model insights with log probability support for Amazon Bedrock Custom Model Import Manoj Selvakumar Artificial Intelligence

Migrate from Anthropic’s Claude 3.5 Sonnet to Claude 4 Sonnet on Amazon Bedrock Melanie Li Artificial Intelligence

​[[{“value”:” This post is co-written with Gareth Jones from Anthropic. Anthropic’s Claude 4 Sonnet model has launched on Amazon Bedrock, marking a significant advancement in foundation model capabilities. Consequently, the deprecation timeline for Anthropic’s Claude 3.5 Sonnet (v1 and v2) was announced. This evolution creates… Read More »Migrate from Anthropic’s Claude 3.5 Sonnet to Claude 4 Sonnet on Amazon Bedrock Melanie Li Artificial Intelligence

BentoML Released llm-optimizer: An Open-Source AI Tool for Benchmarking and Optimizing LLM Inference Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” BentoML has recently released llm-optimizer, an open-source framework designed to streamline the benchmarking and performance tuning of self-hosted large language models (LLMs). The tool addresses a common challenge in LLM deployment: finding optimal configurations for latency, throughput, and cost without relying on manual trial-and-error.… Read More »BentoML Released llm-optimizer: An Open-Source AI Tool for Benchmarking and Optimizing LLM Inference Asif Razzaq Artificial Intelligence Category – MarkTechPost

Deepdub Introduces Lightning 2.5: A Real-Time AI Voice Model With 2.8x Throughput Gains for Scalable AI Agents and Enterprise AI Michal Sutter Artificial Intelligence Category – MarkTechPost

​[[{“value”:” Deepdub, an Israeli Voice AI startup, has introduced Lightning 2.5, a real-time foundational voice model designed to power scalable, production-grade voice applications. The new release delivers substantial improvements in performance and efficiency, positioning it for use in live interactive systems such as contact centers,… Read More »Deepdub Introduces Lightning 2.5: A Real-Time AI Voice Model With 2.8x Throughput Gains for Scalable AI Agents and Enterprise AI Michal Sutter Artificial Intelligence Category – MarkTechPost

TwinMind Introduces Ear-3 Model: A New Voice AI Model that Sets New Industry Records in Accuracy, Speaker Labeling, Languages and Price Michal Sutter Artificial Intelligence Category – MarkTechPost

​[[{“value”:” TwinMind, a California-based Voice AI startup, unveiled Ear-3 speech-recognition model, claiming state-of-the-art performance on several key metrics and expanded multilingual support. The release positions Ear-3 as a competitive offering against existing ASR (Automatic Speech Recognition) solutions from providers like Deepgram, AssemblyAI, Eleven Labs, Otter,… Read More »TwinMind Introduces Ear-3 Model: A New Voice AI Model that Sets New Industry Records in Accuracy, Speaker Labeling, Languages and Price Michal Sutter Artificial Intelligence Category – MarkTechPost

Enhance video understanding with Amazon Bedrock Data Automation and open-set object detection Dongsheng An Artificial Intelligence

​[[{“value”:” In real-world video and image analysis, businesses often face the challenge of detecting objects that weren’t part of a model’s original training set. This becomes especially difficult in dynamic environments where new, unknown, or user-defined objects frequently appear. For example, media publishers might want… Read More »Enhance video understanding with Amazon Bedrock Data Automation and open-set object detection Dongsheng An Artificial Intelligence