How to Build a Multilingual OCR AI Agent in Python with EasyOCR and OpenCV Asif Razzaq Artificial Intelligence Category – MarkTechPost

In this tutorial, we build an Advanced OCR AI Agent in Google Colab using EasyOCR, OpenCV, and Pillow, running fully offline with GPU acceleration. The agent includes a preprocessing pipeline with contrast enhancement (CLAHE), denoising, sharpening, and adaptive thresholding to improve recognition accuracy. Beyond…

Automate advanced agentic RAG pipeline with Amazon SageMaker AI Sandeep Raveesh-Babu Artificial Intelligence

Retrieval Augmented Generation (RAG) is a fundamental approach for building advanced generative AI applications that connect large language models (LLMs) to enterprise knowledge. However, crafting a reliable RAG pipeline is rarely a one-shot process. Teams often need to test dozens of configurations (varying chunking…

Unlock model insights with log probability support for Amazon Bedrock Custom Model Import Manoj Selvakumar Artificial Intelligence

You can use Amazon Bedrock Custom Model Import to seamlessly integrate your customized models (such as Llama, Mistral, and Qwen) that you have fine-tuned elsewhere into Amazon Bedrock. The experience is completely serverless, minimizing infrastructure management while providing your imported models with the same unified API…

Migrate from Anthropic’s Claude 3.5 Sonnet to Claude 4 Sonnet on Amazon Bedrock Melanie Li Artificial Intelligence

This post is co-written with Gareth Jones from Anthropic. Anthropic’s Claude 4 Sonnet model has launched on Amazon Bedrock, marking a significant advancement in foundation model capabilities. Consequently, the deprecation timeline for Anthropic’s Claude 3.5 Sonnet (v1 and v2) was announced. This evolution creates…

BentoML Released llm-optimizer: An Open-Source AI Tool for Benchmarking and Optimizing LLM Inference Asif Razzaq Artificial Intelligence Category – MarkTechPost

BentoML has recently released llm-optimizer, an open-source framework designed to streamline the benchmarking and performance tuning of self-hosted large language models (LLMs). The tool addresses a common challenge in LLM deployment: finding optimal configurations for latency, throughput, and cost without relying on manual trial-and-error.…

Deepdub Introduces Lightning 2.5: A Real-Time AI Voice Model With 2.8x Throughput Gains for Scalable AI Agents and Enterprise AI Michal Sutter Artificial Intelligence Category – MarkTechPost

Deepdub, an Israeli Voice AI startup, has introduced Lightning 2.5, a real-time foundational voice model designed to power scalable, production-grade voice applications. The new release delivers substantial improvements in performance and efficiency, positioning it for use in live interactive systems such as contact centers,…

TwinMind Introduces Ear-3 Model: A New Voice AI Model that Sets New Industry Records in Accuracy, Speaker Labeling, Languages and Price Michal Sutter Artificial Intelligence Category – MarkTechPost

TwinMind, a California-based Voice AI startup, has unveiled its Ear-3 speech-recognition model, claiming state-of-the-art performance on several key metrics and expanded multilingual support. The release positions Ear-3 as a competitive offering against existing ASR (Automatic Speech Recognition) solutions from providers like Deepgram, AssemblyAI, Eleven Labs, Otter,…

Enhance video understanding with Amazon Bedrock Data Automation and open-set object detection Dongsheng An Artificial Intelligence

In real-world video and image analysis, businesses often face the challenge of detecting objects that weren’t part of a model’s original training set. This becomes especially difficult in dynamic environments where new, unknown, or user-defined objects frequently appear. For example, media publishers might want…

How Skello uses Amazon Bedrock to query data in a multi-tenant environment while keeping logical boundaries Nicolas de Place Artificial Intelligence

This is a guest post co-written with Skello. Skello is a leading human resources (HR) software as a service (SaaS) solution focusing on employee scheduling and workforce management. Catering to diverse sectors such as hospitality, retail, healthcare, construction, and industry, Skello offers features including…

Create a private workforce on Amazon SageMaker Ground Truth with the AWS CDK Giorgio Pessot Artificial Intelligence

Private workforces for Amazon SageMaker Ground Truth and Amazon Augmented AI (Amazon A2I) help organizations build proprietary, high-quality datasets while keeping high standards of security and privacy. The AWS Management Console provides a fast and intuitive way to create a private workforce, but many…