Skip to content

zetabyte

Generative AI in the Real World: Faye Zhang on Using AI to Improve Discovery Ben Lorica and Faye Zhang AI & ML – Radar

​[[{“value”:” In this episode, Ben Lorica and AI Engineer Faye Zhang talk about discoverability: how to use AI to build search and recommendation engines that actually find what you want. Listen in to learn how AI goes way beyond simple collaborative filtering—pulling in many different… Read More »Generative AI in the Real World: Faye Zhang on Using AI to Improve Discovery Ben Lorica and Faye Zhang AI & ML – Radar

H Company Releases Holo1.5: An Open-Weight Computer-Use VLMs Focused on GUI Localization and UI-VQA Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” H Company (A french AI startup) releases Holo1.5, a family of open foundation vision models purpose-built for computer-use (CU) agents that act on real user interfaces via screenshots and pointer/keyboard actions. The release includes 3B, 7B, and 72B checkpoints with a documented ~10% accuracy… Read More »H Company Releases Holo1.5: An Open-Weight Computer-Use VLMs Focused on GUI Localization and UI-VQA Asif Razzaq Artificial Intelligence Category – MarkTechPost

Alibaba Releases Tongyi DeepResearch: A 30B-Parameter Open-Source Agentic LLM Optimized for Long-Horizon Research Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” Table of contents What the benchmarks show ? Architecture and inference profile Training pipeline: synthetic data + on-policy RL Role in document and web research workflows Key features of Tongyi DeepResearch-30B-A3B Summary Alibaba’s Tongyi Lab has open-sourced Tongyi-DeepResearch-30B-A3B, an agent-specialized large language model built… Read More »Alibaba Releases Tongyi DeepResearch: A 30B-Parameter Open-Source Agentic LLM Optimized for Long-Horizon Research Asif Razzaq Artificial Intelligence Category – MarkTechPost

IBM AI Releases Granite-Docling-258M: An Open-Source, Enterprise-Ready Document AI Model Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” IBM has released Granite-Docling-258M, an open-source (Apache-2.0) vision-language model designed specifically for end-to-end document conversion. The model targets layout-faithful extraction—tables, code, equations, lists, captions, and reading order—emitting a structured, machine-readable representation rather than lossy Markdown. It is available on Hugging Face with a live… Read More »IBM AI Releases Granite-Docling-258M: An Open-Source, Enterprise-Ready Document AI Model Asif Razzaq Artificial Intelligence Category – MarkTechPost

Supercharge your organization’s productivity with the Amazon Q Business browser extension Abhinand Sukumar Artificial Intelligence

​[[{“value”:” Generative AI solutions like Amazon Q Business are transforming the way employees work. Organizations in every industry are embracing these tools to help their workforce extract valuable insights from increasingly fragmented data to accelerate decision-making processes. However, the adoption of generative AI tools hasn’t… Read More »Supercharge your organization’s productivity with the Amazon Q Business browser extension Abhinand Sukumar Artificial Intelligence

Build Agentic Workflows with OpenAI GPT OSS on Amazon SageMaker AI and Amazon Bedrock AgentCore Vivek Gangasani Artificial Intelligence

​[[{“value”:” OpenAI has released two open-weight models, gpt-oss-120b (117 billion parameters) and gpt-oss-20b (21 billion parameters), both built with a Mixture of Experts (MoE) design and a 128K context window. These models are the leading open source models, according to Artificial Analysis benchmarks, and excel… Read More »Build Agentic Workflows with OpenAI GPT OSS on Amazon SageMaker AI and Amazon Bedrock AgentCore Vivek Gangasani Artificial Intelligence

Meta AI Researchers Release MapAnything: An End-to-End Transformer Architecture that Directly Regresses Factored, Metric 3D Scene Geometry Michal Sutter Artificial Intelligence Category – MarkTechPost

​[[{“value”:” A team of researchers from Meta Reality Labs and Carnegie Mellon University has introduced MapAnything, an end-to-end transformer architecture that directly regresses factored metric 3D scene geometry from images and optional sensor inputs. Released under Apache 2.0 with full training and benchmarking code, MapAnything… Read More »Meta AI Researchers Release MapAnything: An End-to-End Transformer Architecture that Directly Regresses Factored, Metric 3D Scene Geometry Michal Sutter Artificial Intelligence Category – MarkTechPost

How to Build an Advanced End-to-End Voice AI Agent Using Hugging Face Pipelines? Asif Razzaq Artificial Intelligence Category – MarkTechPost

​[[{“value”:” In this tutorial, we build an advanced voice AI agent using Hugging Face’s freely available models, and we keep the entire pipeline simple enough to run smoothly on Google Colab. We combine Whisper for speech recognition, FLAN-T5 for natural language reasoning, and Bark for… Read More »How to Build an Advanced End-to-End Voice AI Agent Using Hugging Face Pipelines? Asif Razzaq Artificial Intelligence Category – MarkTechPost