AI Blog

Latest insights, tutorials, and news about artificial intelligence. Stay updated with the rapidly evolving world of AI.

227 articles • Latest insights and tutorials

June 12, 2026by HowAIWorks Team

US Government Directs Anthropic to Suspend Access to Fable 5 and Mythos 5

Anthropic halts access to its latest models, Fable 5 and Mythos 5, following a US government directive citing national security concerns over potential jailbreak methods.

June 12, 2026by HowAIWorks Team

chinaai-infrastructure+3

China Invests $295B in Huawei-Powered AI Infrastructure

China announces a $295B plan to build a nationwide AI data center network, aiming to replace Nvidia and AMD with domestic Huawei chips.

June 12, 2026by HowAIWorks Team

ai-policyanthropic+3

Dario Amodei Proposes Exponential AI Policy

Anthropic CEO Dario Amodei outlines a 5-point policy framework to regulate AI development, addressing security, economy, and global leadership.

June 12, 2026by HowAIWorks Team

minimaxm3+5

MiniMax M3 Open-Sourced on Hugging Face

MiniMax has open-sourced its M3 model, a 428B MoE architecture optimized for long context and agentic scenarios, now available on Hugging Face.

June 9, 2026by HowAIWorks Team

anthropicopenai+2

OpenAI Hardware Engineer Moves to Anthropic

Anthropic has hired former OpenAI engineer Clive Chan to develop its own AI chips in order to reduce computing costs.

June 9, 2026by HowAIWorks Team

applesiri ai+3

Apple Unveils Siri AI Powered by Google Gemini

At WWDC 2026, Apple introduced a completely redesigned version of its voice assistant, Siri AI, featuring visual intelligence and on-device data processing.

June 9, 2026by HowAIWorks Team

AnthropicClaude Fable 5+4

Claude Fable 5: The Next Generation of Frontier Intelligence

Anthropic introduces Claude Fable 5, the state-of-the-art model for ambitious, long-running coding and knowledge work projects.

June 9, 2026by HowAIWorks Team

googleleap+4

LEAP: The System That Helped LLMs Solve All Problems of the Putnam 2025 Competition

Google has published a paper on LEAP, a system designed for automated theorem proving. It allows LLMs to construct formal proofs in the Lean language and has already solved 12 out of 12 Putnam 2025 problems.

June 9, 2026by HowAIWorks Team

tsmcintel+4

Google and Nvidia Shift Orders to Intel Amid TSMC Capacity Shortages

Due to a lack of TSMC production lines, Google and Nvidia are considering Intel as a backup chip manufacturer. Google has already ordered 3 million TPUs for 2028.

June 9, 2026by HowAIWorks Team

OpenAIChatGPT+4

OpenAI to Transform ChatGPT into a Superapp with Autonomous Agents

In the coming weeks, OpenAI will roll out the first major redesign of ChatGPT since 2022, transforming the service from a conversational chatbot into a platform for autonomous agents.

June 9, 2026by HowAIWorks Team

OpenAIIPO+5

OpenAI Takes Its First Official Step Towards an IPO

OpenAI has confidentially filed an S-1 form to go public, beginning the SEC review process.

June 9, 2026by HowAIWorks Team

ainews+3

Sakana AI to Focus on Algorithmic Evolution of AI

The Japanese startup has opened a Recursive Self-Improvement (RSI) research lab to create networks that optimize their own code.

June 9, 2026by Evgeny Ivanov

TencentHunyuan+3

Tencent Hunyuan Introduces UniRL: Universal RL Post-Training for Multimodal Models

Tencent Hunyuan releases UniRL, a unified infrastructure for reinforcement learning post-training across diverse model families including LLMs, VLMs, and diffusion models.

May 22, 2026by HowAIWorks Team

aideepseek+3

DeepSeek Slashes V4-Pro Prices by up to 90%

DeepSeek has announced a massive price reduction for its flagship DeepSeek-V4-Pro model, lowering token costs by 75% for inputs and 90% for outputs.

May 22, 2026by HowAIWorks Team

GoogleGemini+8

Google Antigravity Triples Gemini Request Limits

Google Antigravity team member Varun Mohan announces a permanent 3x increase in Gemini request limits for paid tiers and resets weekly user quotas.

May 22, 2026by HowAIWorks Team

Google SearchGemini+6

Google Tests Gemini-Powered Conversational Search Ads

Google is testing Gemini-powered conversational search ads, introducing interactive chatbots, dynamic product bundling, and direct native checkout.

May 22, 2026by HowAIWorks Team

OpenAISam Altman+7

Sam Altman Pushes for OpenAI IPO in September

OpenAI targets a September IPO as Sam Altman accelerates listing plans despite CFO cautions, following the dismissal of Elon Musk's lawsuit.

May 22, 2026by HowAIWorks Team

AI RegulationAI Safety+5

White House Proposes 90-Day Pre-Release AI Model Testing

The US administration proposes a voluntary 90-day review period for flagship AI models, prompted by cybersecurity concerns and Anthropic's Mythos model.

May 21, 2026by HowAIWorks Team

AlibabaQwen 3.7-Max+6

Qwen 3.7-Max: Alibaba's Long-Horizon Agent Engine

Alibaba launches Qwen 3.7-Max, a flagship AI model demonstrating 35-hour autonomous operation, 10x kernel speedup, and cross-agent generalization.

May 21, 2026by HowAIWorks Team

MITEmbedded Language Flows+6

Embedded Language Flows: MIT Revitalizes Text Diffusion

MIT researchers introduce Embedded Language Flows (ELF), a continuous-time flow matching framework that brings data-efficient diffusion models to text generation.

May 1, 2026by HowAIWorks Team

AlibabaQwen+6

Qwen-Scope: Alibaba's Open 'X-Ray' for Model Interpretability

Alibaba releases Qwen-Scope, a massive collection of Sparse Autoencoders (SAEs) that allows researchers to 'look inside' Qwen models and steer their behavior.

May 1, 2026by HowAIWorks Team

DeepSeekMultimodal AI+7

DeepSeek Teases Multimodal Capabilities: 'Now, We See You'

Xiaokang Chen of DeepSeek's multimodal team hints at upcoming vision features, signaling the lab's move toward integrated visual data understanding.

April 24, 2026by HowAIWorks Team

aideepseek+7

DeepSeek-V4: Pro and Flash Models with 1M Context

DeepSeek releases V4-Pro and V4-Flash models featuring 1.6T parameters, open-source weights, and a massive 1 million token context window.

April 24, 2026by HowAIWorks Team

GoogleDESIGN.md+5

Google DESIGN.md: Standard for AI-Native Design Systems

Google introduces DESIGN.md, an open-source standard that bridges the gap between brand guidelines and AI coding agents for consistent, high-quality UI generation.

April 24, 2026by HowAIWorks Team

OpenAIGPT-5.5+6

OpenAI Releases GPT-5.5: The Agentic Coding Revolution

OpenAI has announced GPT-5.5 (Spud), a massive new base model optimized for agentic coding, handling complex 20-hour tasks with ease. Here are the details.

April 24, 2026by HowAIWorks Team

Sony AIRobotics+6

Sony AI Ace: First Robot to Beat Pro Table Tennis Players

Sony AI introduces Ace, a groundbreaking table tennis robot featuring a 20.2ms reaction time and advanced reinforcement learning to defeat human experts.

April 24, 2026by HowAIWorks Team

xAIGrok+5

xAI Launches Grok Voice Think Fast 1.0

xAI announces Grok Voice Think Fast 1.0, a new flagship voice model built for complex workflows, real-time reasoning, and zero added latency.

April 23, 2026by HowAIWorks Team

claudeclaude-code+8

Claude Code Introduces /ultrareview: New Fleet of Bug-Hunters

Explore the new /ultrareview feature in Claude Code, a cloud-based multi-agent system designed to identify and verify deep-seated bugs before code merges.

April 23, 2026by HowAIWorks Team

Google CloudTPU+4

Google Unveils Eighth-Generation TPUs: TPU 8t and TPU 8i

Google introduces specialized TPU architectures for the agentic era, featuring TPU 8t for training and TPU 8i for inference and reasoning.

April 23, 2026by HowAIWorks Team

Kimi K2.6Dynamic GGUF+7

Kimi K2.6: Running 1 Trillion Parameters Locally

Unsloth releases Dynamic GGUF versions of Kimi K2.6, enabling the 1T parameter model to run on high-end local setups with speeds exceeding 40 tokens per second.

April 23, 2026by HowAIWorks Team

aiworld-models+8

Odyssey-2 Max: A New SOTA in Real-Time Physics World Models

Odyssey releases Odyssey-2 Max, an autoregressive world model achieving SOTA results in physics simulation and real-time user interactivity.

April 23, 2026by HowAIWorks Team

openharnessai-agents+7

OpenHarness: The Open-Source Infrastructure Layer for AI Agents

OpenHarness launches an open-source infrastructure layer for AI agents, providing unified tools for memory management, tool validation, and secure execution.

April 23, 2026by HowAIWorks Team

aixiaomi+7

Xiaomi MiMo-V2.5: The Next Generation of Open Agentic Models

Xiaomi announces the MiMo-V2.5 series, featuring flagship agentic performance and native omnimodality to rival frontier models like Claude 4.6 and GPT-5.4.

April 22, 2026by HowAIWorks Team

aiant-group+9

Ant Group's Ling-2.6-flash: Lean MoE for AI Agents

Ant Group releases Ling-2.6-flash, an efficient 104B MoE model optimized for 'intelligence per token,' agentic workflows, and fast long-context performance.

April 22, 2026by HowAIWorks Team

Google DeepMindDeep Research+5

Google DeepMind Unveils Deep Research and Deep Research Max

Google DeepMind introduces autonomous research agents powered by Gemini 3.1 Pro, featuring native MCP support for private data analysis.

April 22, 2026by HowAIWorks Team

Mac mini M4AI Agents+6

Mac Mini M4: The Ultimate Hub for Autonomous AI Agents

Why the Mac mini M4 has become the 2026 industry standard for 24/7 AI agent hosting and local LLM pipelines.

April 21, 2026by HowAIWorks Team

ClaudeAnthropic+6

Claude Masterclass: The Essential Guide to AI Workflows

Unlock the full potential of Anthropic's Claude with this curated masterlist of tools, guides, and strategic frameworks for 2026.

April 21, 2026by HowAIWorks Team

MITDeep Learning+6

MIT Deep Learning Fall 2024 Course Released for Free

MIT OpenCourseWare has released the full '6.7960 Deep Learning' course from Fall 2024, featuring Phillip Isola and comprehensive materials for self-study.

April 21, 2026by HowAIWorks Team

aikimi+7

Kimi K2.6 Release: Open Weights and 12-Hour Long-Horizon Coding

Moonshot AI releases Kimi K2.6, featuring open weights, impressive coding benchmarks, and support for agentic swarms with up to 300 sub-agents.

April 18, 2026by HowAIWorks Team

Claude CodeAnthropic+5

How a Google Engineer Automated 80% of Coding with Claude Code

Discover how a Google developer used Claude Code, a comprehensive CLAUDE.md file, and the Everything Claude Code OS to automate his routine tasks.

April 18, 2026by HowAIWorks Team

OpenAICodex+7

OpenAI Unveils Free Codex Skills: Transforming Assistants into Agents

OpenAI has just released a library of free, one-click skills for Codex, enabling robust automation for design, mobile development, and presentation workflows.

April 18, 2026by HowAIWorks Team

AI in MedicineOpenClaw+7

OpenClaw: 869 AI Skills for Medical Research

OpenClaw launches a medical skills library, providing open-source tools and datasets for AI agents to assist in clinical research and medical documentation.

April 3, 2026by HowAIWorks Team

AlibabaQwen 3.6-Plus+6

Alibaba Qwen 3.6-Plus: 1M Context Window and Agentic Coding

Alibaba officially unveils Qwen 3.6-Plus, a flagship model featuring a 1-million-token context window and optimized for repository-level agentic coding.

April 3, 2026by HowAIWorks Team

LLMKnowledge Base+6

Building Personal Knowledge Bases with LLMs: The Karpathy Method

Explore Andrej Karpathy's workflow for using LLMs to incrementally compile and manage a massive personal knowledge base in Obsidian.

April 3, 2026by HowAIWorks Team

CursorAI Agents+8

Introducing Cursor 3: A Unified Agentic Workspace

Introducing Cursor 3, a workspace designed from the ground up for AI agents, featuring multi-repo support and seamless local-cloud handoff.

April 3, 2026by HowAIWorks Team

Gemma 4Google+6

Google Gemma 4: The Next Frontier of Open Models for AI Agents

Google introduces Gemma 4, a new family of open models optimized for complex reasoning, autonomous agents, and tool use with up to 256K context window.

April 3, 2026by HowAIWorks Team

MWS AICotype Light 3+6

MWS AI Launches Cotype Light 3: 9B Multimodal

MWS AI launches Cotype-Light-3, an ultra-efficient 3B parameter model optimized for mobile and edge devices with near-lossless performance on core benchmarks.

April 2, 2026by HowAIWorks Team

Ant GroupLingBot-Depth+6

Ant Group Releases LingBot-Depth: A 2.7 TB RGB-D Dataset for Robotics

Ant Group has published LingBot-Depth on Hugging Face, a massive 2.7 TB dataset featuring over 3 million RGB-D examples for advancing spatial perception.

April 2, 2026by HowAIWorks Team

Claude CodeAnthropic+6

Mastering Claude Code: Systemic Patterns for Agentic Engineering

Anthropic releases an official repository of best practices for Claude Code, featuring advanced strategies for autonomous engineering and multi-file tasks.

April 2, 2026by HowAIWorks Team

Claude CodeAnthropic+7

Visual Masterclass: Mastering Claude Code and Agents

Master Anthropic's Claude Code with a comprehensive visual guide covering everything from basic setup to advanced multi-agent workflows and MCP integrations.

April 2, 2026by HowAIWorks Team

Claude CodeAnthropic+6

Claude Code: The Ultimate Resource Collection for Pro Developers

Master Anthropic's Claude Code with this curated selection of repositories, documentation, tutorials, and books.

April 2, 2026by HowAIWorks Team

ai-agentsmacos-apps+8

Collaborator: A Unified macOS Canvas for Agentic Development

Collaborator releases a dedicated macOS agent workspace, providing a seamless local environment for autonomous AI agents to interact with system tools.

April 2, 2026by HowAIWorks Team

aiglm-5v-turbo+7

GLM-5V-Turbo: The AI That Sees Your Screen and Writes the Code

GLM-5V-Turbo is a native multimodal model that transforms designs, screenshots, and UI layouts into runnable code with unprecedented accuracy.

April 2, 2026by HowAIWorks Team

aih-company+7

Holo3: H Company's SOTA Foundation Model for Desktop Agents

H Company unveils Holo3, a high-performance Mixture-of-Experts model family that sets a new industry standard for autonomous desktop application control.

April 2, 2026by HowAIWorks Team

Liquid AILFM2.5-350M+6

Liquid AI LFM2.5-350M: A Sub-500MB Agentic Powerhouse

Liquid AI releases LFM2.5-350M, a 350M parameter model trained on 28T tokens with RL, optimized for data extraction and agentic loops on edge devices.

April 2, 2026by HowAIWorks Team

Transformers.jsWebGPU+6

Transformers.js v4: Revolutionizing Web-Based AI

Hugging Face releases Transformers.js v4 with a new WebGPU runtime, drastic performance improvements, and support for massive models directly in the browser.

March 24, 2026by HowAIWorks Team

claudeclaude-code+8

Claude Code Auto-dream: New Agentic Memory Management

Claude Code introduces Auto-Dream memory, allowing autonomous agents to refine their understanding of codebases during idle time for better accuracy.

March 24, 2026by HowAIWorks Team

AIWorld Models+8

LeWorldModel: Yann LeCun's End-to-End JEPA Breakthrough

Yann LeCun introduces LeWorldModel (LeWM), the first end-to-end JEPA trained from raw pixels, solving the collapse problem in world models.

March 24, 2026by HowAIWorks Team

AI AgentsMeta+6

Mark Zuckerberg's AI Assistant for CEO Tasks

Mark Zuckerberg is building an AI agent to streamline his duties as Meta's CEO, enabling a flatter organization and reducing management layers.

March 24, 2026by HowAIWorks Team

NVIDIAAI+6

NVIDIA Kimodo: AI-Powered 3D Motion Generation

NVIDIA releases Kimodo, a diffusion-based generative model for realistic 3D motion, supporting diverse skeletons including SMPL-X and Unitree G1.

March 24, 2026by HowAIWorks Team

Terence TaoAI in Mathematics+4

Terence Tao: The Future of AI in Mathematics

Renowned mathematician Terence Tao discusses how AI is transforming mathematical research, idea generation, and the shift from creation to filtration.

March 20, 2026by HowAIWorks Team

CursorComposer 2+8

Introducing Composer 2: The New Frontier of AI Coding in Cursor

Cursor launches Composer 2, a next-gen model with record-breaking results on SWE-bench and Terminal-Bench, trained for long-horizon autonomous tasks.

March 20, 2026by HowAIWorks Team

ElevenLabsAI Music+6

ElevenLabs Music Marketplace: Monetize Your AI-Generated Tracks

ElevenLabs launches Music Marketplace in ElevenCreative, allowing users to publish, license, and earn from their AI-generated music tracks for commercial use.

March 20, 2026by HowAIWorks Team

Microsoft AIMAI-Image-2+8

Microsoft MAI-Image-2: A New Frontier for Photorealistic AI Imagery

Microsoft unveils MAI-Image-2: a creator-focused successor with enhanced photorealism, reliable in-image text, and hyper-detailed scene generation.

March 20, 2026by HowAIWorks Team

aixiaomi+8

Xiaomi MiMo-V2: Three New State-of-the-Art AI Models

Xiaomi releases MiMo-V2, a groundbreaking AI trio featuring a 1T-parameter Pro model, an omnimodal agent, and a next-gen TTS system.

March 19, 2026by HowAIWorks Team

Google LabsStitch+5

Google Labs Stitch: Future of AI-Native UI Design

Google Labs unveils Stitch AI, a generative design tool that autonomously creates and iterates on UI components and design systems using simple prompts.

March 19, 2026by HowAIWorks Team

aiminimax+10

MiniMax M2.7: Early Echoes of AI Self-Evolution

MiniMax unveils M2.7, a breakthrough model that participates in its own evolution through autonomous agent harnesses and advanced software engineering.

March 16, 2026by HowAIWorks Team

Markov AIComputer Use+6

Computer Use Large: The Largest Open-Source Dataset for AI Agents

Markov AI releases 'computer-use-large', a massive dataset of 48,000+ screen recordings for training AI agents to use professional software.

March 9, 2026by HowAIWorks Team

AlibabaGUI Agents+6

Alibaba GUI-Owl-1.5 & Mobile-Agent-v3.5: The Next Era of GUI Agents

Alibaba releases GUI-Owl 1.5, a multi-modal mobile agent that can autonomously navigate and interact with complex smartphone applications through vision.

March 9, 2026by HowAIWorks Team

AlibabaQwen+4

Alibaba Qwen Team Faces Key Departures and Restructuring

Alibaba's Qwen team undergoes restructuring following key departures, aiming to accelerate the development of next-generation large language models.

March 9, 2026by HowAIWorks Team

GPT-5.4OpenAI+4

Introducing OpenAI GPT-5.4: New Frontier in AI Workflows

OpenAI launches GPT-5.4 with 1M token context, native computer interaction, and 33% fewer errors. Discover how it redefines professional AI workflows.

March 9, 2026by HowAIWorks Team

TencentHY-WU+7

Tencent HY-WU: Dynamic LoRA for Precise Image Editing

Tencent introduces HY-WU, a Weight Unleashing framework that generates dynamic LoRA adapters to solve gradient conflicts in multi-task image editing.

March 3, 2026by HowAIWorks Team

AlibabaQwen+6

Qwen 3.5: Scaling Intelligence in Compact Models

Alibaba's new Qwen 3.5 series packs flagship intelligence into compact sizes (0.8B to 9B), featuring native multimodality and enhanced agentic capabilities.

March 3, 2026by HowAIWorks Team

ChatGPT-5.4OpenAI+6

ChatGPT-5.4 Leaks: 2M Context, Full-Res Vision, and Agentic Power

Recent leaks regarding ChatGPT 5.4 reveal major upgrades in agentic orchestration, native 3D world understanding, and significant efficiency improvements.

March 3, 2026by HowAIWorks Team

Google GeminiAI Automation+5

Automate Your Workflows with Gemini Scheduled Actions

Learn how to use Gemini's new recurring actions to schedule daily summaries, weekly updates, and automated reports directly in the Gemini App.

March 3, 2026by HowAIWorks Team

Liquid AILFM 2.5+8

Liquid AI LFM2.5-1.2B-Thinking: On-Device Reasoning Under 1GB

Liquid AI LFM2.5-1.2B-Thinking is a breakthrough 1.2B reasoning model that fits in 900MB, delivering high-performance logic on phones and laptops.

February 25, 2026by HowAIWorks Team

AlibabaQwen+5

Alibaba Qwen 3.5: 1M Token Context and Efficiency

Alibaba announces Qwen 3.5 Medium, an 8B parameter model featuring enhanced reasoning and a 256k context window for professional coding and research tasks.

February 25, 2026by HowAIWorks Team

ByteDanceSeedance+5

Seedance 2.0: Breakthroughs and Copyright Launch Delay

ByteDance unveils Seedance 2.0, a powerful AI video model, but postpones its global launch amid significant copyright infringement allegations.

February 23, 2026by HowAIWorks Team

Claude CodeAI Engineering+6

Garry Tan's Claude Code Senior Engineer Prompt

Discover how Y Combinator CEO Garry Tan uses Claude Code as a senior engineer to ship complex features with fully tested code in under an hour.

February 23, 2026by HowAIWorks Team

Gemini 3.1 ProAI Models+5

Gemini 3.1 Pro: A Smarter Model for Complex Tasks

Google announces Gemini 3.1 Pro, featuring a 77.1% ARC-AGI-2 score and advanced reasoning for agentic workflows and complex system synthesis.

February 23, 2026by HowAIWorks Team

PaddleOCRBaidu+9

PaddleOCR-VL-1.5: SOTA Multimodal Document Parsing

Baidu announces PaddleOCR-VL-1.5, a 0.9B VLM achieving 94.5% on OmniDocBench v1.5 with breakthrough robustness in real-world scenarios.

February 19, 2026by HowAIWorks Team

Google GeminiLyria 3+4

Gemini App Lyria 3: Create Music from Text and Images

Create custom 30-second music tracks in Gemini using Google's Lyria 3 model. Generate songs from text or images with integrated SynthID watermarking.

February 19, 2026by HowAIWorks Team

AILLM+6

UltraData-Math: Scaling High-Quality Mathematical Reasoning

OpenBMB releases UltraData-Math, a 290B+ token dataset with a unique tiered grading system to boost LLM performance in complex mathematical tasks.

February 16, 2026by HowAIWorks Team

ClaudeAnthropic+11

Claude Cowork: AI Agent for macOS and Windows

Anthropic's Cowork research preview brings Claude Code's agentic capabilities to everyone, now available on Windows with new global instructions.

February 16, 2026by HowAIWorks Team

aiglm+8

GLM-5: Beyond Vibe Coding to Agentic Engineering

Zhipu AI unveils GLM-5, a state-of-the-art model designed for complex multi-file software engineering and long-horizon autonomous tasks.

February 16, 2026by HowAIWorks Team

NanbeigeLLM+5

Nanbeige4.1-3B: Compact Powerhouse with Strong Reasoning

Nanbeige releases a 4.1.3B parameter model that sets new open-weights benchmarks for efficiency and reasoning on small-scale hardware and mobile devices.

February 16, 2026by HowAIWorks Team

WaymoWorld Model+8

Waymo World Model: Generative AI for Safer Autonomous Driving

Waymo introduces the Waymo World Model, a breakthrough generative AI built on Genie 3 that simulates hyper-realistic driving scenarios for safer navigation.

February 16, 2026by HowAIWorks Team

XiaomiRobotics+7

Xiaomi-Robotics-0: Scaling VLA Models for Real-Time Robot Control

Xiaomi announces Robotics 0, a new division focused on building a universal robot operating system and affordable humanoid robots for home and industrial use.

February 3, 2026by HowAIWorks Team

NVIDIAPersonaPlex+8

NVIDIA PersonaPlex: Controlled Full-Duplex Speech AI

Discover PersonaPlex, NVIDIA's breakthrough in full-duplex speech AI that allows precise control over voice and persona for natural, low-latency interactions.

February 2, 2026by HowAIWorks Team

LingBot-DepthSpatial Perception+5

LingBot-Depth: Precision Spatial Perception for Embodied AI

LingBot-Depth is a high-precision spatial perception model from Robbyant that delivers metrically accurate 3D measurements for robots and autonomous systems.

January 30, 2026by HowAIWorks Team

Qwen3-ASRSpeech Recognition+6

Qwen3-ASR: SOTA Multilingual Speech Recognition and Forced Alignment

Alibaba's Qwen team releases Qwen3-ASR and Qwen3-ForcedAligner, setting new benchmarks in multilingual speech-to-text and precise timestamping.

January 30, 2026by HowAIWorks Team

LLMInference Optimization+7

Tencent HPC-Ops: SOTA Performance for LLM Inference

Tencent releases HPC-Ops, a production-grade high-performance operator library for LLM inference, delivering up to 2.22x speedup on NVIDIA H20 GPUs.

January 30, 2026by HowAIWorks Team

Computer VisionMultimodal+6

Youtu-VL: Unified Vision-Language Supervision

Tencent Youtu Lab introduces Youtu-VL, a 4B parameter model that pioneers the 'vision-as-target' paradigm for advanced visual perception.

January 29, 2026by HowAIWorks Team

Google DeepMindMultilingual AI+5

ATLAS: New Scaling Laws for Multilingual AI Models

Google DeepMind's Atlas research explores multilingual scaling laws, providing a framework for training highly efficient models across hundreds of languages.

January 29, 2026by HowAIWorks Team

HunyuanImage-3.0Tencent+7

HunyuanImage-3.0: Tencent's Massive 80B MoE Multimodal Model

Tencent releases HunyuanImage-3.0, the largest open-source MoE image generation model with a unified multimodal architecture and 80 billion parameters.

January 29, 2026by HowAIWorks Team

TencentHunyuanImage 3.0+6

HunyuanImage 3.0-Instruct: Tencent's Massive Native Multimodal Leap

Tencent releases HunyuanImage 3.0-Instruct, the world's largest open-source image MoE model with 80B parameters, unifying understanding and generation.

January 29, 2026by HowAIWorks Team

NVIDIAEarth-2+6

NVIDIA Earth-2: World's First Open AI Weather Models

NVIDIA launches Earth-2, a family of open, accelerated models for global weather forecasting, nowcasting, and data assimilation.

January 27, 2026by HowAIWorks Team

AI AgentsAgentic Commerce+10

China's Tech Giants Race for Agentic Commerce

Chinese tech giants like Alibaba, Tencent, and ByteDance are racing to build AI-powered agentic commerce super apps.

January 27, 2026by HowAIWorks Team

QwenLLM+5

Qwen3-Max-Thinking: A New Era for Reasoning Models

Alibaba Cloud introduces Qwen3-Max-Thinking, a flagship reasoning model with adaptive tool-use and test-time scaling, rivaling GPT-5.2 and Claude Opus 4.5.

January 26, 2026by HowAIWorks Team

AdobeAcrobat+4

Adobe Acrobat AI: Presentations, Podcasts & Chat

Adobe announced a major update to Acrobat: generate presentations and podcasts from PDFs, edit documents via AI chat, and collaborate in PDF Spaces.

January 26, 2026by HowAIWorks Team

AI AgentsClawdBot+6

ClawdBot: The Open Source Personal AI Assistant

ClawdBot is an open-source, self-hosted AI agent that connects to your favorite chat apps and executes real-world tasks through your own infrastructure.

January 26, 2026by HowAIWorks Team

github-copilotsdk+6

GitHub Copilot SDK: Build AI Agents Anywhere

GitHub launches the Copilot SDK for building agentic applications, enabling developers to integrate Copilot's reasoning directly into their own software.

January 26, 2026by HowAIWorks Team

Mafin 2.5PageIndex+7

Mafin 2.5: Reasoning RAG Hits 98.7% Accuracy

Discover how Mafin 2.5 and the PageIndex framework are revolutionizing financial document analysis by replacing vector similarity with structured reasoning.

January 26, 2026by HowAIWorks Team

MicrosoftClaude Code+5

Microsoft Adopts Claude Code from Anthropic

Microsoft is deploying Anthropic's Claude Code for its internal teams, favoring it over GitHub Copilot in key development scenarios.

January 26, 2026by HowAIWorks Team

AIQwen+7

Real-Qwen-Image-V2: New Era of AI Realism

Reviewing Real-Qwen-Image-V2 — a fine-tuned version of Qwen-Image-2512 focused on photorealism, sharpness, and optimized facial aesthetics.

January 26, 2026by HowAIWorks Team

Stepfun AIStep3-VL-10B+5

Step3-VL-10B: Redefining Multimodal AI

Stepfun AI releases Step3-VL-10B, a 10B parameter multimodal model that outperforms giants 20x its size through innovative Parallel Coordinated Reasoning.

January 26, 2026by HowAIWorks Team

AIASR+6

VibeVoice-ASR: Long-Form Speech Breakthrough

Discover VibeVoice-ASR, Microsoft's new model capable of 60-minute single-pass speech-to-text with speaker diarization and custom hotwords.

January 23, 2026by HowAIWorks Team

aianthropic+10

Assistant Axis: Controlling LLM Character

Anthropic research reveals how LLMs drift between personas. The Assistant Axis stabilizes model behavior and prevents harmful outputs via activation capping.

January 23, 2026by HowAIWorks Team

QwenAlibaba+10

Qwen3-TTS Open Sourced: Voice Design and Clone

Alibaba open-sources Qwen3-TTS family with voice design, cloning, and ultra-high-quality speech generation across 10 languages.

January 21, 2026by HowAIWorks Team

GLM-4.7-FlashLLM+5

GLM-4.7-Flash: King of MoE Models in 30B Class

Z.ai releases GLM-4.7-Flash, a 30B MoE model with exceptional reasoning, coding, and agentic capabilities, rivaling much larger models.

January 21, 2026by HowAIWorks Team

Liquid AILFM 2.5+6

Liquid AI LFM2.5-1.2B-Thinking: Compact Power

Exploring Liquid AI's newest 1.2B reasoning model optimized for agentic tasks, RAG, and high-speed edge inference with LIV convolution architecture.

January 21, 2026by HowAIWorks Team

ManusApp Development+6

Manus App Sharing: Streamlining Mobile Dev

Manus simplifies the path from app development to real-world testing with automated AAB packaging for Android and direct TestFlight integration for iOS.

January 21, 2026by HowAIWorks Team

NVIDIABlackwell+5

NVIDIA Blackwell: Performance Leaps for MoE

Discover how NVIDIA Blackwell and TensorRT-LLM deliver up to 2.8x throughput increases for Mixture of Experts (MoE) models like DeepSeek-R1.

January 12, 2026by HowAIWorks Team

ClaudeAnthropic+9

Claude Cowork: AI Assistant for Files & Folders

Anthropic launches Cowork, a research preview that gives Claude access to your computer files, enabling autonomous task completion beyond coding.

January 9, 2026by HowAIWorks Team

nvidiatest-time-training+9

NVIDIA TTT-E2E: Test-Time Training Long Context

NVIDIA researchers introduce TTT-E2E, a test-time training approach that enables models to handle ultra-long contexts efficiently through dynamic adaptation.

December 20, 2025by HowAIWorks Team

googlegemini+8

Gemini 3 Flash: Frontier Intelligence for Speed

Google releases Gemini 3 Flash, a fast AI model with Pro-grade reasoning at Flash-level speed. Achieves 90.4% on GPQA Diamond and 3x faster than 2.5 Pro.

December 20, 2025by HowAIWorks Team

metaai-models+10

Meta Develops Mango and Avocado AI Models

Meta announces Mango image/video AI model and Avocado LLM, targeting first-half 2026 release. Led by Scale AI founder Alexandr Wang with $14B investment.

December 20, 2025by HowAIWorks Team

googlegemma+9

T5Gemma 2: Next-Gen Encoder-Decoder Models

Google DeepMind releases T5Gemma-2, a hybrid model combining the strengths of T5 and Gemma for industry-leading performance on translation and reasoning tasks.

December 18, 2025by HowAIWorks Team

openaigpt-5+10

OpenAI Accelerates Biological Research with GPT-5

OpenAI and Red Queen Bio use GPT-5 to optimize molecular cloning protocols, achieving 79-fold efficiency increase in wet lab research.

December 15, 2025by HowAIWorks Team

AI ToolsClaude+3

Skill Seeker: Docs to Claude AI Skills

Skill Seeker is a powerful open-source tool that converts documentation, GitHub repositories, and PDFs into optimized skills for Claude AI.

December 15, 2025by HowAIWorks Team

AI NewsZoom+3

Zoom AI Hits 48.1% on Humanity's Last Exam

Zoom's federated AI approach achieves 48.1% on the rigorous Humanity's Last Exam benchmark, surpassing Google Gemini 3 Pro.

December 6, 2025by HowAIWorks Team

TencentHY 2.0+9

Tencent HY 2.0: MoE Model with 73.4 IMO Score

Tencent releases HY 2.0 foundation model with MoE architecture, 256K context, and major gains in reasoning, coding, and instruction following capabilities.

December 5, 2025by HowAIWorks Team

aianthropic+10

How AI Transforms Work: Anthropic Research 2025

Anthropic research: AI boosts productivity 50%, enables new work types. Insights from 132 engineers on how Claude transforms software engineering work.

December 5, 2025by HowAIWorks Team

aianthropic+10

Anthropic Interviewer: Tool for AI Impact

Anthropic releases a new AI interviewer research tool designed to conduct structured interviews and synthesize insights using advanced language understanding.

December 5, 2025by HowAIWorks Team

LidarAutonomous Vehicles+10

Guide to Automotive Lidar Technology

Guide to automotive LiDAR in 2025, covering advancements in solid-state sensors, long-range detection, and AI-driven point cloud processing for safer driving.

December 5, 2025by HowAIWorks Team

aigoogle-workspace+10

Google Workspace Studio: AI Agents for Work

Google launches Workspace Studio, enabling anyone to create AI agents that automate everyday work tasks using Gemini 3, with no coding required.

December 5, 2025by HowAIWorks Team

Kling AIVideo Generation+8

Kling AI Video 2.6: Native Audio Generation

Kling AI Video 2.6 introduces native audio generation, enabling users to create cinematic videos with synchronized sound effects and background music.

December 5, 2025by HowAIWorks Team

PerplexityBrowseSafe+10

Perplexity BrowseSafe: Safer AI Browsers

Perplexity introduces BrowseSafe, an open detection model and benchmark for protecting AI agents from prompt injection attacks in browser environments.

December 5, 2025by HowAIWorks Team

ai hardwaregpu+8

TPUs vs GPUs vs ASICs: AI Hardware Guide 2025

Complete guide to TPUs, GPUs, and ASICs for AI workloads. Compare architectures, performance, efficiency, and market trends as of December 2025.

December 5, 2025by HowAIWorks Team

TransformersHugging Face+9

Transformers v5: PyTorch-First Library Update

Hugging Face releases Transformers v5 with PyTorch-only backend, quantization as first-class feature, and enhanced interoperability across the AI ecosystem.

December 5, 2025by HowAIWorks Team

UMARobotics+9

UMA Launches Physical AI Robotics from Europe

UMA, founded by Tesla and Google DeepMind veterans, launches to build humanoid robots for real-world deployment in warehouses, hospitals, and factories.

December 3, 2025by HowAIWorks Team

ByteDanceDoubao+10

Doubao-Seed-Code: ByteDance's New Coding Model

ByteDance's Volcano Engine launches Doubao-Seed-Code, achieving state-of-the-art on SWE-Bench-Verified with 62.7% lower costs and 256k context.

December 3, 2025by HowAIWorks Team

aigoogle-cloud+10

Google Cloud Launches Advent of Agents 2025

Google Cloud launches Advent of Agents 2025: a 25-day program to build production-ready AI agents using ADK, Agent Engine, and Gemini 3 models.

December 3, 2025by HowAIWorks Team

Mistral AIMistral 3+9

Mistral 3: Next Generation Open Multimodal AI

Mistral AI announces Mistral 3 with Large 3 and Ministral 3 series, featuring state-of-the-art performance, multimodal capabilities, and Apache 2.0 licensing.

December 3, 2025by HowAIWorks Team

aiaudio+10

Step Audio R1: First Audio Reasoning Model

Step Audio R1 is the first audio model to unlock Chain-of-Thought reasoning, solving inverted scaling and surpassing Gemini 2.5 Pro in complex audio tasks.

December 2, 2025by HowAIWorks Team

AmazonAWS+10

Amazon Nova 2: Four New Models Plus Forge and Act

Amazon announces Nova 2 model family with Lite, Pro, Sonic, and Omni, plus Nova Forge for custom models and Nova Act for reliable AI agents.

December 2, 2025by HowAIWorks Team

ChinaNVIDIA+10

China Claims 14nm Chips Rival NVIDIA's 4nm

China's new 14nm processor with 18nm DRAM achieves 120 TFLOPS, outperforming NVIDIA A100 GPUs and addressing memory bandwidth challenges.

December 2, 2025by HowAIWorks Team

Kling AIVideo Generation+8

Kling AI Launches O1 Multimodal Video Generator

Kling AI launches O1 multimodal model for video and image generation, enabling integrated content creation with advanced AI capabilities.

December 1, 2025by HowAIWorks Team

aideepseek+10

DeepSeek-V3.2: GPT-5 Level Reasoning & Agent AI

DeepSeek releases V3.2 and V3.2-Speciale models with GPT-5 level performance, gold-medal reasoning capabilities, and thinking in tool-use for agents.

December 1, 2025by HowAIWorks Team

aigoogle+10

Google TPUv7 Ironwood: Challenging Nvidia

Google's TPUv7 Ironwood commercializes AI chips externally. Anthropic's 1M TPU order signals potential end to Nvidia's CUDA dominance.

November 29, 2025by HowAIWorks Team

NVIDIAJensen Huang+8

NVIDIA CEO: Automate Every Task with AI

Jensen Huang tells NVIDIA employees to automate every possible task with AI, addressing concerns about job security as company grows to 36,000 workers.

November 28, 2025by HowAIWorks Team

ICLRPeer Review+9

ICLR 2026: 21% of Peer Reviews Are AI-Generated

ICLR 2026 faces controversy as 21% of peer reviews were fully AI-generated, with over half showing AI use, raising academic integrity concerns.

November 28, 2025by HowAIWorks Team

StanfordAI Agents+9

Stanford Launches AI Agentic Paper Reviewer

Stanford ML Group releases PaperReview.ai, an agentic system that provides rapid research paper feedback grounded in latest arXiv publications.

November 28, 2025by HowAIWorks Team

Machine LearningMathematics+8

Complete Roadmap of Math for Machine Learning

A comprehensive guide to the three pillars of ML mathematics: linear algebra, calculus, and probability theory.

November 26, 2025by HowAIWorks Team

NeurIPSMachine Learning+8

NeurIPS 2025 Best Paper Awards: 7 Papers

NeurIPS 2025 announces best paper awards, highlighting breakthroughs in Sparse Attention, agentic reasoning, and energy-efficient neural network architectures.

November 26, 2025by HowAIWorks Team

airobotics+10

UBTech Walker S2: $37M Border Patrol Robot Deal

UBTech Walker S2 humanoid robots are deployed for border patrol, demonstrating the feasibility of autonomous robotic systems in complex outdoor environments.

November 25, 2025by HowAIWorks Team

ClaudeAnthropic+8

Claude Opus 4.5: Best AI for Coding & Agents

Anthropic releases Claude Opus 4.5 with state-of-the-art coding performance, improved efficiency, and new effort parameter for developers.

November 21, 2025by HowAIWorks Team

GoogleGoogle Workspace+10

Google Workspace Oct: Veo 3.1, Security Updates

Google Workspace October 2025 updates: Veo 3.1 in Vids, Gemini in Sheets, ransomware protection, and expanded AI features across Workspace apps.

November 21, 2025by HowAIWorks Team

aiopenai+10

OpenAI GPT-5: Accelerating Scientific Research

OpenAI reveals GPT-5 early experiments showing breakthrough capabilities in mathematics, physics, biology, and computer science research acceleration.

November 21, 2025by HowAIWorks Team

TencentVideo Generation+8

Tencent HunyuanVideo-1.5: Efficient Video Gen

Tencent releases HunyuanVideo-1.5, a compact 8.3B-parameter video generation model with SSTA attention and 1080p super-resolution support.

November 19, 2025by HowAIWorks Team

GoogleSundar Pichai+9

Google CEO Warns of AI Investment Irrationality

Sundar Pichai warns of irrationality in AI investment cycles, comparing current boom to dotcom era while acknowledging no company is immune if bubble bursts.

November 18, 2025by HowAIWorks Team

AlibabaQwen+8

Alibaba Launches Qwen App to Challenge ChatGPT

Alibaba launches Qwen app powered by Qwen3, offering free access and competing directly with ChatGPT in the consumer AI market.

November 18, 2025by HowAIWorks Team

Jeff BezosProject Prometheus+8

Bezos Project Prometheus: AI Initiative Revealed

Jeff Bezos launches Project Prometheus, a major AI initiative focused on developing advanced artificial intelligence technologies for global challenges.

November 18, 2025by HowAIWorks Team

GoogleGemini+10

Google Launches Gemini 3: Most Intelligent AI

Google introduces Gemini 3, its most intelligent AI model with enhanced reasoning, multimodality, and coding capabilities, plus new Google Antigravity platform.

November 18, 2025by HowAIWorks Team

aigrok+7

Grok 4.1: xAI's Breakthrough in Emotional AI

xAI releases Grok 4.1 with #1 LMArena ranking, 64.78% user preference, and enhanced creativity, emotional intelligence, and collaboration capabilities.

November 17, 2025by HowAIWorks Team

ChinaAnalog Computing+10

China's Analog Chip 1,000x Faster Than GPUs

Chinese researchers announce a breakthrough in analog computing using RRAM, enabling ultra-low power AI inference for edge devices and wearable technology.

November 17, 2025by HowAIWorks Team

GoogleGemini+8

Gemini 3 Rumors: Google's Next AI Model

Early rumors about Google's Gemini 3 suggest a focus on real-time world simulation, integrated agentic reasoning, and a 100M+ token context window.

November 17, 2025by HowAIWorks Team

GoogleCode Wiki+10

Google's Code Wiki: Accelerating Understanding

Google launches Code Wiki, an AI-powered platform that automatically generates and maintains structured documentation for code repositories using Gemini.

November 17, 2025by HowAIWorks Team

linkedingenerative-ai+8

LinkedIn AI Cookbook: Scaling People Search

LinkedIn reveals how it scaled generative AI-powered people search to 1.3 billion users using model distillation and collaborative design techniques.

November 17, 2025by HowAIWorks Team

aiopenai+9

OpenAI: Interpretability via Sparse Circuits

OpenAI researchers publish findings on Sparse Circuits, revealing how neural networks process information through dedicated, interpretable internal pathways.

November 17, 2025by HowAIWorks Team

Qwen CodeAI CLI Tools+10

Qwen Code v0.2.2: Stability Improvements

Alibaba's Qwen Code CLI tool releases v0.2.2 with performance enhancements and bug fixes, continuing rapid development of this AI-powered developer tool.

November 14, 2025by HowAIWorks Team

Y CombinatorAI Coding+9

Chad IDE: Y Combinator's Brainrot Coding Tool

Y Combinator-backed Chad IDE lets developers gamble, watch TikToks, and play games while AI coding assistants work, sparking debate about productivity.

November 14, 2025by HowAIWorks Team

UBTechHumanoid Robots+8

UBTech's Walker S2: $112M in Factory Orders

UBTech Robotics secures $112 million in orders for Walker S2 humanoid robots as Chinese factories adopt human-shaped automation for industrial tasks.

November 11, 2025by HowAIWorks Team

KaggleGoogle+9

Kaggle & Google Release Free AI Agents Guide

Kaggle and Google publish free 42-page whitepaper on AI agents covering architectures, training methods, and LangChain/LangGraph frameworks.

November 11, 2025by HowAIWorks Team

LongCatOmni-Modal AI+9

LongCat-Flash-Omni: 560B Omni-Modal Model

LongCat releases Flash-Omni, a multi-modal reasoning model that achieves industry-leading performance on real-time vision and audio understanding benchmarks.

November 11, 2025by HowAIWorks Team

Sakana AISudoku-Bench+8

Sakana AI Launches Sudoku-Bench for AI Reasoning

Sakana AI introduces Sudoku-Bench, a creative reasoning benchmark testing human-like problem-solving through Sudoku variants without tool use.

November 10, 2025by HowAIWorks Team

Spatial IntelligenceFei-Fei Li+8

Spatial Intelligence: AI's Next Frontier

Fei-Fei Li explains why spatial intelligence is AI's next breakthrough, enabling machines to understand and interact with the physical world.

November 10, 2025by HowAIWorks Team

MetaASR+7

Meta Omnilingual ASR: 1,600+ Languages Support

Meta introduces Omnilingual ASR supporting 1,600+ languages including 500 low-resource languages, with in-context learning for new languages.

November 6, 2025by HowAIWorks Team

GoogleIronwood TPU+8

Google Ironwood TPU and Axion VMs: AI Inference

Google announces Ironwood TPUs and Axion VMs, providing high-performance, energy-efficient infrastructure for training and deploying frontier AI models.

November 5, 2025by HowAIWorks Team

QwenAlibaba+7

Qwen3-Max-Thinking: Perfect Reasoning Scores

Alibaba's Qwen3-Max-Thinking achieves 100% on AIME 2025 and HMMT, matching OpenAI's top model on reasoning benchmarks while emphasizing step-by-step solutions.

November 4, 2025by HowAIWorks Team

GoogleProject Suncatcher+7

Google Suncatcher: Space AI Infrastructure

Google unveils Project Suncatcher, an ambitious initiative to build space-based AI infrastructure for global low-latency intelligence and sustainable compute.

October 30, 2025by HowAIWorks Team

LangChainDeepAgents+3

LangChain Doubles Down on DeepAgents v0.2

LangChain ships DeepAgents v0.2: plugin backends, offloading big tool outputs, conversation summarization, and safer recovery from interrupted tool calls.

October 29, 2025by HowAIWorks Team

CursorComposer+6

Cursor 2.0: Composer, Multi-Agent UI, Terminal

Cursor 2.0 launches Composer (fast agentic coding), parallel multi-agent workflows, GA browser testing, improved reviews, and sandboxed terminals.

October 26, 2025by HowAIWorks Team

AI TradingQwen+10

Qwen3 Max Leads NOF1 AI Arena: 79% Return

The latest NoF1 AI Arena leaderboard reveals Qwen3-Max as the top-performing model in reasoning and coding, surpassing global competitors in head-to-head tests.

October 26, 2025by HowAIWorks Team

MetaPyTorch+9

OpenEnv: Standard Agent Training Environments

OpenEnv launches a standard for agentic execution environments, enabling AI agents to operate securely across cloud and local platforms with unified protocols.

October 24, 2025by HowAIWorks Team

GoogleEarth AI+11

Google Earth AI: Geospatial Reasoning Updates

Google announces major updates to Earth AI, including Geospatial Reasoning powered by Gemini and expanded access to advanced geospatial data analysis.

October 24, 2025by HowAIWorks Team

IBMToucan+18

IBM Releases Toucan: Largest Tool-Calling Dataset

IBM and University of Washington release Toucan, a groundbreaking dataset of 1.5 million real-world tool-calling scenarios designed to train better AI agents.

October 23, 2025by HowAIWorks Team

DeepSeekOCR+12

DeepSeek: Revolutionary OCR Context Compression

DeepSeek researchers introduce a novel OCR context compression technique that reduces token usage by 80% while maintaining accuracy on complex document tasks.

October 23, 2025by HowAIWorks Team

GoogleGoogle Skills+12

Google Launches Google Skills: 3,000 AI Courses

Google launches a new AI skills platform featuring certified courses and hands-on labs to help professionals master generative AI and machine learning.

October 23, 2025by HowAIWorks Team

OpenAIChatGPT+8

OpenAI Company Knowledge: Business Data Insights

OpenAI launches Company Knowledge for ChatGPT Enterprise, enabling seamless integration with workplace tools like Slack and SharePoint for team productivity.

October 23, 2025by HowAIWorks Team

aipytorch+6

PyTorch Monarch: Distributed Programming

PyTorch announces Project Monarch, a new compiler backend that significantly improves training efficiency for Mixture-of-Experts (MoE) models on NVIDIA GPUs.

October 22, 2025by HowAIWorks Team

AnthropicGoogle+13

Anthropic in Talks with Google for Cloud Deal

AI startup Anthropic discusses a multi-billion dollar cloud computing deal with Google to significantly accelerate AI development and infrastructure scaling.

October 19, 2025by HowAIWorks Team

AnthropicData Poisoning+12

Anthropic: 250 Malicious Docs Can Poison Any LLM

Anthropic publishes research on data poisoning, proposing new defense mechanisms to protect large language models from adversarial training data attacks.

October 19, 2025by HowAIWorks Team

GoogleLAVA+8

Google LAVA: AI-Powered VM Allocation

Google Research introduces LAVA, an AI-driven system that optimizes cloud computing resource allocation through advanced machine learning techniques.

October 19, 2025by HowAIWorks Team

MobileLLMMeta+8

MobileLLM-Pro: Meta's 1B On-Device Model

Meta Reality Labs releases MobileLLM-Pro, a 1B parameter language model optimized for on-device inference with 128k context window and near-lossless int4.

October 19, 2025by HowAIWorks Team

PaddleOCRBaidu+8

PaddleOCR-VL: Baidu's 0.9B Vision-Language Model

Baidu releases PaddleOCR-VL, a multi-modal OCR model that combines advanced visual perception with text recognition for superior document understanding.

October 17, 2025by HowAIWorks Team

AnthropicClaude+17

Anthropic Agent Skills: Customizable AI Tasks

Anthropic announces Agent Skills, a system that allows Claude to load specialized instructions and resources to improve performance on specific tasks.

October 17, 2025by HowAIWorks Team

aibytedance+9

ByteDance's Doubao: China's Leading AI Chatbot

ByteDance's Doubao surpasses DeepSeek to become China's most popular AI chatbot, ranking fourth globally with user-friendly design and viral social features.

October 17, 2025by HowAIWorks Team

ainvidia+10

NVIDIA Omni-Embed-Nemotron-3B: Multimodal RAG

NVIDIA releases Omni-Embed-Nemotron-3B, a versatile multimodal embedding model for text, image, audio, and video content in RAG systems.

October 17, 2025by HowAIWorks Team

aioracle+9

Oracle AI Database 26ai: AI in Data Management

Oracle announces AI Database 26ai, featuring native vector search and autonomous tuning to accelerate generative AI application development and scaling.

October 16, 2025by HowAIWorks Team

NvidiaMicrosoft+10

Nvidia, Microsoft, xAI Lead $40B Deal

Aligned Data Centers secures a $40 billion investment to build the next generation of AI-optimized infrastructure with liquid cooling and sustainable energy.

October 15, 2025by HowAIWorks Team

AppleM5 Chip+10

Apple M5 Chip: Next-Generation AI Performance

Apple announces M5 chip with 4x GPU AI performance, Neural Accelerators, and enhanced unified memory for MacBook Pro, iPad Pro, and Apple Vision Pro.

October 15, 2025by HowAIWorks Team

AnthropicClaude Haiku+16

Claude Haiku 4.5: Near-Frontier Performance

Anthropic announces Claude 4.5 Haiku, bringing frontier-level intelligence to its fastest and most affordable model tier with enhanced reasoning capabilities.

October 15, 2025by HowAIWorks Team

GoogleCoral NPU+10

Google Coral NPU: Full-Stack Platform for Edge AI

Google announces Coral NPU, an open-source platform for ultra-low-power edge AI with RISC-V architecture, enabling all-day AI on wearables and IoT devices.

October 15, 2025by HowAIWorks Team

WaymoAutonomous Vehicles+10

Waymo Brings Autonomous Rides to London in 2026

Waymo announces its expansion to London, marking its first international deployment of autonomous ride-hailing services using the fifth-generation Waymo Driver.

October 14, 2025by HowAIWorks Team

MicrosoftMAI-Image-1+8

Microsoft MAI-Image-1: Top 10 Image Gen Model

Microsoft releases MAI-Image-1, a next-generation diffusion model featuring state-of-the-art text-to-image consistency and advanced architectural efficiency.

October 14, 2025by HowAIWorks Team

ainvidia+8

NVIDIA DGX Spark: Petaflop System for SpaceX

NVIDIA launches DGX Spark, the world's smallest AI supercomputer with 1 petaflop performance and 128GB memory, first delivered to Elon Musk at SpaceX.

October 13, 2025by HowAIWorks Team

aiant-group+13

Ant Group Unveils Ling-1T: Trillion-Param Model

Ant Group releases Ling-1T, a trillion-parameter open-source AI model with state-of-the-art coding, reasoning, and multimodal capabilities.

October 13, 2025by HowAIWorks Team

aiopenai+8

OpenAI & Broadcom: Strategic AI Partnership

OpenAI partners with Broadcom to develop and deploy 10 gigawatts of custom AI accelerators, revolutionizing AI infrastructure for the next generation.

October 12, 2025by HowAIWorks Team

qwenvision-ai+8

Qwen3-VL Cookbooks: Guide to Multimodal Vision AI

Explore Qwen3-VL Cookbooks with practical examples for multimodal AI development, including vision-language tasks, image analysis, and integration guides.

October 12, 2025by HowAIWorks Team

aidiffusion-models+8

RND1: Largest Open Diffusion Language Model

Radical Numerics introduces RND1-Base, a 30B parameter diffusion language model converted from autoregressive architecture with 15% efficiency gains.

October 11, 2025by HowAIWorks Team

GoogleGemini+8

Google Gemini Enterprise: AI for Workplace

Google launches Gemini Enterprise, a comprehensive AI platform that unifies models, agents, and workflows to transform how organizations work, run.

October 8, 2025by HowAIWorks Team

GoogleGemini+8

Google Launches Gemini 2.5 Computer Use Model

Google releases Gemini 2.5 Computer Use model, enabling AI agents to interact with user interfaces through web browsers and mobile apps with lower latency.

October 6, 2025by HowAIWorks Team

aiopenai+8

OpenAI Introduces AgentKit: AI Agent Platform

OpenAI launches AgentKit, a complete toolkit for building, deploying, and optimizing AI agents with visual builder, chat interfaces, and evaluation tools.

October 6, 2025by HowAIWorks Team

OpenAIChatGPT+11

OpenAI Introduces Apps in ChatGPT: New Era

OpenAI launches ChatGPT apps integration with Booking.com, Spotify, Canva and more, plus Apps SDK for developers to build custom applications.

October 2, 2025by HowAIWorks Team

nvidiareinforcement-learning+8

NVIDIA RLP: RL Pretraining for AI Models

NVIDIA introduces RLP (Reinforcement Learning Pretraining), a novel method for training base models with RL to enhance reasoning and strategic decision-making.

October 2, 2025by HowAIWorks Team

OpenAIAI Valuation+8

OpenAI Reaches $500B Valuation: World's Largest

OpenAI completes $6.6B share sale at $500 billion valuation, surpassing SpaceX to become the world's most valuable startup.

October 1, 2025by HowAIWorks Team

aiagents+7

Context Engineering: AI Agent Optimization Guide

Anthropic reveals advanced strategies for managing context in AI agents, from token optimization to long-horizon task handling and multi-agent architectures.

October 1, 2025by HowAIWorks Team

OpenAISora+8

Sora 2: Advanced Video and Audio Generator

OpenAI launches Sora 2, a state-of-the-art video and audio generation model with improved physics accuracy, synchronized audio, and enhanced safety features.

September 30, 2025by HowAIWorks Team

aiglm+9

GLM-4.6: Zhipu AI's Advanced Coding Model

Zhipu AI releases GLM-4.6 with 200K context window, enhanced coding capabilities, and 15% token efficiency improvements for real-world development tasks.

September 29, 2025by HowAIWorks Team

aiagents+8

Building AI Agents with Claude Agent SDK

Anthropic releases the Claude Agent SDK, providing a comprehensive toolkit for developers to build, test, and deploy autonomous AI agents with Claude.

September 29, 2025by HowAIWorks Team

aiclaude+7

Claude Sonnet 4.5: Anthropic's Advanced Model

Anthropic releases Claude Sonnet 4.5 with state-of-the-art coding capabilities, improved reasoning, and the new Claude Agent SDK for developers.

September 29, 2025by HowAIWorks Team

OpenAIChatGPT+10

OpenAI Instant Checkout: Revolutionizing Shopping

OpenAI launches Instant Checkout in ChatGPT, revolutionizing AI commerce with seamless shopping directly in conversations using the new Agentic Commerce.

September 29, 2025by HowAIWorks Team

aiopenai+7

OpenAI Parental Controls: AI Safety for Families

OpenAI introduces parental controls for ChatGPT, enabling safe AI usage for teens with account linking, content filtering, and parental oversight features.

September 28, 2025by HowAIWorks Team

aitools+5

Cursor CLI: AI-Powered Terminal Assistant

Cursor launches CLI tool bringing AI assistance directly to your terminal. Code faster with intelligent command-line automation and script generation.

September 25, 2025by HowAIWorks Team

GoogleGemini+8

Google Updates Gemini 2.5 Flash Models

Google releases improved Gemini 2.5 Flash and Flash-Lite models with 50% cost reduction, better agentic capabilities, and enhanced multimodal features.

September 20, 2025by HowAIWorks Team

aigrok+7

Grok-4 Fast: xAI's New Efficient AI Model

xAI announces Grok-4 Fast with 40% fewer tokens, 98% cost reduction, and 2M context window. Learn about the new efficient AI model.

September 17, 2025by HowAIWorks Team

AI AgentsPayments+8

Google Launches Agent Payments Protocol (AP2)

Google announces AP2, an open protocol for secure AI agent payments with 60+ industry partners including Mastercard, PayPal, and Coinbase.

September 16, 2025by HowAIWorks Team

AI Models3D Modeling+8

Tencent Hunyuan 3D 3.0: Triple Modeling Accuracy

Tencent releases Hunyuan-3D 3.0, a state-of-the-art generative model for high-fidelity 3D asset creation from text prompts and single images in seconds.

September 15, 2025by HowAIWorks Team

aillm+8

Google Speculative Cascades: Faster LLM Inference

Google Research introduces speculative cascades, a revolutionary technique combining speculative decoding with cascades for better LLM inference.

September 14, 2025by HowAIWorks Team

aiagents+8

AI Agent Tools: Anthropic Development Guide

Discover Anthropic's proven techniques for building high-quality tools that maximize AI agent performance, from prototyping to evaluation and optimization.

September 14, 2025by HowAIWorks Team

aimemory+8

mem-agent: AI Model with Persistent Memory

mem-agent: A 4B parameter AI model with persistent memory that rivals models 50x larger. Trained with reinforcement learning on Obsidian-like memory systems.

September 10, 2025by HowAIWorks Team

aiimage-editing+9

Nano Banana: Google's AI Photo Editing Tool 2025

Nano-Banana launches an AI-powered image editor that uses diffusion models to perform complex edits, restyling, and object removal through simple text prompts.

September 10, 2025by HowAIWorks Team

aioracle+8

Oracle & OpenAI: $40B Partnership

Oracle and OpenAI's $40B partnership includes Nvidia chips and 4.5GW data centers. Learn about Stargate project and AI infrastructure impact.

September 9, 2025by HowAIWorks Team

aiapple+10

AirPods Pro 3: AI Live Translation Revolution

AirPods Pro 3 feature AI-powered Live Translation in 9 languages. Discover how Apple Intelligence enables real-time speech translation through earbuds.

September 1, 2025by HowAIWorks Team

aimodels+5

Introducing Our New AI Models Section

We've added a comprehensive /models section featuring detailed profiles of the latest AI models, from GPT-5 to Claude Opus 4.1 and beyond.

August 1, 2025by HowAIWorks Team

aieducation+1

Welcome to HowAIWorks.ai

Learn about our mission to make AI education accessible and simple for everyone.

Want to Learn More?

Explore our comprehensive lessons and glossary to deepen your AI knowledge.

Browse Glossary Browse Glossary