All Articles

2026 ⁵

May ¹

Let Models Choose Models: Embedding-Driven Smart Routing for LLMs

May 26, 2026 · 5 min

April ¹

AI Programming Enters the Skill Era: From Prompts to the Leap Toward Capability Packaging

April 22, 2026 · 5 min

March ¹

Are AAA Games Falling One by One? An In-Depth Look at the Hypervisor Cracking Controversy

March 26, 2026 · 3 min

January ²

Clawdbot: Empower Your Own Powerful AI Assistant on Azure

January 25, 2026 · 4 min

Microsoft TRELLIS: A Large Model for Production-Grade 3D Asset Generation and Guide to Deployment on Azure

January 19, 2026 · 6 min

2025 ¹⁵

December ⁶

Comprehensive Analysis of LLM Inference Parallelism Strategies: TP / DP / PP / EP Principles and vLLM Performance Verification

December 24, 2025 · 5 min

Accelerating LLM Inference: Decoupling Prefill and Decode (PD Disaggregation)

December 22, 2025 · 5 min

Image Generation Enters the Platform Era: GPT-Image-1.5 in Microsoft Foundry

December 17, 2025 · 6 min

Building a Smart Address Parsing Chrome / Edge Extension with Azure OpenAI

December 15, 2025 · 4 min

A Beginner’s Guide to LLM Architectures

December 10, 2025 · 7 min

Kubernetes Ingress NGINX Retirement: Comprehensive Migration Plan and Practice Guide to Gateway API

December 1, 2025 · 5 min

November ⁴

Fara-7B: Microsoft’s Efficient Agentic Model for Computer Use

November 28, 2025 · 3 min

Quickly Configure the Latest Gemini 3 Pro Model for Github Copilot to Accelerate Development Experience

November 20, 2025 · 3 min

Understanding KV-Cache - The Core Acceleration Technology for LLM Inference

November 18, 2025 · 6 min

Easily Generate Videos with Sora 2 from Azure AI Foundry

November 10, 2025 · 5 min

October ¹

A Comprehensive Guide to LLM Fine-Tuning: Methods, Comparisons, and Best-Fit Scenarios

October 28, 2025 · 4 min

September ²

Optimizing Inference with Parameter/Data (P/D) Separation in vLLM Framework

September 29, 2025 · 5 min

Getting Started with Microsoft’s Latest Open-Source Long-Form Speech Model VibeVoice

September 18, 2025 · 4 min

July ¹

A Beginner’s Guide to Inference with the SGLang Framework

July 10, 2025 · 5 min

February ¹

Easily Deploy and use DeepSeek-R1 with Azure AI Foundry

February 10, 2025 · 7 min

2024 ²

June ²

Building Your Own ChatGPT on Azure Without Writing Any Code

June 25, 2024 · 5 min

Azure 101 Series: Microsoft Azure Overview

June 19, 2024 · 13 min

2026 5

May 1