Azure

AI Programming Enters the Skill Era: From Prompts to the Leap Toward Capability Packaging

Over the past year, the mainstream way developers interact with AI for programming has gone through a clear evolution: Prompt -> Prompt Engineering -> Context Engineering -> Skill (Capability Packaging) If a prompt is a one-off invocation of model capability, then a Skill is essentially a reusable intelligent capability module. It enables AI to evolve from “answering questions” to “executing tasks.” This article will systematically explain: What a Skill means in the context of AI programming The fundamental difference between a Skill and a prompt How to apply Skill thinking in VS Code + GitHub Copilot Chat Real engineering-level examples What Is a Skill?...

Clawdbot: Empower Your Own Powerful AI Assistant on Azure

With the rapid development of generative AI and intelligent agents, self-hosted intelligent systems are gaining increasing attention from developers and technical teams. Clawdbot is one such open-source, self-hostable personal AI assistant. It not only engages in conversation with users but also executes tasks, integrates with messaging platforms, provides automation capabilities, and can be deployed on cloud platforms like Azure. This article will step-by-step introduce Clawdbot’s capabilities, architecture, and how to deploy and start using it on Microsoft Azure....

Microsoft TRELLIS: A Large Model for Production-Grade 3D Asset Generation and Guide to Deployment on Azure

At the end of 2025, Microsoft Research released an open-source large model project for 3D content creation called TRELLIS, accompanied by the academic paper “Structured 3D Latents for Scalable and Versatile 3D Generation”. This project significantly improves the quality and flexibility of text/image-to-3D asset generation through a unified structured latent space and advanced flow model technology. It also expands the multi-format output and editing capabilities of 3D models, making it one of the key technologies in the current 3D AI model ecosystem....

Image Generation Enters the Platform Era: GPT-Image-1.5 in Microsoft Foundry

In recent years, generative AI technology has evolved rapidly. Beyond natural language processing, image generation and editing capabilities have become a key frontier of AI innovation. In this trend, OpenAI launched the GPT Image series models, which are also available within the Azure OpenAI Service. The newly released GPT-Image-1.5 can be seen as the new flagship in the field of image generation, offering significant improvements in performance, efficiency, and controllability....

Building a Smart Address Parsing Chrome / Edge Extension with Azure OpenAI

In the e-commerce and logistics sectors, there is an overlooked but extremely time-consuming pain point: entering complex address formats correctly into fixed fields in logistics systems. Addresses sent by customers come in all sorts of strange formats—sometimes comma-separated, sometimes all in one line, with completely random orders, and often missing key information like province or state. In this article, I want to share how I built Auto Address, a Chrome and Edge browser extension that leverages the power of Azure OpenAI to solve this problem....

Easily Generate Videos with Sora 2 from Azure AI Foundry

With Azure AI Foundry opening support for Sora 2 (OpenAI’s generative video model), developers can now access top-tier video generation capabilities in an enterprise-grade, compliant, and controllable environment. This tutorial will take you from zero to production, showing how to call Sora 2 via the Playground and the Python SDK to complete a “text-to-video” workflow. Prerequisites Before starting, you need: Get an Azure subscription You need an Azure subscription. If you’re unsure how to get one, refer to the subscription registration section in my earlier article....

Optimizing Inference with Parameter/Data (P/D) Separation in vLLM Framework

Large language models often encounter GPU memory bottlenecks during inference deployment: Model parameters (P) can reach hundreds of GB and must remain resident in GPU memory. Input/output data (D) changes dynamically with each request but is often coupled with parameters on the same device, leading to imbalanced memory usage and limited scalability. To solve this problem, we can leverage the vLLM framework to implement Parameter/Data (P/D) Separation, improving the flexibility and throughput of inference systems....

Getting Started with Microsoft’s Latest Open-Source Long-Form Speech Model VibeVoice

What is VibeVoice? VibeVoice is a research framework released by Microsoft Research for long-form, multi-speaker, conversational speech synthesis. Target scenarios include entire podcast episodes, audio dramas, or interviews: it can maintain speaker consistency within a single generation and handle natural turn-taking. The model family includes multiple scales (e.g., 1.5B, 7B, etc.) and is available on Hugging Face as microsoft/VibeVoice-1.5B, along with model cards, weights, installation guides, and responsible use notes....

A Beginner’s Guide to Inference with the SGLang Framework

As large language models (LLMs) grow in popularity, the focus for enterprises and individuals has shifted from training to inference (in other words, moving from “building wheels” to practical usage). In the field of inference, the two hottest frameworks are undoubtedly vLLM and SGLang. As a rising star, SGLang has also attracted attention. Today, we’ll explore SGLang through a beginner-friendly tutorial to help more people understand both LLM inference and the SGLang framework....

Easily Deploy and use DeepSeek-R1 with Azure AI Foundry

The popularity of DeepSeek has once again showcased the charm of AI. However, this has not led to a reduction in the demand for computing power. Instead, it has brought about another wave of demand for computing power by building more AI business scenarios in low-cost, user-friendly artificial intelligence. Today, we will quickly experience the elegance of DeepSeek through Azure AI Foundry (formerly Azure AI Studio). Prerequisites First, you need to have an Azure subscription....

Building Your Own ChatGPT on Azure Without Writing Any Code

Using ChatGPT to help us solve problems in our work and daily life has become a habit. However, after using the official GPT-4o heavily, we may encounter temporary quota issues. Today, we will show you how to easily build your own personalized ChatGPT application using Azure OpenAI services. Prerequisites Before we begin, make sure you have an Azure global subscription. If you don’t have one yet, you can easily start an Azure subscription through Pay-as-you-go:...

Azure 101 Series: Microsoft Azure Overview

Azure is a cloud computing platform and service provided by Microsoft. It offers a range of infrastructure as a service (IaaS), platform as a service (PaaS), and software as a service (SaaS) solutions for building, deploying, and managing various types of applications and services. Overview Azure provides a wide range of features and services, including virtual machines, storage, databases, artificial intelligence, machine learning, blockchain, Internet of Things (IoT), containers, and serverless computing....