Smart on Wilson Wu

https://wilsonwu.me/en/tags/smart/

Let Models Choose Models: Embedding-Driven Smart Routing for LLMs
https://wilsonwu.me/en/blog/2026/llm-smart-router/
Tue, 26 May 2026 00:00:00 +0000

In an AI architecture where multiple models coexist, such as GPT-4, GPT-4o, lightweight models, and vertical-domain models, one core question is how the system can automatically select the most suitable model when the caller does not explicitly specify a model ID. This article introduces an engineering-friendly approach: use an embedding model to capture user intent, perform semantic matching at the gateway layer, and dynamically route each request to the most suitable upstream model service.
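
The routing idea in the summary above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the article's implementation: the route names and descriptions in `ROUTES` are hypothetical, and `embed` is a toy bag-of-words stand-in for a real embedding model so that the example runs without external services. A real gateway would presumably call an embedding model here and forward the request to the chosen upstream.

```python
import math
from collections import Counter

# Hypothetical route table: upstream name -> short description of the intent it serves.
ROUTES = {
    "code-model": "write debug refactor python function code bug",
    "general-model": "explain summarize translate write email essay",
    "math-model": "solve equation integral proof calculate number",
}


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Embed each route description once, up front, rather than per request.
ROUTE_VECTORS = {name: embed(desc) for name, desc in ROUTES.items()}


def route(prompt: str, default: str = "general-model", threshold: float = 0.1) -> str:
    """Pick the route whose description is most similar to the prompt.

    Falls back to `default` when no route clears the similarity threshold.
    """
    query = embed(prompt)
    best_name, best_score = max(
        ((name, cosine(query, vec)) for name, vec in ROUTE_VECTORS.items()),
        key=lambda pair: pair[1],
    )
    return best_name if best_score >= threshold else default
```

Calling `route("debug this python function")` selects `"code-model"` in this sketch, while a prompt that matches no description, such as `route("hello there")`, falls back to the default route. The threshold and fallback are design assumptions added here so that low-confidence matches do not get sent to a specialized model.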