Minion：比Anthropic更早实现大模型Programmatic Tool Calling范式的国产开源项目

2025/12/10 21:44:46

502 阅读

本文转载自 Minion 开源项目的作者，旨在为读者呈现 Programmatic Tool Calling（PTC）与代码编排式 Agent 架构的技术背景与实践脉络。

2025年11月24日，Anthropic正式发布了Programmatic Tool Calling (PTC)特性，允许Claude通过代码而非单次API调用来编排工具执行。这一创新被认为是Agent开发的重要突破，能够显著降低token消耗、减少延迟并提升准确性。

然而，作为minion框架的创建者，我想分享一个有趣的事实：minion从一开始就采用了这种架构理念。在PTC概念被正式提出之前，minion已经在生产环境中证明了这种方法的价值。

PTC解决了什么问题？

Anthropic在博文中指出了传统Tool Calling的两个核心问题：

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送

返回博客列表

# Minion的典型工作流
1. LLM分析用户需求，制定执行计划
2. LLM生成Python代码来编排工具调用
3. 代码在隔离环境中执行，处理所有数据操作
4. 只有最终结果返回给LLM

# Minion中的实现（伪代码）
async def check_budget_compliance():
    # LLM生成的计划代码
    team = await get_team_members("engineering")

    # 并行获取所有数据
    levels = list(set(m["level"] for m in team))
    budgets = {
        level: await get_budget_by_level(level)
        for level in levels
    }

    # 数据处理在本地完成
    exceeded = []
    for member in team:
        expenses = await get_expenses(member["id"], "Q3")
        total = sum(e["amount"] for e in expenses)
        budget = budgets[member["level"]]

        if total > budget["travel_limit"]:
            exceeded.append({
                "name": member["name"],
                "spent": total,
                "limit": budget["travel_limit"]
            })

    return exceeded  # 只返回关键结果

# Minion可以直接使用任何Python库
import pandas as pd
import numpy as np
from sklearn.cluster import KMeans

# 强大的数据处理
df = pd.DataFrame(expense_data)
analysis = df.groupby('category').agg({
    'amount': ['sum', 'mean', 'std'],
    'count': 'size'
})

# 复杂的数据科学任务
model = KMeans(n_clusters=3)
clusters = model.fit_predict(spending_patterns)

class BudgetAnalyzer:
    def __init__(self):
        self.cache = {}
        self.history = []

    async def analyze_department(self, dept):
        # 状态在整个分析过程中保持
        if dept in self.cache:
            return self.cache[dept]

        result = await self._deep_analysis(dept)
        self.cache[dept] = result
        self.history.append(result)
        return result

async def robust_fetch(user_id, max_retries=3):
    for attempt in range(max_retries):
        try:
            return await get_expenses(user_id, "Q3")
        except RateLimitError:
            await asyncio.sleep(2 ** attempt)
        except DataNotFoundError:
            return []  # 合理的默认值
    raise Exception(f"Failed after {max_retries} attempts")

# 高效的并行处理
async def analyze_all_departments():
    departments = ["eng", "sales", "marketing", "ops"]

    # 同时分析所有部门
    results = await asyncio.gather(*[
        analyze_department(dept)
        for dept in departments
    ])

    # 整合分析结果
    return consolidate_results(results)

用户请求
    ↓
[LLM：理解意图，制定计划]
    ↓
[生成Python代码]
    ↓
[代码执行环境：调用工具、处理数据、控制流程]
    ↓
[返回结构化结果]
    ↓
[LLM：解读结果，生成用户友好的响应]

# Minion的工具分层策略
class MinionToolRegistry:
    def __init__(self):
        self.core_tools = []      # 始终加载
        self.domain_tools = {}    # 按需加载
        self.rare_tools = {}      # 搜索发现

    def get_tools_for_task(self, task_description):
        # 智能工具选择
        tools = self.core_tools.copy()

        # 基于任务描述添加相关工具
        if "database" in task_description:
            tools.extend(self.domain_tools["database"])

        if "visualization" in task_description:
            tools.extend(self.domain_tools["plotting"])

        return tools

# 使用embedding的工具搜索
from sentence_transformers import SentenceTransformer

class SemanticToolSearch:
    def __init__(self, tool_descriptions):
        self.model = SentenceTransformer('all-MiniLM-L6-v2')
        self.tool_embeddings = self.model.encode(tool_descriptions)

    def find_tools(self, query, top_k=5):
        query_embedding = self.model.encode([query])
        similarities = cosine_similarity(query_embedding, self.tool_embeddings)
        return self.get_top_tools(similarities, top_k)

async def detect_anomalies():
    # LLM规划：需要获取数据、清洗、特征工程、异常检测

    # 执行代码直接处理大数据集
    transactions = await fetch_all_transactions(start_date, end_date)
    # 1M+ records, 但不进入LLM context

    df = pd.DataFrame(transactions)
    df = clean_data(df)
    features = engineer_features(df)

    # 使用机器学习检测异常
    anomalies = detect_with_isolation_forest(features)

    # 只返回异常摘要给LLM
    return {
        "total_transactions": len(df),
        "anomalies_found": len(anomalies),
        "top_anomalies": anomalies.head(10).to_dict()
    }

async def comprehensive_customer_analysis(customer_id):
    # 并行获取所有数据源
    crm_data, support_tickets, usage_logs, billing_history = await asyncio.gather(
        fetch_crm_data(customer_id),
        fetch_support_tickets(customer_id),
        fetch_usage_logs(customer_id),
        fetch_billing_history(customer_id)
    )

    # 本地数据融合和分析
    customer_profile = {
        "health_score": calculate_health_score(...),
        "churn_risk": predict_churn_risk(...),
        "upsell_opportunities": identify_opportunities(...),
        "support_sentiment": analyze_ticket_sentiment(support_tickets)
    }

    return customer_profile

async def deploy_with_validation():
    # 多步骤工作流，每步都有条件逻辑

    # 1. 运行测试
    test_results = await run_test_suite()
    if test_results.failed > 0:
        return {"status": "blocked", "reason": "tests failed"}

    # 2. 构建和推送镜像
    image = await build_docker_image()
    await push_to_registry(image)

    # 3. 金丝雀部署
    canary = await deploy_canary(image, percentage=10)
    await asyncio.sleep(300)  # 监控5分钟

    metrics = await get_canary_metrics(canary)
    if metrics.error_rate > 0.01:
        await rollback_canary(canary)
        return {"status": "rolled_back", "metrics": metrics}

    # 4. 完整部署
    await deploy_full(image)
    return {"status": "success", "image": image.tag}

# 简单任务：直接工具调用
if task.complexity < THRESHOLD:
    result = await simple_tool_call(task)

# 复杂任务：生成编排代码
else:
    orchestration_code = await llm.generate_code(task)
    result = await execute_code(orchestration_code)

# 记忆化的数据获取
@lru_cache(maxsize=1000)
async def cached_get_user_data(user_id):
    return await fetch_user_data(user_id)

# 增量更新而非全量重算
async def update_analysis(new_data):
    previous_state = load_checkpoint()
    delta = compute_delta(previous_state, new_data)
    updated_state = apply_delta(previous_state, delta)
    return updated_state

# 规划用强模型
plan = await claude_opus.create_plan(user_request)

# 代码生成用专门模型
code = await codegen_model.generate(plan)

# 执行和监控
result = await execute_with_monitoring(code)

# 用户交互用快速模型
response = await claude_haiku.format_response(result)

# Anthropic的PTC需要特定配置
{
    "tools": [
        {
            "type": "code_execution_20250825",
            "name": "code_execution"
        },
        {
            "name": "get_team_members",
            "allowed_callers": ["code_execution_20250825"],
            ...
        }
    ]
}

# Claude生成工具调用
{
    "type": "server_tool_use",
    "id": "srvtoolu_abc",
    "name": "code_execution",
    "input": {
        "code": "team = get_team_members('engineering')\\\\n..."
    }
}

# Minion的工具定义是标准Python
class MinionTools:
    @tool
    async def get_team_members(self, department: str):
        """Get all members of a department"""
        return await self.db.query(...)

    @tool
    async def get_expenses(self, user_id: str, quarter: str):
        """Get expense records"""
        return await self.expenses_api.fetch(...)

# LLM生成的是完整的Python函数
async def analyze_budget():
    # 直接调用工具函数
    team = await tools.get_team_members("engineering")

    # 完整的Python语言能力
    expenses_by_user = {
        member.id: await tools.get_expenses(member.id, "Q3")
        for member in team
    }

    # 任意复杂度的数据处理
    analysis = perform_complex_analysis(expenses_by_user)
    return analysis

传统模式：LLM <-> Tool <-> LLM <-> Tool <-> LLM
          (慢)   (贵)   (脆弱)

编排模式：LLM -> [Code: Tool+Tool+Tool+Processing] -> LLM
          (快)   (省)   (可靠)

Minion：比Anthropic更早实现大模型Programmatic Tool Calling范式的国产开源项目 | DataLearnerAI

Minion：比Anthropic更早实现大模型Programmatic Tool Calling范式的国产开源项目

PTC解决了什么问题？

DataLearner 官方微信

1. Context污染问题

2. 推理开销与手动综合

Minion的解决方案：天然的PTC架构

核心设计理念

实际案例对比

Minion的优势：更进一步

1. 完整的Python生态系统

2. 状态管理和持久化

3. 错误处理和重试逻辑

4. 并行和异步操作

性能数据对比

架构哲学：谁应该做什么？

Tool Search Tool：Minion的动态工具发现

分层工具暴露

向量搜索工具发现

实际应用：Minion在生产环境

案例1：大规模数据分析

案例2：多源数据整合

案例3：自动化工作流

超越PTC：Minion的未来方向

1. 混合推理模式

2. 增量计算和缓存

3. 多模型协作

开源的力量：社区驱动的创新

技术细节：实现对比

PTC的实现方式

Minion的实现方式

为什么这个架构如此重要？

1. 经过验证的架构

2. 先发优势

3. 更广泛的适用性

4. 社区和生态

结论：架构的必然收敛

相关资源

视频演示