vLLM
推测解码
GPU
Harness Engineering
Harness Engineering 指的是一种开发方式:工程师不直接写大量代码,而是设计环境、规则和测试反馈系统,让 AI Agent 自动生成并改进代码。- Effective harnesses for long-running agents
- Harness engineering: leveraging Codex in an agent-first world
- Minions: Stripe’s one-shot, end-to-end coding agents
- Minions: Stripe’s one-shot, end-to-end coding agents—Part 2
- Vibe Coding AReaL:零手打代码开发分布式 RL 训练框架
