email-writing Case #6

Easy Domain: Communication & Email email

User Instruction

View on GitHub

Please send an email to my leader, Mary, at the company email system website (http://localhost:5174/, open it in browser) informing her that the serious BUG she discovered last night has been fixed, and asking when she would be available to discuss plans for the new project. Her email address is mary.lee@work.mosi.inc

Task Description

EN: Help the user compose and send an email to a specified recipient

中文: 让OpenClaw帮助用户给指定收件人编辑并发送邮件

Complexity Factors

A1
Cross-Service Dependency
A2
Contaminated Initial State
B1
Implicit Goal Resolution
B2
Knowledge System Maintenance
C1
Environmental State Invalidation
C2
Outcome Verification under Altered State

Evaluation

Verifier Type: verify.py
Partial Credit: Yes
Reward Range: 0 – 1

Results for This Task

Model Avg Score Attempts All Passed
deepseek-v4-flash 1 3
deepseek-v4-pro 1 3
gpt-5.5 1 3
qwen3.5-397b-a17b 1 3
qwen3.6-27b 1 3
qwen3.6-flash 1 3
qwen3.6-plus 1 3
qwen3.5-27b 0.667 3
qwen3.5-flash 0 3

Public Trajectories

Run trajectories for this task live on HuggingFace.

View trajectories on HuggingFace