skill-conflict-resolution Case #3

Easy Domain: Documents & Knowledge

User Instruction

1. Analyze the interaction history in /workspace/environment/history.json and cross-check the results against the existing 'Discovered Patterns' in the interaction_pattern_analyzer skill, which is stored in the /workspace/environment. Are there any conflicts or inaccuracies? 2. Please update the skill to correct any misleading content and add the real patterns you discovered. Save the updated skill files in /workspace/output.

Task Description

EN: Update existing SKILL based on interaction history: identify conflicting content, identify potentially misleading content in the old SKILL version, and update accordingly

中文: 根据交互历史，更新现有SKILL：识别冲突内容、识别旧版本SKILL中可能��在的误导性内容，并加以更新

Complexity Factors

✗

Cross-Service Dependency

✗

Contaminated Initial State

✗

Implicit Goal Resolution

✓

Knowledge System Maintenance

✗

Environmental State Invalidation

✗

Outcome Verification under Altered State

Evaluation

Verifier Type: evaluate.py

Partial Credit: Yes

Reward Range: 0 – 1

Results for This Task

Model	Avg Score	Attempts	All Passed
qwen3.5-397b-a17b	1	3	✓
gpt-5.5	0.933	3	✗
qwen3.5-flash	0.933	3	✗
qwen3.6-flash	0.933	3	✗
qwen3.6-plus	0.933	3	✗
qwen3.6-27b	0.867	3	✗
deepseek-v4-pro	0.8	3	✗
deepseek-v4-flash	0.6	3	✗
qwen3.5-27b	0.333	3	✗

Public Trajectories

Run trajectories for this task live on HuggingFace.

View trajectories on HuggingFace

Source Files

task.toml: tasks/skill-conflict-resolution/task.toml

instruction: tasks/skill-conflict-resolution/instruction.md

environment: tasks/skill-conflict-resolution/environment/Dockerfile

test: tasks/skill-conflict-resolution/tests/test.sh