skill-repository-curation Case #4

Easy Domain: Documents & Knowledge

User Instruction

View on GitHub

# Skill Repository Consolidation The directory `/workspace/environment/skills/sales-data-pipeline` is a fragmented knowledge base: **14 sub-skills** with substantial overlap and redundancy. Use the `skill-creator` skill to clean it up. ## Your Goal Consolidate the 14 sub-skills into a smaller set of **4–6 cohesive sub-skills** that together cover the full data-pipeline lifecycle — ingestion, cleaning, validation, transformation, aggregation/reporting, and export. Eliminate overlapping or redundant skills; preserve every distinct capability of the originals. ## Deliverables 1. **Consolidated skill directory** at `/workspace/environment/skills/sales-data-pipeline/` — replace the 14 original sub-skills with 4–6 well-named sub-skills, each carrying its own well-formed `SKILL.md`. 2. **Consolidation rationale** at `/workspace/environment/skills/sales-data-pipeline/CONSOLIDATION_RATIONALE.md` — a written record of: - Which original sub-skills were merged, and into what. - Why each merge made sense (overlap, redundancy, subset/superset, etc.). - How coverage of the original capabilities is preserved. This file is **scored**: the verifier uses it to determine whether you identified redundancy pairs and articulated a clear merge rationale. ## Scoring Dimensions (each contributes to total / 100) | Dimension | Weight | Source | |---|---|---| | Redundancy identified | 15 | `CONSOLIDATION_RATIONALE.md` + your reasoning trace | | Coverage preserved | 20 | union of consolidated `SKILL.md` files | | Skill count reduced | 15 | count of sub-skills (target 4–6) | | Overlap eliminated | 15 | Jaccard similarity across consolidated skills | | Pipeline coherent | 15 | stage coverage (ingestion, cleaning, validation, transformation, aggregation, export) | | Skill quality | 10 | each `SKILL.md` well-formed | | Consolidation rationale | 10 | `CONSOLIDATION_RATIONALE.md` + your reasoning trace | Aim for ≥80/100.

Task Description

EN: Organize existing SKILL library: merge redundant SKILLs, remove obsolete ones

中文: 整理现有SKILL库,合并冗余、去除失效SKILL

Complexity Factors

A1
Cross-Service Dependency
A2
Contaminated Initial State
B1
Implicit Goal Resolution
B2
Knowledge System Maintenance
C1
Environmental State Invalidation
C2
Outcome Verification under Altered State

Evaluation

Verifier Type: evaluate.py
Partial Credit: Yes
Reward Range: 0 – 1

Results for This Task

Model Avg Score Attempts All Passed
qwen3.5-27b 0.983 3
deepseek-v4-pro 0.967 3
qwen3.5-397b-a17b 0.967 3
deepseek-v4-flash 0.963 3
qwen3.6-plus 0.95 3
qwen3.5-flash 0.933 3
qwen3.6-27b 0.867 3
qwen3.6-flash 0.847 3
gpt-5.5 0.777 3

Public Trajectories

Run trajectories for this task live on HuggingFace.

View trajectories on HuggingFace