qwen_training_data_miner_prototype
OfficialMine 012.txt for training data.
Software Engineering#log analysis#pattern extraction#instruction tuning#training data#ai model training#data mining
AuthorFOUNDUPS
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill automates the extraction of high-quality, domain-specific training examples from a large log file (012.txt) to create instruction-tuning datasets for training AI models like Gemma.
Core Features & Use Cases
- Data Mining: Scans a large text file (012.txt) for specific patterns related to defined knowledge domains (e.g., MPS scoring, WSP application).
- Example Extraction: Converts identified patterns into a structured instruction-tuning format, including input, output, and rationale.
- Quality Filtering: Applies strict criteria to ensure only high-quality, complete, and unambiguous examples are retained.
- Dataset Generation: Outputs a ready-to-use JSON dataset for model training, complete with metadata, recommended configurations, and autonomous execution instructions.
- Use Case: Automatically generate a dataset for training an AI to score tasks using the MPS methodology by mining historical decision data.
Quick Start
Use the qwen_training_data_miner_prototype skill to mine 012.txt for MPS scoring examples and generate a training dataset.
Dependency Matrix
Required Modules
None requiredComponents
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: qwen_training_data_miner_prototype Download link: https://github.com/FOUNDUPS/Foundups-Agent/archive/main.zip#qwen-training-data-miner-prototype Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.