Data Pipeline Analyst
Turn messy CSVs into clean, queryable datasets with validation
// First 7 days
What you can have running fast.
- 01 Get a ready-to-run system that replaces blank-page setup.
- 02 Ship a usable package with 10 included files and a working structure.
- 03 Move from purchase to first setup in about 3 minutes.
// Included files
What is inside the package.
Description
What is Data Pipeline Analyst?
A data engineering skill for agents that work with structured data. The agent inspects schemas, detects anomalies, writes transformation logic, validates output against expectations, and produces data quality reports. Supports CSV, JSON, SQL databases, and Parquet files. Includes common transforms for deduplication, normalization, and enrichment.
Upgrade path
- 01 Start with this package and validate the workflow.
- 02 Add specialized skills or bundles once the core system is stable.
- 03 Use the community to sharpen positioning, demos, and feedback loops.
# Data Pipeline Analyst Skill
## Pipeline Protocol
### 1. Schema Inspection
- Read first 100 rows and infer column types
- Report: row count, null rate per column, unique counts
- Flag: mixed types, encoding issues, date format inconsistencies
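
The inspection step above could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code shipped in the package: `inspect_schema`, `infer_type`, and `NULL_TOKENS` are hypothetical names, the list of null spellings is an assumption, and encoding and date-format checks are left out.

```python
import csv
import io
from itertools import islice

# Assumed spellings of "missing"; adjust to the dataset at hand.
NULL_TOKENS = {"", "null", "none", "na", "n/a", "nan"}


def infer_type(value):
    """Classify one non-null cell as 'int', 'float', or 'str'."""
    try:
        int(value)
        return "int"
    except ValueError:
        pass
    try:
        float(value)
        return "float"
    except ValueError:
        return "str"


def inspect_schema(lines, sample_size=100):
    """Profile the first `sample_size` data rows of a CSV (iterable of lines)."""
    reader = csv.DictReader(lines)
    rows = list(islice(reader, sample_size))
    report = {"sampled_rows": len(rows), "columns": {}}
    for name in reader.fieldnames or []:
        raw = [row.get(name) for row in rows]
        present = [
            v.strip()
            for v in raw
            if v is not None and v.strip().lower() not in NULL_TOKENS
        ]
        kinds = {infer_type(v) for v in present}
        if not kinds:
            col_type = "unknown"      # every sampled value was null
        elif kinds == {"int"}:
            col_type = "int"
        elif kinds <= {"int", "float"}:
            col_type = "float"        # ints promote to float
        elif kinds == {"str"}:
            col_type = "str"
        else:
            col_type = "mixed"        # numbers and text in one column: flag it
        report["columns"][name] = {
            "type": col_type,
            "null_rate": (len(raw) - len(present)) / len(raw) if raw else 0.0,
            "unique": len(set(present)),
        }
    return report


sample = "id,amount,city\n1,10.5,Oslo\n2,,Oslo\n3,abc,Bergen\n"
report = inspect_schema(io.StringIO(sample))
print(report)
```

Note that the counts describe only the sampled rows; a full row count needs a separate pass over the whole file.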
### 2. Quality Check
- Duplicates: check by primary key or full-row hash
- Outliers: flag values > 3 std devs from mean (numeric cols)
- Missing: report null percentage, suggest imputation strategy
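
One way to picture the three checks above is the sketch below. It assumes rows are already loaded as a list of dicts; `find_duplicates`, `find_outliers`, and `null_percentage` are illustrative names rather than part of the package, and suggesting an imputation strategy is not covered.

```python
import hashlib
import statistics


def find_duplicates(rows, key=None):
    """Return indexes of rows that repeat an earlier row.

    With `key`, rows are compared on that column (primary key check);
    without it, on a hash of the full row.
    """
    seen = set()
    dupes = []
    for i, row in enumerate(rows):
        if key is not None:
            fingerprint = row[key]
        else:
            # Sort columns so the hash does not depend on dict ordering.
            canonical = "\x1f".join(f"{k}={row[k]}" for k in sorted(row))
            fingerprint = hashlib.sha256(canonical.encode("utf-8")).hexdigest()
        if fingerprint in seen:
            dupes.append(i)
        else:
            seen.add(fingerprint)
    return dupes


def find_outliers(values, threshold=3.0):
    """Return values more than `threshold` sample std devs from the mean."""
    if len(values) < 2:
        return []
    mean = statistics.fmean(values)
    stdev = statistics.stdev(values)
    if stdev == 0:
        return []
    return [v for v in values if abs(v - mean) > threshold * stdev]


def null_percentage(values):
    """Percentage of values that are None or an empty string."""
    if not values:
        return 0.0
    missing = sum(1 for v in values if v is None or v == "")
    return 100.0 * missing / len(values)


rows = [
    {"id": 1, "v": "a"},
    {"id": 2, "v": "b"},
    {"id": 1, "v": "c"},   # repeats primary key 1
    {"id": 2, "v": "b"},   # exact copy of the second row
]
dupes_by_key = find_duplicates(rows, key="id")
dupes_by_hash = find_duplicates(rows)
outliers = find_outliers([10.0] * 20 + [1000.0])
null_pct = null_percentage([1, None, "", 4])
```

A caveat on the 3 std dev rule: with the sample standard deviation, a column with fewer than 11 values can never produce a flagged point, and extreme values inflate the very mean and deviation they are measured against. For small or heavily skewed columns a median-based rule may be a better fit.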
### 3. Transform Plan
Before writing any transform code:
- State input schema -> output schema
- List every column that changes and why
- Estimate output row count
- Write validation query to confirm correctness
// Community acceleration
Use the room after the purchase.
Bring your workflow into the Solo Unicorn community for sharper feedback, operator critique, and more visibility once the system is live.
Related products
More from this shelf.
- TDD Master Skill (automate): Write the test first, every time, without being told
- Debug Detective (automate): Systematic bug isolation instead of random print statements
- Code Review Pro (automate): Catch real bugs, skip the nitpicks
- Research Analyst (automate): Deep research with sources, not hallucinated summaries