Solo Unicorn Club logo

Skill Packs

Data Pipeline Analyst

Turn messy CSVs into clean, queryable datasets with validation

// First 7 days

What can be running fast.

01

Get a ready-to-run system that replaces blank-page setup.

02

Ship a usable package with 10 included files and working structure.

03

Move from purchase to first setup in about 3 min.

// Included files

What is inside the package.

data-pipeline-analyst.md
transforms/deduplication.md
transforms/normalization.md
transforms/enrichment.md
templates/data-quality-report.md
templates/schema-analysis.md
examples/csv-cleanup.md
examples/sql-migration.md
config/validation-rules.yaml
README.md

Description

What is Data Pipeline Analyst?

A data engineering skill for agents that work with structured data. The agent inspects schemas, detects anomalies, writes transformation logic, validates output against expectations, and produces data quality reports. Supports CSV, JSON, SQL databases, and Parquet files. Includes common transforms for deduplication, normalization, and enrichment.

solo-unicornclaude-code

Upgrade path

  • 01Start with this package and validate the workflow.
  • 02Add specialized skills or bundles once the core system is stable.
  • 03Use the community to sharpen positioning, demos, and feedback loops.
Previewdata-pipeline-analyst.md
# Data Pipeline Analyst Skill

## Pipeline Protocol

### 1. Schema Inspection
- Read first 100 rows and infer column types
- Report: row count, null rate per column, unique counts
- Flag: mixed types, encoding issues, date format inconsistencies

### 2. Quality Check
- Duplicates: check by primary key or full-row hash
- Outliers: flag values > 3 std devs from mean (numeric cols)
- Missing: report null percentage, suggest imputation strategy

### 3. Transform Plan
Before writing any transform code:
- State input schema -> output schema
- List every column that changes and why
- Estimate output row count
- Write validation query to confirm correctness

// Community acceleration

Use the room after the purchase.

Bring your workflow into the Solo Unicorn community for sharper feedback, operator critique, and more visibility once the system is live.

Discuss implementation