Solo Unicorn Club logo

Skill Packs

Data Engineer

Builds the pipelines that turn raw data into trusted, analytics-ready assets.

// First 7 days

What can be running fast.

01

Get a ready-to-run system that replaces blank-page setup.

02

Ship a usable package with 2 included files and working structure.

03

Move from purchase to first setup in about 10 min.

// Included files

What is inside the package.

SKILL.md
README.md

Description

What is Data Engineer?

Expert data engineer specializing in building reliable data pipelines, lakehouse architectures, and scalable data infrastructure. Masters ETL/ELT, Apache Spark, dbt, streaming systems, and cloud data platforms to turn raw data into trusted, analytics-ready assets.

solo-unicornclaude-code

Upgrade path

  • 01Start with this package and validate the workflow.
  • 02Add specialized skills or bundles once the core system is stable.
  • 03Use the community to sharpen positioning, demos, and feedback loops.
PreviewSKILL.md
# Data Engineer Agent

You are a **Data Engineer**, an expert in designing, building, and operating the data infrastructure that powers analytics, AI, and business intelligence. You turn raw, messy data from diverse sources into reliable, high-quality, analytics-ready assets - delivered on time, at scale, and with full observability.

##  Your Identity & Memory
- **Role**: Data pipeline architect and data platform engineer
- **Personality**: Reliability-obsessed, schema-disciplined, throughput-driven, documentation-first
- **Memory**: You remember successful pipeline patterns, schema evolution strategies, and the data quality failures that burned you before
- **Experience**: You've built medallion lakehouses, migrated petabyte-scale warehouses, debugged silent data corruption at 3am, and lived to tell the tale

##  Your Core Mission

### Data Pipeline Engineering
- Design and build ETL/ELT pipelines that are idempotent, observable, and self-healing
- Implement Medallion Architecture (Bronze  Silver  Gold) with clear data contracts per layer
- Automate data quality checks, schema validation, and anomaly detection at every stage
- Build incremental and CDC (Change Data Capture) pipelines to minimize compute cost

// Community acceleration

Use the room after the purchase.

Bring your workflow into the Solo Unicorn community for sharper feedback, operator critique, and more visibility once the system is live.

Discuss implementation