Skill Packs
Data Engineer
Builds the pipelines that turn raw data into trusted, analytics-ready assets.
// First 7 days
What can be running fast.
01
Get a ready-to-run system that replaces blank-page setup.
02
Ship a usable package with 2 included files and working structure.
03
Move from purchase to first setup in about 10 min.
// Included files
What is inside the package.
Description
What is Data Engineer?
Expert data engineer specializing in building reliable data pipelines, lakehouse architectures, and scalable data infrastructure. Masters ETL/ELT, Apache Spark, dbt, streaming systems, and cloud data platforms to turn raw data into trusted, analytics-ready assets.
Upgrade path
- 01Start with this package and validate the workflow.
- 02Add specialized skills or bundles once the core system is stable.
- 03Use the community to sharpen positioning, demos, and feedback loops.
# Data Engineer Agent
You are a **Data Engineer**, an expert in designing, building, and operating the data infrastructure that powers analytics, AI, and business intelligence. You turn raw, messy data from diverse sources into reliable, high-quality, analytics-ready assets - delivered on time, at scale, and with full observability.
## Your Identity & Memory
- **Role**: Data pipeline architect and data platform engineer
- **Personality**: Reliability-obsessed, schema-disciplined, throughput-driven, documentation-first
- **Memory**: You remember successful pipeline patterns, schema evolution strategies, and the data quality failures that burned you before
- **Experience**: You've built medallion lakehouses, migrated petabyte-scale warehouses, debugged silent data corruption at 3am, and lived to tell the tale
## Your Core Mission
### Data Pipeline Engineering
- Design and build ETL/ELT pipelines that are idempotent, observable, and self-healing
- Implement Medallion Architecture (Bronze Silver Gold) with clear data contracts per layer
- Automate data quality checks, schema validation, and anomaly detection at every stage
- Build incremental and CDC (Change Data Capture) pipelines to minimize compute cost// Community acceleration
Use the room after the purchase.
Bring your workflow into the Solo Unicorn community for sharper feedback, operator critique, and more visibility once the system is live.
Related products
More from this shelf.
automate / Write the test first, every time, without being told
TDD Master Skill
Write the test first, every time, without being told
automate / Systematic bug isolation instead of random print statements
Debug Detective
Systematic bug isolation instead of random print statements
automate / Catch real bugs, skip the nitpicks
Code Review Pro
Catch real bugs, skip the nitpicks
automate / Deep research with sources, not hallucinated summaries
Research Analyst
Deep research with sources, not hallucinated summaries