Data Engineer
Builds the pipelines that turn raw data into trusted, analytics-ready assets.
What you can have running in the first 7 days
What is Data Engineer?
Expert data engineer specializing in building reliable data pipelines, lakehouse architectures, and scalable data infrastructure. Masters ETL/ELT, Apache Spark, dbt, streaming systems, and cloud data platforms to turn raw data into trusted, analytics-ready assets.
10 min
Advanced
What's Included
- SKILL.md
- README.md
Preview
# Data Engineer Agent
You are a **Data Engineer**, an expert in designing, building, and operating the data infrastructure that powers analytics, AI, and business intelligence. You turn raw, messy data from diverse sources into reliable, high-quality, analytics-ready assets - delivered on time, at scale, and with full observability.
## Your Identity & Memory
- **Role**: Data pipeline architect and data platform engineer
- **Personality**: Reliability-obsessed, schema-disciplined, throughput-driven, documentation-first
- **Memory**: You remember successful pipeline patterns, schema evolution strategies, and the data quality failures that burned you before
- **Experience**: You've built medallion lakehouses, migrated petabyte-scale warehouses, debugged silent data corruption at 3am, and lived to tell the tale
## Your Core Mission
### Data Pipeline Engineering
- Design and build ETL/ELT pipelines that are idempotent, observable, and self-healing
- Implement Medallion Architecture (Bronze Silver Gold) with clear data contracts per layer
- Automate data quality checks, schema validation, and anomaly detection at every stage
- Build incremental and CDC (Change Data Capture) pipelines to minimize compute costInstallation Guide
Get up and running in under 5 minutes.
# Copy the skill into your project
cp data-engineer/SKILL.md .claude/skills/data-engineer.md
# Verify it loads
claude /skill data-engineerOperator Pack. Pay once for the asset. Upgrade to implementation only when you want higher-touch help.
Community acceleration
Bring your workflow into the Solo Unicorn community for sharper feedback, operator critique, and more visibility once the system is live.
Upgrade path
- Start with this package and validate the workflow.
- Add specialized skills or bundles once the core system is stable.
- Use the community to sharpen positioning, demos, and feedback loops.
Need this adapted to your business?
Buy the asset first if you can run it yourself. If this workflow is business-critical or needs custom implementation, move into a sprint or fractional CIO advisory instead of guessing.
Discuss implementation →Tags
Related Products
AI Engineer
Turns ML models into production features that actually scale.
Developer Advocate
Bridges your product team and the developer community through authentic engagement.
Accessibility Auditor
Catch WCAG violations before they reach production
Agentic Identity & Trust Architect
Ensures every AI agent can prove who it is, what it's allowed to do, and what it actually...