Solo Unicorn Club logoSolo Unicorn
2,750 words

Snowflake Deep Dive — The AI-Powered Data Cloud

Company Deep DiveSnowflakeAIData CloudData WarehouseCortex
Snowflake Deep Dive — The AI-Powered Data Cloud

Snowflake Deep Dive — The AI-Powered Data Cloud

Opening

Snowflake was once Wall Street's most coveted SaaS company — its 2020 IPO surged 111% on the first day, with even Buffett making a rare IPO investment. But growth slowed in 2023–2024 and the stock price halved from its peak. Then the AI wave arrived. By late 2025, Cortex AI revenue hit an annualized $100 million, a quarter ahead of most analyst forecasts. FY2026 product revenue reached approximately $4.47 billion, up 29%.

I encounter Snowflake at enterprise clients more frequently than perhaps any other data tool. It's many companies' "first serious cloud data warehouse." This article breaks down what Snowflake is actually doing today and how its AI bet is playing out.

The Problem They Solve

Traditional data warehouses (Oracle, Teradata) had two fundamental problems: expensive, and inflexible. You had to pre-purchase capacity and pay regardless of usage. Scaling up when data volumes grew took months. Cross-departmental data sharing meant exporting CSVs.

Snowflake solved these problems with a compute-storage separation architecture: storage is billed by actual usage (close to S3 pricing), compute resources spin up and down on demand within seconds. Data sharing requires no data movement — just grant access directly.

Target customers: mid-to-large enterprise data analytics and BI teams, especially organizations already using Tableau, Looker, or Power BI. Industry coverage spans finance, retail, tech, and healthcare — essentially any enterprise that needs SQL analytics is within Snowflake's range.

Why now: Enterprise data volumes are growing far faster than budgets. IDC data shows global enterprise data doubles every two years, while IT budget growth sits at just 5–8% annually. Consumption-based elastic pricing lets CFOs accept a "pay for what you use" model instead of locking into three-year contracts with Oracle. Snowflake captured the 2020–2023 "de-Oracle" window, winning droves of customers migrating from legacy data warehouses.

Product Matrix

Core Products

Snowflake Data Cloud: The core data warehouse supporting SQL queries, semi-structured data processing (JSON, Parquet), data sharing, and the Marketplace.

Snowflake Cortex: An AI services layer launched in 2024 that runs LLM inference directly within the Snowflake environment. Includes SQL functions like AI_COMPLETE (text generation), AI_CLASSIFY (classification), and AI_SUMMARIZE (summarization). Supports models including Claude, Llama, and Mistral.

Snowflake Intelligence: Launched in late 2025, it lets non-technical users query data using natural language — a ThoughtSpot-like search experience embedded natively in the Snowflake ecosystem.

Cortex Search: A search service for building RAG applications on top of Snowflake data, billed by indexed data volume and token usage.

Snowpark: A runtime for running Python, Java, and Scala code inside Snowflake, letting data scientists do ML without moving data out.

Snowflake Marketplace: A marketplace for data and applications. Third-party vendors can publish datasets, ML models, and data apps on the Marketplace. Customers can subscribe to third-party data directly within Snowflake without building ETL pipelines. This model creates network effects at the data layer.

Technical Differentiation

Snowflake's moat is "data gravity" — once enterprises put their data in Snowflake, the queries, permissions, and pipelines built around that data are locked in. Data sharing and the Marketplace further amplify network effects: your partners are also on Snowflake, enabling direct data interchange.

Compared to Databricks, Snowflake's SQL experience is more mature and faster — Snowflake's query optimizer is deeply tuned for analytical SQL, making onboarding near-zero effort for data analysts. Compared to BigQuery, Snowflake's cross-cloud capability (AWS, Azure, GCP all supported) is the key differentiator. Compared to Redshift, Snowflake's compute-storage separation architecture delivers a dramatically better scaling experience — Redshift scaling still requires waiting for nodes to warm up.

Snowflake's data gravity effect is its deepest moat: once enterprises place core data in Snowflake, the BI tools, ETL pipelines, and data sharing relationships built around it are all locked in. Migration costs aren't just technical — they include the cost of rebuilding business relationships.

Business Model

Pricing Strategy

Snowflake is purely consumption-based — no seat-based licenses. The core billing unit is the Credit.

Billing Dimension Model Reference Price
Compute (Virtual Warehouse) Credit consumption $2–4/Credit (on-demand), $1.50–2.50/Credit (annual commitment)
Storage Actual usage ~$23–40/TB/month
Cortex AI functions Per token Claude-4-opus: 12 Credits/million tokens
Cortex Search Indexed data volume + tokens 6.3 Credits/GB/month (continuous billing)

Pay close attention to Cortex Search's "idle tax" — whether or not you're running queries, as long as the index exists, you're billed per GB/month continuously. A 50GB index + 20GB embeddings = 70GB, costing a fixed 441 Credits per month.

Revenue Model

The upside of consumption-based pricing is that revenue correlates with customer value; the downside is seasonal volatility. Snowflake smooths this through annual commitment contracts (Capacity Contracts) where customers pre-purchase Credits at discounted rates. FY2027 guidance projects product revenue of approximately $5.7 billion, implying 27% growth.

Funding and Valuation

Snowflake IPO'd in September 2020 and currently has a market cap of approximately $75 billion. Over the past 12 months, the ARR gap between Snowflake and Databricks has narrowed from Snowflake leading by $8.8 billion to nearly even. Databricks is valued at $134 billion — nearly 2x Snowflake — the market is voting with valuations on which company looks more like an "AI company."

Customers and Market

Flagship Customers

  • Capital One: Migrated its core data analytics platform to Snowflake, replacing Teradata
  • Instacart: Supply chain and user behavior analytics run on Snowflake
  • Siemens: Global platform for industrial data sharing and analytics
  • DoorDash: Real-time operational analytics and pricing models

Snowflake has over 6,100 accounts using Cortex to build AI applications. Net revenue retention is approximately 120%, down from the historical peak of 170%+, reflecting slowing consumption growth.

Market Size

The cloud data warehouse market is projected at approximately $35 billion in 2026. Snowflake's $4.47 billion in product revenue represents roughly 13% market share. Including the AI services market, the TAM could expand to over $50 billion.

Competitive Landscape

Dimension Snowflake Databricks Google BigQuery Amazon Redshift
SQL performance Strong Moderate (catching up) Strong Moderate
Native AI/ML Moderate (Cortex, early stage) Strong (Mosaic AI) Strong (Vertex AI) Moderate
Unstructured data Weak Strong Moderate Weak
Data sharing Strong (Marketplace) Moderate Weak Weak
Cross-cloud All three major clouds All three major clouds GCP only AWS only
Pricing model Consumption (Credit) Consumption (DBU) Consumption Reserved + on-demand
Growth 29% YoY 65% YoY Not separately disclosed Not separately disclosed

Key observation: Snowflake and Databricks are encroaching on each other's territory from opposite directions. Snowflake is extending from SQL analytics into AI; Databricks is extending from ML into SQL. In the near term, Snowflake retains an edge in SQL scenarios, but Databricks has a clear lead in the AI direction.

What I've Actually Seen

The good: Snowflake is incredibly analyst-friendly. I've seen data teams migrate from Teradata to Snowflake and achieve 5–10x faster query performance while cutting total costs by 40%. The data sharing feature is a genuine differentiator — sharing data across organizations without ETL, just grant access. For teams already running Tableau/Looker, Snowflake is the most seamless back-end choice.

The complicated: Consumption-based pricing is a double-edged sword. Flexible on one hand, unpredictable on the other. I've seen a client burn through $5,000 on a single poorly written SQL query. Cortex AI pricing adds further complexity, and Cortex Search's continuous billing model has caught many clients off guard during proof-of-concept phases. For teams without FinOps discipline, a Snowflake monthly bill can send a CFO's blood pressure through the roof.

The reality: Snowflake's competitiveness in the AI direction still needs to be proven. Cortex at $100 million annualized sounds impressive — until you compare it to Databricks' $1.4 billion in AI revenue. Many of the 6,100 Cortex users are still in trial phases. I've yet to see many enterprises running production-grade ML models on Snowflake. Snowflake's long-term bet is that enterprises will choose to do AI where the data already lives, rather than move data to an AI platform. The logic holds, but Cortex needs to close the capability gap with Databricks.

My Verdict

  • Suitable: Enterprises whose core need is SQL analytics and BI. Snowflake offers the best combination of query performance, ease of use, and ecosystem maturity.
  • Suitable: Scenarios requiring cross-organizational data sharing (finance, healthcare, supply chain).
  • Suitable: Teams already using Tableau/Looker/Power BI — Snowflake is the smoothest upgrade path.
  • Skip if: Your core need is ML model training and inference. Databricks or native cloud ML services are more mature right now.
  • Skip if: Budget is tight and you lack a FinOps team. The flexibility of consumption pricing becomes a cost trap without proper oversight.

In one line: Snowflake is the default choice for enterprise data analytics, but the AI hand is still being played — whether it's a winning one will become clear by the second half of 2026.

Discussion

Do you buy Snowflake's "do AI where your data already lives" argument? Or will enterprises ultimately move data to dedicated AI platforms? Has your team started using Cortex?