You're viewing the preview version of this page. For the full experience, please return to the .

Modern Data Warehousing,
Built for Real-TimeInsights

Unify batch and streaming data, accelerate BI analytics, and deliver trusted business insights at scale on a high-performance, real-time analytical database.

Why modern Data Warehousingchanges the business.

When the warehouse is unified, fast, real-time, and elastic, four things shift at once: the trust of the data, the speed of analysis, the freshness of decisions, and the cost of running it all.

01 / Unified Business View

Unify fragmented data
into a single source of truth.

Data from orders, customers, payments, CRM, ERP, ads, and logs often tells different versions of the same story. A modern data warehouse brings these signals together into one trusted analytical layer, so every team can work from consistent metrics and make decisions with confidence.

Where it shows up
  • Executive Business Dashboards
  • Sales and Revenue Analytics
  • Marketing ROI Analysis
01 / 04

Already running in production.

Three teams run Apache Doris as the analytical core of their data warehouse: at scale, with real concurrency, on live business data.

Case 01 · SF Technology

SF Express: Replacing Presto with Apache Doris for BI and Ad-hoc Analytics

Apache Doris cut our P95 query latency by nearly 70% and let us migrate 100% of ad-hoc and BI workloads off Presto, with far better stability and lower cost.

Outcome
  • P95 query latency reduced by ~70%, with sub-10s queries up from 72% to 88%
  • 48% lower compute cost and 96% data cache hit rate on lakehouse queries
  • 100% of ad-hoc and BI workloads migrated, with 97% SQL compatibility
Read Case Study
Case 02 · Xiaomi

Xiaomi: A Unified Lakehouse with Apache Doris and Apache Paimon

Apache Doris and Apache Paimon let us consolidate fragmented engines and storage into one lakehouse, with 6× faster queries and 5× higher concurrency than Presto.

Outcome
  • Query latency cut from 60s to 10s; aggregation from 40s to 8s
  • 5× higher concurrent throughput vs. Presto, with 25–75% lower latency under load
  • One unified stack for hot Doris storage and cold Paimon data across user behavior, device, and operations analytics
Read Case Study
Case 03 · Cainiao

Cainiao: A Real-Time Lakehouse for Global Logistics at Cainiao

Data updates can be completed within seconds, and queries can be responded to within hundreds of milliseconds.

Outcome
  • 90% lower cost and 72% faster average response on the real-time data platform
  • 1,000–2,000 QPS point queries (10–100ms) and 200–300 QPS sub-second multi-table joins
  • 25+ Doris clusters and 10,000+ CPUs across 3 regions running with zero failures, powering inventory, package, and order tracking for 80M daily packages
Read Case Study

What Data Warehousing demandsand how Apache Doris answers.

Five things a modern data warehouse has to be good at, and the specific Apache Doris capabilities that meet each one.

Technical requirements

Modeling for Warehouse Workloads

Modern data warehouses need to support detail records, fact and dimension tables, wide tables, aggregated metrics, and business-ready datasets. Strong modeling keeps definitions consistent, improves query performance, and makes trusted data reusable across teams.

Real-Time Data Freshness

Business teams need live visibility into what is happening now. That requires streaming ingestion, CDC, incremental updates, and fresh data that becomes queryable within seconds, not after yesterday’s batch.

Incremental Updates and Reliable Batch Processing

Modern warehouse workloads need both incremental refresh and large-scale batch execution. Materialized views and incremental computation keep aggregates, rollups, and reporting tables fresh without full recomputation, while reliable batch processing supports end-of-day jobs, backfills, and historical workloads.

Lakehouse & Open Architecture

Operational databases, event streams, SaaS applications, and open lakehouse tables all contain critical business data. Modern warehouses need to integrate these sources and query data in place across formats like Iceberg, Hudi, Delta Lake, and Hive, without copying everything into another silo.

Enterprise Governance and Operations

As the warehouse becomes the shared analytics foundation, it must be secure, reliable, auditable, and easy to operate. That requires fine-grained access control, workload isolation, high availability, audit logs, and simplified operations, so every team can safely depend on the same platform.

Apache Doris capabilities

01 / 05

Build Modern Data Warehousing
with Apache Doris.

Get Started