Migrating Data from Other AP Systems

This document describes common approaches to migrating data from other AP (analytical) systems to Apache Doris, helping you choose a migration path that fits your source system and existing technology stack.

Migration Approaches Overview

Depending on the source system and the tools available, there are three main approaches to migrating data from other AP systems to Doris:

| Migration Approach | Applicable Scenario | Key Components |
| --- | --- | --- |
| Multi-Catalog + Insert Into | The source system is Hive, Iceberg, Hudi, or another system that supports external catalog mapping | Multi-Catalog, Insert Into |
| Intermediate file transfer | The source system supports exporting to common data formats such as CSV | Data export tools, Doris file import |
| Connector integration | An existing Spark or Flink job is in place, or programmatic migration is required | Spark/Flink Connector for the source AP system, Doris Connector |

Detailed Migration Approaches

Approach 1: Map as an external table via Multi-Catalog and then import

This approach applies to systems that Doris Multi-Catalog can recognize, such as Hive, Iceberg, and Hudi.

  • In Doris, use Multi-Catalog to map the source system as an external table.
  • Use Insert Into to write the data from the external table into a Doris internal table.
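The two steps above can be sketched as a pair of SQL statements. The snippet below is a minimal illustration that only generates the statements; it assumes a Hive source exposed through a Hive Metastore, and the catalog name `hive_src`, the metastore URI, and the table names are hypothetical placeholders. It also assumes the target Doris table already exists. The resulting statements would be run through any MySQL-protocol client connected to Doris.

```python
"""Sketch: generate the SQL for a Multi-Catalog + Insert Into migration.

All names (hive_src, demo_db, sales, orders, the metastore URI) are
hypothetical placeholders, not values taken from the Doris documentation.
"""


def create_hive_catalog_sql(catalog: str, metastore_uri: str) -> str:
    # A Hive Metastore is mapped through a catalog of type 'hms'.
    return (
        f"CREATE CATALOG {catalog} PROPERTIES (\n"
        f"    'type' = 'hms',\n"
        f"    'hive.metastore.uris' = '{metastore_uri}'\n"
        f");"
    )


def insert_into_sql(target: str, source: str) -> str:
    # Both names are fully qualified as catalog.database.table;
    # 'internal' is the catalog that holds Doris internal tables.
    return f"INSERT INTO {target} SELECT * FROM {source};"


if __name__ == "__main__":
    print(create_hive_catalog_sql("hive_src", "thrift://metastore.example.com:9083"))
    print(insert_into_sql("internal.demo_db.orders", "hive_src.sales.orders"))
```

Catalog properties differ between source types (Hive, Iceberg, Hudi), so the property list here should be checked against the catalog documentation for the system being migrated.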

Approach 2: Transfer through intermediate files

This approach applies when the source system is difficult to integrate with directly but supports data export.

  • Export the data from the original AP system to a common data format such as CSV.
  • Import the exported data files into Doris.
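One possible way to import an exported CSV file is Doris Stream Load, which accepts the file over HTTP. The sketch below serializes a few rows to CSV and builds (but does not send) the corresponding request, so that the URL and headers are visible. The host, port, database, table, credentials, and label are hypothetical placeholders. Note that the FE normally redirects the request to a BE node, so actually sending it requires a client that re-sends the body and credentials on redirect (for example `curl --location-trusted`); Python's `urllib` does not do this for PUT requests by default.

```python
"""Sketch: prepare a CSV payload and a Stream Load request for Doris.

Host, port, database, table, user, and label are hypothetical values.
The request is only constructed here, never sent.
"""
import base64
import csv
import io
import urllib.request


def rows_to_csv_bytes(rows) -> bytes:
    # Use '\n' line endings and ',' as the separator to match the
    # headers declared in the request below.
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue().encode("utf-8")


def build_stream_load_request(fe_host, http_port, database, table,
                              payload, user, password, label):
    url = f"http://{fe_host}:{http_port}/api/{database}/{table}/_stream_load"
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    headers = {
        "Authorization": f"Basic {token}",
        "Expect": "100-continue",
        "label": label,             # unique per load, used for deduplication
        "format": "csv",
        "column_separator": ",",
    }
    return urllib.request.Request(url, data=payload, headers=headers, method="PUT")


if __name__ == "__main__":
    data = rows_to_csv_bytes([("1", "alice"), ("2", "bob")])
    request = build_stream_load_request(
        "fe.example.com", 8030, "demo_db", "orders", data, "root", "", "migration_001"
    )
    print(request.get_method(), request.full_url)
```

For large exports, splitting the data into several files and loading each with its own label keeps individual loads small and easy to retry.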

Approach 3: Integrate through Spark or Flink Connectors

This approach applies when a Spark or Flink data processing pipeline is already in place, or when data needs to be cleaned or transformed during migration.

  • Use the Spark or Flink Connector that corresponds to the AP system to read the source data.
  • Use the Doris Connector to write the data into Doris.
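The write side of the connector path can be sketched for PySpark as follows. This is a minimal illustration, assuming the Spark Doris Connector is on the Spark classpath and that `df` is a DataFrame already read from the source system with that system's own connector. The option keys follow the Spark Doris Connector's documented DataFrame usage but should be verified against the connector version in use; the host, table, and credentials are hypothetical placeholders.

```python
"""Sketch: write a Spark DataFrame to Doris through the Spark Doris Connector.

The function accepts any object that exposes the DataFrameWriter chain
(df.write.format(...).options(...).save()), so pyspark is not imported here.
Connection values are hypothetical placeholders.
"""


def doris_write_options(fe_nodes: str, table_identifier: str,
                        user: str, password: str) -> dict:
    # fe_nodes is "host:http_port" of a Doris FE;
    # table_identifier is "database.table" in Doris.
    return {
        "doris.fenodes": fe_nodes,
        "doris.table.identifier": table_identifier,
        "user": user,
        "password": password,
    }


def write_to_doris(df, options: dict) -> None:
    # Cleaning or transformation steps would be applied to df before this call.
    df.write.format("doris").options(**options).save()


if __name__ == "__main__":
    opts = doris_write_options("fe.example.com:8030", "demo_db.orders", "root", "")
    print(opts)
    # With a real SparkSession and a source DataFrame `source_df`:
    # write_to_doris(source_df, opts)
```

Keeping the options in one dictionary makes it straightforward to reuse the same connection settings across several tables during a migration.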

FAQ

Q: What if the AP system I am using is not in the list above?

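The intermediate file approach remains an option for any system that can export data to a common format such as CSV, and the connector approach applies if a Spark or Flink Connector exists for the system. If you have other migration tools or approaches that could be added to this list, please contact dev@doris.apache.org.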
