Compilation and Deployment
1. Overviewβ
This document details the compilation and deployment process of Doris in a decoupled storage-compute model, highlighting the differences from the integrated storage-compute model, especially the compilation, configuration, and management of the newly added Meta Service (MS) module.
2. Obtaining Binariesβ
2.1 Direct Downloadβ
Compiled binaries (including all Doris modules) can be obtained from the Doris Download Page (select version 3.0.2 or higher).
2.2 Compilation Output (Optional)β
Compile using the build.sh
script provided in the codebase. The new MS module is compiled with the --cloud
parameter.
sh build.sh --fe --be --cloud
After compilation, a new ms
directory will be added in the output
directory:
output
βββ be
βββ fe
βββ ms
βββ bin
βββ conf
βββ lib
3. Meta Service Deploymentβ
3.1 Configurationβ
In the ./conf/doris_cloud.conf
file, you mainly need to modify the following two parameters:
brpc_listen_port
: The listening port for the Meta Service, default is 5000.fdb_cluster
: Connection information for the FoundationDB cluster, which can be obtained when deploying FoundationDB. (If using thefdb_ctl.sh
provided by Doris for deployment, this value can be found in the$FDB_HOME/conf/fdb.cluster
file.)
Example configuration:
brpc_listen_port = 5000
fdb_cluster = xxx:yyy@127.0.0.1:4500
Note: The value of fdb_cluster
should match the content of the /etc/foundationdb/fdb.cluster
file on the FoundationDB deployment machine. (If using the fdb_ctl.sh
provided by Doris for deployment, this value can be found in the $FDB_HOME/conf/fdb.cluster
file.)
Example: The last line of the file is the value to fill in the doris_cloud.conf for the fdb_cluster field
cat /etc/foundationdb/fdb.cluster
# DO NOT EDIT!
# This file is auto-generated, it is not to be edited by hand.
cloud_ssb:A83c8Y1S3ZbqHLL4P4HHNTTw0A83CuHj@127.0.0.1:4500
3.2 Start and Stopβ
Environment Requirements
Ensure that the JAVA_HOME
environment variable is correctly set to point to OpenJDK 17, and navigate to the ms
directory.
Start Command
export JAVA_HOME=${path_to_jdk_17}
bin/start.sh --daemon
The startup script returns a value of 0 to indicate that the startup was successful; otherwise, the startup fails. Upon successful startup, the last line of the standard output text will be "doris_cloud start successfully."
Stop Command
bin/stop.sh
For production environment, please ensure that the total number of Meta Service is at least three.
4. Independent Deployment of Data Recycling Function (Optional)β
The Meta Service itself has metadata management and recycling functions, which can be deployed independently. If you want to deploy them independently, you can refer to this section.
Preparation Work
-
Create a new working directory (e.g.,
recycler
). -
Copy the contents of the
ms
directory to the new directory:cp -r ms recycler
Configuration
Modify the BRPC listening port brpc_listen_port
and the value of fdb_cluster
in the configuration file of the new directory.
Start Data Recycling Function
export JAVA_HOME=${path_to_jdk_17}
bin/start.sh --recycler --daemon
Start Only Metadata Operation Function
export JAVA_HOME=${path_to_jdk_17}
bin/start.sh --meta-service --daemon
5. Startup Process for FE and BEβ
This section details the steps to start FE (Frontend) and BE (Backend) in a decoupled storage-compute architecture.
5.1 Startup Orderβ
- Start the first FE instance with the MASTER role.
- Add other FE and BE instances to the cluster.
- Add the first Storage Vault.
5.2 Start the MASTER Role FEβ
5.2.1 Configure fe.confβ
In the fe.conf
file, the following key parameters need to be configured:
-
deploy_mode
- Description: Specifies the Doris startup mode.
- Format:
cloud
indicates the decoupled storage-compute mode; others indicate the integrated storage-compute mode. - Example:
cloud
-
cluster_id
- Description: A unique identifier for the cluster in the decoupled storage-compute architecture. Each cluster must have a distinct cluster_id to avoid conflicts.
- Format: Integer (int type).
- Example: A random
cluster_id
can be generated using the following shell command.echo $(($((RANDOM << 15)) | $RANDOM))
cautionEach cluster must have a distinct cluster_id to avoid conflicts
-
meta_service_endpoint
- Description: The address and port of the Meta Service.
- Format:
IP address:port number
. - Example:
127.0.0.1:5000
, multiple meta services can be configured separated by commas.
5.2.2 Start FEβ
Example start command:
bin/start_fe.sh --daemon
The first FE process initializes the cluster and operates in the FOLLOWER role. Use a MySQL client to connect to the FE and use show frontends
to confirm that the recently started FE is the master.
5.3 Add Other FE Nodesβ
Other nodes should also modify the configuration file and start according to the above steps. Use a MySQL client to connect to the FE with the MASTER role and use the following SQL command to add additional FE nodes:
ALTER SYSTEM ADD FOLLOWER "host:port";
Replace host:port
with the actual address and editlog port of the FE node. More information refer to ADD FOLLOWER and ADD OBSERVER.
For production environment, please ensure that the total number of Frontend (FE) nodes in the FOLLOWER role, including the first FE, remains an odd number. In general, three FOLLOWERS are sufficient. Frontend nodes in the OBSERVER role can be any number.
5.4 Add BE Nodesβ
To add Backend nodes to the cluster, perform the following steps for each Backend:
5.4.1 Configure be.confβ
In the be.conf
file, the following key parameters need to be configured:
-
deploy_mode
- Description: Specifies the Doris startup mode.
- Format:
cloud
indicates the decoupled storage-compute mode; others indicate the integrated storage-compute mode. - Example:
cloud
-
file_cache_path
- Description: The disk paths and other parameters used for file cache, represented as an array, with one entry for each disk. The
path
specifies the disk path, andtotal_size
limits the size of the cache; -1 or 0 will use the entire disk space. - format: [{"path":"/path/to/file_cache","total_size":21474836480},{"path":"/path/to/file_cache2","total_size":21474836480}]
- Example: [{"path":"/path/to/file_cache","total_size":21474836480},{"path":"/path/to/file_cache2","total_size":21474836480}]
- Default: [{"path":"${DORIS_HOME}/file_cache"}]
- Description: The disk paths and other parameters used for file cache, represented as an array, with one entry for each disk. The
5.4.1 Start and Add BEβ
-
Start the Backend:
Use the following command to start the Backend:
bin/start_be.sh --daemon
-
Add the Backend to the cluster:
Connect to any Frontend using a MySQL client and execute:
ALTER SYSTEM ADD BACKEND "<ip>:<heartbeat_service_port>" [PROPERTIES properties];
Replace
<ip>
with the IP address of the new Backend and<heartbeat_service_port>
with its configured heartbeat service port (default is 9050).You can set the computing group for the BE using PROPERTIES.
For more detailed usage, please refer to ADD BACKEND and REMOVE BACKEND.
-
Verify Backend Status:
Check the Backend log file (
be.log
) to ensure it has successfully started and joined the cluster.You can also use the following SQL command to check the Backend status:
SHOW BACKENDS;
This will display all Backends in the cluster and their current status.
6. Create Storage Vaultβ
Storage Vault is an important component in the Doris decoupled storage and computing architecture. They represent a shared storage layer for storing data. You can create one or more Storage Vaults using HDFS or S3-compatible object storage. One Storage Vault can be set as the default Storage Vault, and system tables and tables without a specified Storage Vault will be stored in this default Storage Vault. The default Storage Vault cannot be deleted. Here are the methods to create a Storage Vault for your Doris cluster:
6.1 Create HDFS Storage Vaultβ
To create a Storage Vault using SQL, connect to your Doris cluster using a MySQL client.
CREATE STORAGE VAULT IF NOT EXISTS hdfs_vault
PROPERTIES (
"type"="hdfs",
"fs.defaultFS"="hdfs://127.0.0.1:8020"
);
6.2 Create S3 Storage Vaultβ
To create a Storage Vault using S3-compatible object storage, follow these steps:
-
Connect to your Doris cluster using a MySQL client.
-
Execute the following SQL command to create an S3 Storage Vault:
CREATE STORAGE VAULT IF NOT EXISTS s3_vault
PROPERTIES (
"type"="S3",
"s3.endpoint"="s3.us-east-1.amazonaws.com",
"s3.access_key" = "ak",
"s3.secret_key" = "sk",
"s3.region" = "us-east-1",
"s3.root.path" = "ssb_sf1_p2_s3",
"s3.bucket" = "doris-build-1308700295",
"provider" = "S3"
);
To create a Storage Vault on other object storage, please refer to Create Storage Vault.
6.3 Set Default Storage Vaultβ
Use the following SQL statement to set a default Storage Vault.
SET <storage_vault_name> AS DEFAULT STORAGE VAULT
7. Notesβ
- Only the Meta Service process for metadata operations should be configured as the
meta_service_endpoint
target for FE and BE. - The data recycling function process should not be configured as the
meta_service_endpoint
target.