Skip to main content

Data Integrity Issues

This document is mainly used to record common data integrity issues encountered during the use of Doris. It will be updated periodically.

The term "duplicate key data in the table" refers to the appearance of duplicate key data in the merge-on-write Unique table. Duplicate key issues in merge-on-write Unique tables can be fixed by triggering a full compaction, while other types of integrity issues may require specific solutions based on the situation. For assistance, please contact community support.

Issue DescriptionOccurrence ConditionsAffected VersionsFix VersionsImpacted ScopeFix PR
Partial column updates in merge-on-write Unique table reintroduce previously deleted dataA __DORIS_DELETE_SIGN__ column is specified during partial column updates, and the historical data contains rows marked as deleted by the __DORIS_DELETE_SIGN__ column<2.1.8, <3.0.4>=2.1.8, >=3.0.4Compute-storage coupled mode, compute-storage decoupled mode, partial column updates#46194
Duplicate key data in the tableConcurrent imports in merge-on-write Unique table in compute-storage decoupled mode<3.0.4>=3.0.4Compute-storage decoupled mode#46039
Duplicate key data in the tableConcurrent imports between imports and between import and compaction in merge-on-write Unique table in compute-storage decoupled mode<3.0.4>=3.0.4Compute-storage decoupled mode#44975
System-generated values for auto-increment column are 0 or duplicateNetwork anomalies between BE and FE<2.1.8, <3.0.3>=2.1.8, >=3.0.3Compute-storage coupled mode, compute-storage decoupled mode, auto-increment column#43774
Stream Load import does not delete data that meets the delete condition in merge-on-write Unique tableStream Load import with merge_type: MERGE, partial_columns: true, and delete parameters<2.0.15, <2.17, <3.0.3>=2.0.15, >=2.17, >=3.0.3Compute-storage coupled mode, compute-storage decoupled mode, partial column updates#40730
Partial column updates result in unintended update of auto-increment column valuesThe table has auto-increment column on the Value column, and the partial column update import does not specify values for those auto-increment column<2.1.6, <3.0.2>=2.1.6, >=3.0.2Compute-storage coupled mode, compute-storage decoupled mode, auto-increment column#39996
Duplicate key data in the tableUser adds sequence column functionality to a merge-on-write Unique table that does not support sequence columns using ALTER TABLE tbl ENABLE FEATURE "SEQUENCE_LOAD" WITH ..., followed by a new import<2.0.15, <2.1.6, <3.0.2>=2.0.15, >=2.1.6, >=3.0.2Compute-storage coupled mode, compute-storage decoupled mode#39958
Duplicate key data in the tableConcurrent imports or concurrent imports and compactions in merge-on-write Unique table in compute-storage decoupled mode<3.0.1>=3.0.1Compute-storage decoupled mode#39018
Partial column update import results in data corruption in merge-on-write Unique tableConcurrent partial column updates in merge-on-write Unique table, with BE restart during the import process<2.0.15, <2.1.6, <3.0.2>=2.0.15, >=2.1.6, >=3.0.2Compute-storage coupled mode, compute-storage decoupled mode, partial column updates#38331
Duplicate key data in the tableConcurrent imports and compaction in merge-on-write Unique table in compute-storage decoupled mode<3.0.2>=3.0.2Compute-storage decoupled mode#37670, #41309, #39791
Duplicate key data in the tableLarge single import in merge-on-write Unique table with sequence column triggers segment compaction<2.0.15, <2.1.6, <3.0.2>=2.0.15, >=2.1.6, >=3.0.2Compute-storage coupled mode, compute-storage decoupled mode#38369
Duplicate key data in the tableFailed full clone on merge-on-write Unique table in compute-storage coupled mode<2.0.13, <2.1.5, <3.0.0>=2.0.13, >=2.1.5, >=3.0.0Compute-storage coupled mode#37001
Duplicate key data in the tableStream Load imports with internal retry process in merge-on-write Unique table in compute-storage decoupled mode<3.0.0>=3.0.0Compute-storage decoupled mode#36670
Inconsistent multi-replica data in merge-on-write Unique tablePartial column update import with __DORIS_DELETE_SIGN__ column on merge-on-write Unique table, and inconsistent Base Compaction progress across replicas during import<2.0.15, <2.1.5, <3.0.0>=2.0.15, >=2.1.5, >=3.0.0Compute-storage coupled mode, compute-storage decoupled mode, partial column updates#36210
Duplicate key data in the tableConcurrent partial column updates and BE restart during imports on merge-on-write Unique table<2.0.11, <2.1.4, <3.0.0>=2.0.11, >=2.1.4, >=3.0.0Compute-storage coupled mode, compute-storage decoupled mode, partial column updates#35739