Release 3.0.7

Behavior Changes

Adjust the permission requirements for show frontends and show backends to align with the corresponding RESTful API, i.e., requiring the SELECT_PRIV permission on the information_schema database
Admin and root users with specified domains are no longer considered system users
Storage: The default number of concurrent transactions per database is adjusted to 10000

New Features

Query Optimizer

Support MySQL's aggregate roll-up syntax GROUP BY ... WITH ROLLUP

Query Execution

Like statement supports escape syntax

Semi-structured Data Management

Support building non-tokenized inverted indexes and ngram bloomfilter indexes only for new data by setting the session variable enable_add_index_for_new_data=true

New Functions

Added data functions: cot/sec/cosec

Improvements

Data Ingestion

Optimize error message prompts for SHOW CREATE LOAD

Primary Key Model

Add segment key bounds truncation capability to avoid single large import failures

Storage

Enhance the reliability of compaction and imported data
Optimize balance speed
Optimize table creation speed
Optimize compaction default parameters and observability
Optimize the issue of query error -230
Add system table backend_tablets
Optimize the performance of querying information_schema.tables from follower nodes in cloud mode

Storage-Compute Decoupled

Enhance observability of Meta-service recycler
Support cross-compute group incremental preheating during import compaction
Optimize Storage vault connectivity check
Support updating storage backend information via MS API

Lakehouse

Optimize ORC zlib decompression performance in x86 environment and fix potential issues
Optimize the default number of concurrent threads for external table reading
Optimize error messages for Catalogs that do not support DDL operations

Asynchronous Materialized Views

Optimize the performance of transparent rewriting planning

Query Optimizer

The group_concat function now allows parameters of non-string types
The sum and avg functions allow parameters of non-numeric types
Expand the scope of support for delayed materialization in TOP-N queries, enabling delayed materialization when querying partial columns
When creating partitions, list partitions allow inclusion of MAX_VALUE
Optimize the performance of sampling and collecting statistical information for aggregate model tables
Optimize the accuracy of NDV values when sampling and collecting statistical information

Inverted Index

Unify the order of properties displayed for inverted indexes in show create table
Add per-condition profile metrics (such as hit rows and execution time) for inverted index filter conditions to facilitate performance analysis
Enhance the display of inverted index-related information in profiles

Permissions

Ranger supports setting permissions for storage vault and compute group

Bug Fixes

Data Ingestion

Fix the correctness issue that may occur when importing CSV files with multi-character separators
Fix the issue where the result of ROUTINE LOAD task display is incorrect after modifying task properties
Fix the issue where the one-stream multi-table import plan becomes invalid after primary node restart or Leader switch
Fix the issue where all scheduling tasks are blocked because ROUTINE LOAD tasks cannot find available BE nodes
Fix the concurrent read-write conflict issue of runningTxnIds

Primary Key Model

Optimize the import performance of mow tables under high-frequency concurrent imports
mow table full compaction releases space of deleted data
Fix the potential import failure issue of mow tables in extreme scenarios
Optimize the compaction performance of mow tables
Fix the potential correctness issue of mow tables during concurrent imports and schema changes
Fix the issue where schema change on empty mow tables may cause import stuck or schema change failure
Fix the memory leak issue of mow delete bitmap cache
Fix the potential correctness issue of mow tables after schema change

Storage

Fix the missing rowset issue in clone process caused by compaction
Fix the issue of inaccurate size calculation and default value for autobucket
Fix the potential correctness issue caused by bucket columns
Fix the issue where single-column tables cannot be renamed
Fix the potential memory leak issue of memtable
Fix the inconsistent error reporting issue for unsupported operations in empty table transaction writes

Storage-Compute Decoupled

Several fixes for File cache
Fix the issue where cumulative point may roll back during schema process
Fix the issue where background tasks affect automatic restart
Fix the unhandled exception issue in data recycling process in azure environment
Fix the issue where file cache is not cleaned up in time when compacting a single rowset

Lakehouse

Fix the transaction commit failure issue for Iceberg table writes in Kerberos environment
Fix the query issue for hudi in kerberos environment
Fix the potential deadlock issue in multi-Catalog scenarios
Fix the metadata inconsistency issue caused by concurrent Catalog refresh in some cases
Fix the issue where ORC footer is read multiple times in some cases
Fix the issue where Table Valued Function cannot read compressed json files
SQL Server Catalog supports identifying IDENTITY column information
SQL Convertor supports specifying multiple URLs for high availability

Asynchronous Materialized Views

Fix the issue where partition compensation may be performed incorrectly when the query is optimized to an empty result set

Query Optimizer

Fix the issue where factors other than sql_select_limit affect DML execution results
Fix the issue where materialized CTEs may report errors in extreme cases when starting local shuffle
Fix the issue where prepared insert statements cannot be executed on non-master nodes
Fix the result error issue when casting ipv4 to string

Permissions

When a user has multiple roles, the permissions of the multiple roles will be merged before authorization

Query Execution

Fix issues with some json functions
Fix the potential BE Core issue when the asynchronous thread pool is full
Fix the incorrect result issue of hll_to_base64
Fix the result error issue when casting decimal256 to float
Fix two memory leak issues
Fix the be core issue caused by bitmap_from_base64
Fix the potential be core issue caused by array_map function
Fix the potential error issue of split_by_regexp function
Fix the potential result error issue of bitmap_union function under extremely large data volumes
Fix the potential core issue of format round function under some boundary values

Inverted Index

Fix the memory leak issue of inverted indexes in abnormal situations
Fix the error reporting issue when writing and querying empty index files
Capture IO exceptions in inverted index string reading to avoid process crash due to exceptions

Complex Data Types

Fix the potential type inference error when Variant Nested data types conflict
Fix the parameter type inference error of map function
Fix the issue where data is incorrectly converted to NULL when specifying '$.' as the path in jsonpath
Fix the issue where the serialization format cannot be restored when a subfield of Variant contains .

Others

Fix the insufficient length issue of the IP field in the auditlog table
Fix the issue where the query id recorded in the audit log is that of the previous query when SQL parsing fails

Behavior Changes​

New Features​

Query Optimizer​

Query Execution​

Semi-structured Data Management​

New Functions​

Improvements​

Data Ingestion​

Primary Key Model​

Storage​

Storage-Compute Decoupled​

Lakehouse​

Asynchronous Materialized Views​

Query Optimizer​

Inverted Index​

Permissions​

Bug Fixes​

Data Ingestion​

Primary Key Model​

Storage​

Storage-Compute Decoupled​

Lakehouse​

Asynchronous Materialized Views​

Query Optimizer​

Permissions​

Query Execution​

Inverted Index​

Complex Data Types​

Others​

Behavior Changes

New Features

Query Optimizer

Query Execution

Semi-structured Data Management

New Functions

Improvements

Data Ingestion

Primary Key Model

Storage

Storage-Compute Decoupled

Lakehouse

Asynchronous Materialized Views

Query Optimizer

Inverted Index

Permissions

Bug Fixes

Data Ingestion

Primary Key Model

Storage

Storage-Compute Decoupled

Lakehouse

Asynchronous Materialized Views

Query Optimizer

Permissions

Query Execution

Inverted Index

Complex Data Types

Others