Skip to main content

ARRAY_FIRST

array_first

SinceVersion 2.0.0

Description

Returns the first element in the array that satisfies the lambda expression condition. The function applies the lambda expression to elements in the array, finds the first element that satisfies the condition and returns it.

Syntax

array_first(lambda, array1, ...)

Parameters

  • lambda:lambda expression used to evaluate array elements, returns true/false or expressions that can be converted to boolean values
  • array1, ...:one or more ARRAY type parameters

Supported types for T:

  • Numeric types: TINYINT, SMALLINT, INT, BIGINT, LARGEINT, FLOAT, DOUBLE, DECIMAL
  • String types: CHAR, VARCHAR, STRING
  • Date and time types: DATE, DATETIME, DATEV2, DATETIMEV2
  • Boolean type: BOOLEAN
  • IP types: IPV4, IPV6
  • Complex data types: ARRAY, MAP, STRUCT

Return Value

Return type: T

Return value meaning:

  • Returns the first element in the array that satisfies the lambda expression condition
  • NULL: if the array is empty or no element satisfies the condition

Usage notes:

  • The number of parameters in the lambda expression must match the number of array parameters
  • All input arrays must have the same length
  • Supports searching in multi-arrays and complex type arrays
  • Empty arrays return NULL
  • Does not support NULL input parameters
  • Lambda can use any scalar expression, but cannot use aggregate functions
  • Lambda expressions can call other higher-order functions, but the return types must be compatible
  • For null values in array elements: null elements will be passed to the lambda expression for processing, and lambda can check for null values

Query Examples:

Find the first element greater than or equal to 3 in a floating-point array:

SELECT array_first(x -> x >= 3, [1.1, 2.2, 3.3, 4.4, 5.5]);
+-----------------------------------------------------+
| array_first(x -> x >= 3, [1.1, 2.2, 3.3, 4.4, 5.5]) |
+-----------------------------------------------------+
| 3.3 |
+-----------------------------------------------------+

Find the first element with length greater than 2 in a string array:

SELECT array_first(x -> length(x) > 2, ['a', 'bb', 'ccc', 'dddd', 'eeeee']);
+----------------------------------------------------------------------+
| array_first(x -> length(x) > 2, ['a', 'bb', 'ccc', 'dddd', 'eeeee']) |
+----------------------------------------------------------------------+
| ccc |
+----------------------------------------------------------------------+

Empty array returns NULL:

SELECT array_first(x -> x > 0, []);
+-------------------------------------+
| array_first(x -> x > 0, []) |
+-------------------------------------+
| NULL |
+-------------------------------------+

NULL input parameter will error:

SELECT array_first(x -> x > 2, NULL);
ERROR 1105 (HY000): errCode = 2, detailMessage = lambda argument must be array but is NULL

SELECT array_first(NULL);
ERROR 1105 (HY000): errCode = 2, detailMessage = Can not build function: 'array_first', expression: array_first(NULL), The 1st arg of array_filter must be lambda but is NULL

Array containing null values, lambda can check for null:

SELECT array_first(x -> x is not null, [null, 1, null, 3, null, 5]);
+--------------------------------------------------------------+
| array_first(x -> x is not null, [null, 1, null, 3, null, 5]) |
+--------------------------------------------------------------+
| 1 |
+--------------------------------------------------------------+

Multi-array search, find the first element where the first array is greater than the second array:

SELECT array_first((x, y) -> x > y, [1, 2, 3, 4, 5], [1.1, 2.2, 3.3, 4.4, 5.5]);
+--------------------------------------------------------------------------+
| array_first((x, y) -> x > y, [1, 2, 3, 4, 5], [1.1, 2.2, 3.3, 4.4, 5.5]) |
+--------------------------------------------------------------------------+
| NULL |
+--------------------------------------------------------------------------+

Complex type examples:

Nested array search, find the first sub-array with length greater than 2:

SELECT array_first(x -> size(x) > 2, [[1,2],[3,4,5],[6],[7,8,9,10]]);
+---------------------------------------------------------------+
| array_first(x -> size(x) > 2, [[1,2],[3,4,5],[6],[7,8,9,10]]) |
+---------------------------------------------------------------+
| [3, 4, 5] |
+---------------------------------------------------------------+

Map type search, find the first element where the value with key 'a' is greater than 10:

SELECT array_first(x -> x['a'] > 10, [{'a':5}, {'a':15}, {'a':20}]);
+---------------------------------------------------------------+
| array_first(x -> x['a'] > 10, [{'a':5}, {'a':15}, {'a':20}]) |
+---------------------------------------------------------------+
| {"a":15} |
+---------------------------------------------------------------+

Error when parameter count is wrong:

SELECT array_first(x -> x > 0, [1,2,3], [4,5,6], [7,8,9]);
ERROR 1105 (HY000): errCode = 2, detailMessage = lambda x -> (x > 0) arguments' size is not equal parameters' size

Error when the number of parameters in lambda expression doesn't match the number of array parameters:

SELECT array_first((x, y) -> x > y, [1,2,3], [4,5]);
ERROR 1105 (HY000): errCode = 2, detailMessage = (10.16.10.6)[INVALID_ARGUMENT]in array map function, the input column size are not equal completely, nested column data rows 1st size is 3, 2th size is 2.

Error when passing non-array type:

SELECT array_first(x -> x > 0, 'not_an_array');
ERROR 1105 (HY000): errCode = 2, detailMessage = lambda argument must be array but is 'not_an_array'

Keywords

ARRAY, FIRST, ARRAY_FIRST