postgresql 检查Postgres数组中是否存在NULL

voase2hg 于 2023-08-04 发布在 PostgreSQL

关注(0)|答案(4)|浏览(136)

与this question类似，如何查找数组中是否存在NULL值？
这里有一些尝试。

SELECT num, ar, expected,
  ar @> ARRAY[NULL]::int[] AS test1,
  NULL = ANY (ar) AS test2,
  array_to_string(ar, ', ') <> array_to_string(ar, ', ', '(null)') AS test3
FROM (
  SELECT 1 AS num, '{1,2,NULL}'::int[] AS ar, true AS expected
  UNION SELECT 2, '{1,2,3}'::int[], false
) td ORDER BY num;

 num |     ar     | expected | test1 | test2 | test3
-----+------------+----------+-------+-------+-------
   1 | {1,2,NULL} | t        | f     |       | t
   2 | {1,2,3}    | f        | f     |       | f
(2 rows)

字符串
只有array_to_string的技巧显示了期望值。有没有更好的方法来测试这个？

postgresql

来源：https://stackoverflow.com/questions/34848009/check-if-null-exists-in-postgres-array

4条答案

按热度按时间

oknwwptz1#

Postgres 9.5或更高版本

使用**array_position()**。基本上：

SELECT array_position(arr, NULL) IS NOT NULL AS array_has_null

字符串
请参见下面的演示。

Postgres 9.3或更高版本

您可以使用内置函数array_remove()或array_replace()进行测试。

Postgres 9.1或任意版本

如果你 * 知道 * 一个元素永远不可能存在于你的数组中，你可以使用这个 fast 表达式。假设你有一个正数数组，-1永远不会在里面：

-1 = ANY(arr) IS NULL

型
相关答案及详细解释：

在PostgreSQL中数组是否都是NULL

如果你不能绝对确定，你可以使用unnest()中一个昂贵但安全的方法。喜欢的：

(SELECT bool_or(x IS NULL) FROM unnest(arr) x)

型
或者：

EXISTS (SELECT 1 FROM unnest(arr) x WHERE x IS NULL)

型
但是，您可以使用CASE表达式获得 * 快速和安全 *。使用一个不太可能的数字，并回退到安全的方法，如果它应该存在。您可能需要单独处理arr IS NULL案例。请参见下面的演示。

Demo

SELECT num, arr, expect
     , -1 = ANY(arr) IS NULL                                    AS t_1   --  50 ms
     , (SELECT bool_or(x IS NULL) FROM unnest(arr) x)           AS t_2   -- 754 ms
     , EXISTS (SELECT 1 FROM unnest(arr) x WHERE x IS NULL)     AS t_3   -- 521 ms
     , CASE -1 = ANY(arr)
         WHEN FALSE THEN FALSE
         WHEN TRUE THEN EXISTS (SELECT 1 FROM unnest(arr) x WHERE x IS NULL)
         ELSE NULLIF(arr IS NOT NULL, FALSE)  -- catch arr IS NULL       --  55 ms
      -- ELSE TRUE  -- simpler for columns defined NOT NULL              --  51 ms
       END                                                      AS t_91
     , array_replace(arr, NULL, 0) <> arr                       AS t_93a --  99 ms
     , array_remove(arr, NULL) <> arr                           AS t_93b --  96 ms
     , cardinality(array_remove(arr, NULL)) <> cardinality(arr) AS t_94  --  81 ms
     , COALESCE(array_position(arr, NULL::int), 0) > 0          AS t_95a --  49 ms
     , array_position(arr, NULL) IS NOT NULL                    AS t_95b --  45 ms
     , CASE WHEN arr IS NOT NULL
            THEN array_position(arr, NULL) IS NOT NULL END      AS t_95c --  48 ms
FROM  (
   VALUES (1, '{1,2,NULL}'::int[], true)     -- extended test case
        , (2, '{-1,NULL,2}'      , true)
        , (3, '{NULL}'           , true)
        , (4, '{1,2,3}'          , false)
        , (5, '{-1,2,3}'         , false)
        , (6, NULL               , null)
   ) t(num, arr, expect);

型
测试结果：

num |  arr        | expect | t_1    | t_2  | t_3 | t_91 | t_93a | t_93b | t_94 | t_95a | t_95b | t_95c
-----+-------------+--------+--------+------+-----+------+-------+-------+------+-------+-------+-------
   1 | {1,2,NULL}  | t      | t      | t    | t   | t    | t     | t     | t    | t     | t     | t
   2 | {-1,NULL,2} | t      | **f** --!! | t    | t   | t    | t     | t     | t    | t     | t     | t
   3 | {NULL}      | t      | t      | t    | t   | t    | t     | t     | t    | t     | t     | t
   4 | {1,2,3}     | f      | f      | f    | f   | f    | f     | f     | f    | f     | f     | f
   5 | {-1,2,3}    | f      | f      | f    | f   | f    | f     | f     | f    | f     | f     | f
   6 | NULL        | NULL   | **t** --!! | NULL | f   | NULL | NULL  | NULL  | NULL | f     | f     | NULL

型
注意array_remove()和array_position()不允许用于多维数组。t_93a右边的所有表达式都只适用于一维数组。

db<>fiddle here * - Postgres 13，更多测试

老麻雀

Benchmark设置

添加的时间来自Postgres 9.5中的200 k行基准测试。这是我的设置：

CREATE TABLE t AS
SELECT row_number() OVER() AS num
     , array_agg(elem) AS arr
     , bool_or(elem IS NULL) AS expected
FROM  (
   SELECT CASE WHEN random() > .95 THEN NULL ELSE g END AS elem  -- 5% NULL VALUES
        , count(*) FILTER (WHERE random() > .8)
                   OVER (ORDER BY g) AS grp  -- avg 5 element per array
   FROM   generate_series (1, 1000000) g  -- increase for big test case
   ) sub
GROUP  BY grp;

型

函数封装

为了重复使用，我会在Postgres9.5中创建一个函数如下：

CREATE OR REPLACE FUNCTION f_array_has_null (anyarray)
  RETURNS bool
  LANGUAGE sql IMMUTABLE PARALLEL SAFE AS
 'SELECT array_position($1, NULL) IS NOT NULL';

型
PARALLEL SAFE仅适用于Postgres 9.6或更高版本。
使用多态输入类型，这适用于 * 任何 * 数组类型，而不仅仅是int[]。
将其设置为IMMUTABLE以允许性能优化和索引表达式。

PostgreSQL是否支持“重音不敏感”排序规则？

但不要将其设置为STRICT，这将禁用“函数内联”并损害性能，因为array_position()本身不是STRICT。请参阅：

Function executes faster without STRICT modifier?的

如果您需要捕获案例arr IS NULL：

CREATE OR REPLACE FUNCTION f_array_has_null (anyarray)
  RETURNS bool
  LANGUAGE sql IMMUTABLE PARALLEL SAFE AS
 'SELECT CASE WHEN $1 IS NOT NULL
              THEN array_position($1, NULL) IS NOT NULL END';

型
对于Postgres9.1，使用上面的t_91表达式。其余适用不变。
密切相关：

How to determine if NULL is contained in an array in Postgres?的

赞(0）回复(0）举报 2023-08-04

piwo6bdm2#

PostgreSQL的UNNEST（）函数是一个更好的选择。你可以写一个简单的函数来检查数组中的NULL值。

create or replace function NULL_EXISTS(val anyelement) returns boolean as
$$
select exists (
    select 1 from unnest(val) arr(el) where el is null
);
$$
language sql

字符串
比如说，

SELECT NULL_EXISTS(array [1,2,NULL])
      ,NULL_EXISTS(array [1,2,3]);

型
测试结果：

null_exists null_exists 
----------- -------------- 
t           f

型
所以，您可以在查询中使用NULL_EXISTS()函数，如下所示。

SELECT num, ar, expected,NULL_EXISTS(ar)
FROM (
  SELECT 1 AS num, '{1,2,NULL}'::int[] AS ar, true AS expected
  UNION SELECT 2, '{1,2,3}'::int[], false
) td ORDER BY num;

型

赞(0）回复(0）举报 2023-08-04

vwoqyblh3#

PostgreSQL 9.5（我知道你指定了9.1，但无论如何）有array_position()函数来做你想做的事情，而不必使用效率极低的unnest()来处理像这样微不足道的事情（参见test4）：

patrick@puny:~$ psql -d test
psql (9.5.0)
Type "help" for help.

test=# SELECT num, ar, expected,
  ar @> ARRAY[NULL]::int[] AS test1,
  NULL = ANY (ar) AS test2,
  array_to_string(ar, ', ') <> array_to_string(ar, ', ', '(null)') AS test3,
  coalesce(array_position(ar, NULL::int), 0) > 0 AS test4
FROM (
  SELECT 1 AS num, '{1,2,NULL}'::int[] AS ar, true AS expected
  UNION SELECT 2, '{1,2,3}'::int[], false
) td ORDER BY num;
 num |     ar     | expected | test1 | test2 | test3 | test4
-----+------------+----------+-------+-------+-------+-------
   1 | {1,2,NULL} | t        | f     |       | t     | t
   2 | {1,2,3}    | f        | f     |       | f     | f
(2 rows)

字符串

赞(0）回复(0）举报 2023-08-04

zzlelutf4#

我用这个

select 
    array_position(array[1,null], null) is not null

字符串
array_position-返回数组中第二个参数第一次出现的下标，从第三个参数指示的元素或第一个元素开始（数组必须是一维的）

赞(0）回复(0）举报 2023-08-04

我来回答

postgresql 检查Postgres数组中是否存在NULL

4条答案

Postgres 9.5或更高版本

Postgres 9.3或更高版本

Postgres 9.1或任意版本

Demo

Benchmark设置

函数封装

相关问题

热门标签

最新问答