从 DB2 迁移到 MySQL

Question

Lennart - Slava Ukraini

Asked: 2020-07-14 01:54:17 +0800 CST2020-07-14 01:54:17 +0800 CST 2020-07-14 01:54:17 +0800 CST

last_value 在具有附加维度的左连接中忽略 null？

772

这只是我开始考虑的一个愚蠢的例子，但似乎无法提出一个优雅的解决方案。假设一个日历表（我将使用 Db2 语法）：

create table calendar
( dt date not null primary key
);

insert into calendar (dt)
with t (dt) as ( values cast('2020-01-01' as date)
                 union all
                 select dt + 1 day from t where dt < '2020-01-11')
select dt from t                 
;

和一个保持平衡的表：

create table balance 
( dt date not null
, amount int not null
,     primary key (dt)
);

insert into balance (dt, amount)
values ('2020-01-03',100) ,('2020-01-05', -50);

如果我们想复制最后一个已知余额，我们可以使用 LAST_VALUE 和 'IGNORE NULLS'，例如：

select c.dt, last_value(b.amount, 'IGNORE NULLS') over (order by c.dt)
from calendar c
left join balance b
    on c.dt = b.dt;

但是，如果我们添加一个维度，比如 cid (customer_id)，那么 last_value 的含义就不再明显了。通常我们会按 cid 进行分区，但 cid 由于左连接而丢失：

create table balance1 
( cid int not null
, dt date not null
, amount int not null
, primary key (cid, dt)
);

insert into balance1 (cid, dt, amount)
values (1, '2020-01-03',100) ,(1, '2020-01-05', -50)
     , (2, '2020-01-04',75), (2, '2020-01-08',55), (2, '2020-01-10', -35);


select c.dt, last_value(b.amount, 'IGNORE NULLS') over (partition by ? 
                                                        order by c.dt)
from calendar c
left join balance b
    on c.dt = b.dt;

我能想到的最好办法是在日历和余额中的不同客户之间使用笛卡尔积：

select cid, dt, last_value(amount, 'IGNORE NULLS') over (partition by cid order by dt)
from (
  select cid, dt, amount from balance1
  union
  select distinct b.cid, c.dt, null 
  from balance1 b
  cross join calendar c
  where not exists (
    select 1 from balance1 
    where dt = c.dt and cid = b.cid
  )
) t  
order by dt, cid
;

不是那么漂亮，我正在寻找一个更优雅的解决方案。我在上面和这个Fiddle中使用了 Db2 语法，但这是我所追求的原则，所以任何供应商语法都可以。

1 个回答

Voted

Pavol Adam · Answer 1 · 2020-07-15T04:21:11+08:00

Best Answer

Pavol Adam

2020-07-15T04:21:11+08:002020-07-15T04:21:11+08:00

为什么不是这个？

SELECT def.cid, c.dt, LAST_VALUE(b.amount, 'IGNORE NULLS') OVER(PARTITION BY def.cid ORDER BY c.dt)
  FROM calendar c
    INNER JOIN (SELECT DISTINCT cid FROM balance1) AS def(cid) ON 1=1
    LEFT JOIN balance1 b ON (c.dt, def.cid) = (b.dt, b.cid)
  ORDER BY c.dt, def.cid
;

dbfiddle中的比较

1

last_value 在具有附加维度的左连接中忽略 null？

连接到 PostgreSQL 服务器：致命：主机没有 pg_hba.conf 条目

如何让sqlplus的输出出现在一行中？

选择具有最大日期或最晚日期的日期

如何列出 PostgreSQL 中的所有模式？

列出指定表的所有列

如何在不修改我自己的 tnsnames.ora 的情况下使用 sqlplus 连接到位于另一台主机上的 Oracle 数据库

你如何mysqldump特定的表？

使用 psql 列出数据库权限

如何从 PostgreSQL 中的选择查询中将值插入表中？

如何使用 psql 列出所有数据库和表？

last_value 在具有附加维度的左连接中忽略 null？

1 个回答

相关问题