我有一个数据集,其中我需要保持相同的粒度,但我需要根据一组条件修复一些行
当这些客户从“活动”转换到“已取消”时,我需要在我期望的输出为"DATE_NEW“之后的每一行中保存我在每个转换中看到的第一个取消日期-您可以看到该日期与您从A -> C转换为状态时看到的第一个日期相同。
示例:
row_number,Customer,Status, Date, DATE_NEW
1,John,"A","3000-12-31","3000-12-31"
2,John,"C","2019-01-01","2019-01-01"
3,John,"A","3000-12-31","3000-12-31",
4,John,"C","2019-05-01","2019-05-01"
5,John,"C","2019-07-31","2019-05-01"
6,Eve,"A","3000-12-31","3000-12-31"
7,Eve,"C","2019-06-01","2019-06-01"
8,Eve,"C","2019-03-01","2019-06-01"
9,Eve,"C","2019-03-02","2019-06-01"
https://stackoverflow.com/questions/57386762
复制相似问题