数据样式,
| user | apps |
| X123 | ["123,微信,abc","234,QQ,bcd"] |
抽取apps字段中的“微信”和“QQ”,目标数据如下,
| user | apps |
| X123 | QQ,微信 |
思路:按","将apps字段进行拆分成行,然后在按,进行分割后取第二部分,
select
user
,concat_ws(',', collect_set(split(app, '\\,')[1])) AS applist --从0开始计数,第二部分标号为1
from (
select 'X123' as user ,'["123,微信,abc","234,QQ,bcd"]' as apps
)a
LATERAL VIEW explode(split(apps, '\\"\\,\\"')) a AS app
group by user
版权声明:本文为u012735708原创文章,遵循CC 4.0 BY-SA版权协议,转载请附上原文出处链接和本声明。