如何读取和分离配置单元表列中的非ascii字符

1rhkuytd  于 2021-06-24  发布在  Hive
关注(0)|答案(1)|浏览(368)

如果列包含这种格式的值,如何用逗号分隔列值 ổ,đĩa,á,mh,có .
输出应为 ổ đĩa á mh có

nhaq1z21

nhaq1z211#

使用拆分函数:

select splitted[0] as col1, 
       splitted[1] as col2,
       splitted[2] as col3,
       splitted[3] as col4,
       splitted[4] as col5
from
(
select split('ổ,đĩa,á,mh,có',',') as splitted
)s;

退货:

OK
ổ       đĩa     á       mh      có
Time taken: 0.097 seconds, Fetched: 1 row(s)

你的逗号好像不一样。它不是ascii 44字符。从字符串中复制粘贴逗号后,可以正常工作:

select split('Música,Padre-Hijo,Fe','‚') ;
OK
["MÃÃ","ºsicaïÃ","¼Ã","Å’Padre-HijoïÃ","¼Ã","Å’Fe"]

相关问题