Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
266 views
in Technique[技术] by (71.8m points)

sql - Very slow select query on array column on Impala table

Query on array column Impala table, works very slow. These queries are memory and CPU hungry. Is any optimized way/technique to speedup these query?

We had sample below table

Owner Pets
Sam cat
Sam dog
Sam parrot
Tom elephant
Tom horse
Tom pig
Tom cat

And used to query the data as :

Select owner, count(1)
from people
group by owner;

We moved to new Impala table with array column.

Owner Pets
Sam cat,dog,parrot
Tom elephant,horse,pig,cat

And used to query the data as :

Select owner, count(1) 
from people, people.pets
group by owner

I am querying approximate 10 GBs of data at a time.

Select owner, count(1) from people group by owner =>Working good

Select owner, count(1) from people, people.pets group by owner => performance is very bad.

Any optimized way/technique to speedup array column query, it helps lot, thanks.

question from:https://stackoverflow.com/questions/66051037/very-slow-select-query-on-array-column-on-impala-table

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)
Waitting for answers

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...