Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


sql - SparkSQLContext dataframe Select query based on column array

This is my dataframe:

  authors: array (nullable = true)
    element: string (containsNull = true)

I want to select all books where the author is Udo Haiber.

spark.sql("select *  from f  where authors="Udo Haiber" ").show

But of course this didn't work, because authors is an array.

Question from: https://stackoverflow.com/questions/65847587/sparksqlcontext-dataframe-select-query-based-on-column-array


1 Reply


You can use array_contains to check if the author is inside the array:

spark.sql("select * from f where array_contains(authors, 'Udo Haiber')")

Use single quotes around the author name, because the query string itself is delimited by double quotes; a nested double quote would end the string early.
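For comparison, here is a minimal sketch of the same filter written both as SQL and with the DataFrame API, using array_contains from org.apache.spark.sql.functions. The sample rows, the title column, and the names AuthorFilter and booksBy are made up for illustration; only the authors column and the view name f come from the question. It assumes Spark is on the classpath and runs in local mode.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{array_contains, col}

object AuthorFilter {
  // Hypothetical helper: keep rows whose `authors` array contains `author`.
  def booksBy(books: DataFrame, author: String): DataFrame =
    books.filter(array_contains(col("authors"), author))

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")          // assumption: local run, not a cluster
      .appName("author-filter")
      .getOrCreate()
    import spark.implicits._

    // Made-up sample data; the real table is not shown in the question.
    val books = Seq(
      ("Book A", Seq("Udo Haiber", "Someone Else")),
      ("Book B", Seq("Another Author"))
    ).toDF("title", "authors")
    books.createOrReplaceTempView("f")

    // SQL form, as in the answer above.
    spark.sql("select * from f where array_contains(authors, 'Udo Haiber')").show()

    // DataFrame API form; the author is a plain Scala string, so no SQL quoting is involved.
    booksBy(books, "Udo Haiber").show()

    spark.stop()
  }
}
```

The DataFrame form sidesteps the quoting issue entirely, which may be convenient when the author name comes from a variable.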

