Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
430 views
in Technique[技术] by (71.8m points)

Remove double quotes after =" and before " in spark scala column

I have a spark column which has the data something as below.

Agent="iee/500.0 (OS X 10_15_6) ("HTML", like) Version/144.0.1 Safari/000.1.0099", Status="null", Search_Type="null", Identifier="null", Mode="null", Activation="null", First_Name="null", Last_Name="null", Request="null", Code="null", Email="null"

So basically i split based (",) and create different columns.

But for Agent="iee/500.0 (OS X 10_15_6) ("HTML", like) Version/144.0.1 Safari/000.1.0099", i have a extra quotes in between and hence it splits after html.

Can you please suggest me some regex to handle the quotes

Input : Agent="iee/500.0 (OS X 10_15_6) ("HTML", like) Version/144.0.1 Safari/000.1.0099", Status="null", Search_Type="null", Identifier="null", Mode="null", Activation="null", First_Name="null", Last_Name="null", Request="null", Code="null", Email="null"

Expected Output : Agent="iee/500.0 (OS X 10_15_6) (HTML, like) Version/144.0.1 Safari/000.1.0099", Status="null", Search_Type="null", Identifier="null", Mode="null", Activation="null", First_Name="null", Last_Name="null", Request="null", Code="null", Email="null"

The extra quotes can be expected to come in any column and not just with the word html

question from:https://stackoverflow.com/questions/65842595/remove-double-quotes-after-and-before-in-spark-scala-column

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)
Waitting for answers

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...