Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
188 views
in Technique[技术] by (71.8m points)

Python pandas split array into cloumn(s)

I am trying to split a csv data containing data with arrays into multiple columns. This works perfectly for most arrays since these are whole numbers, but if I try to split following array (containing dot values) I get a problem.

So here the example. Suppose you have the following array data saved in a column called "Array"

{58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5}

If I apply the following
```python
splitted_data=raw_data["Array"].str.split("D",expand=True).add_prefix("setrwds_x")

I get the follwing result

   setrwds_x1  setrwds_x2  setrwds_x4  setrwds_x5  setrwds_x7  setrwds_x8  
0             58           5          58           5          58           5   
1             58           5          58           5          58           5   
2             58           5          58           5          58           5   
3             58           5          58           5          58           5   
4             58           5          58           5          58           5   
5             58           5          58           5          58           5   
6             58           5          58           5          58           5   
7             58           5          58           5          58           5   
8             58           5          58           5          58           5   
9             58           5          58           5          58           5   
10            58           5          58           5          58           5   
11            58           5          58           5          58           5   
12            58           5          58           5          58           5   
13            58           5          58           5          58           5   
14            58           5          58           5          58           5   
15            58           5          58           5          58           5   
16            58           5          58           5          58           5   
17            58           5          58           5          58           5   

It splits the 58.5 into two columns, which is wrong. I need to keep the 58.5.

Do you guys have an advice how to solve the problem?

question from:https://stackoverflow.com/questions/65843372/python-pandas-split-array-into-cloumns

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

Try this. D in regex stands for non-digit, which includes | and ., you would want to explicitly split on | only. You also need to avoid the first and last bracket using str[1:-1]

raw_data["Array"].str[1:-1].str.split("|",expand=True).add_prefix("setrwds_x")

Tested this out with a dummy series -

#Dummy series
d = ['{58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5}', 
     '{58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5}', 
     '{58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5 |58.5}']
dd = pd.Series(d)

out = dd.str[1:-1].str.split("|",expand=True).add_prefix("setrwds_x")
print(out)
  setrwds_x0 setrwds_x1 setrwds_x2 setrwds_x3 setrwds_x4 setrwds_x5  
0      58.5       58.5       58.5       58.5       58.5       58.5    
1      58.5       58.5       58.5       58.5       58.5       58.5    
2      58.5       58.5       58.5       58.5       58.5       58.5    

  setrwds_x6 setrwds_x7 setrwds_x8 setrwds_x9 setrwds_x10 setrwds_x11  
0      58.5       58.5       58.5       58.5        58.5         58.5  
1      58.5       58.5       58.5       58.5        58.5         58.5  
2      58.5       58.5       58.5       58.5        58.5         58.5

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...