Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


ruby - Zero-length string being returned from String#split

In Ruby 1.9.3 (and probably earlier versions, though I'm not sure), I'm trying to figure out why String#split gives me certain results. They seem counter-intuitive to me. Here's an example:

"abcabc".split("b") #=> ["a", "ca", "c"]
"abcabc".split("a") #=> ["", "bc", "bc"]
"abcabc".split("c") #=> ["ab", "ab"]

Here, the first example returns exactly what I would expect.

But in the second example, I'm confused as to why #split is returning a zero-length string as the first value of the returned array. What is the reasoning for this? This is what I would expect instead:

"abcabc".split("a") #=> ["bc", "bc"]

Along the same lines, why is no trailing zero-length string returned in the third example? If the second example returns a zero-length string as its first value, then the third example should return one as its last value.

What am I missing here?

EDIT: Looking into it more, I realize why this is the default behavior and why my thinking was wrong. If we were parsing a CSV file, for example, and splitting each row into columns, our data would be thrown off if empty leading columns were ignored.
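
To make the CSV point concrete, here is a minimal sketch. The header and row strings are made up for illustration, and `reject(&:empty?)` only simulates what a split that ignored empty leading columns would produce.

```ruby
header = "id,name,age"
row    = ",Smith,42"   # hypothetical row whose first column is empty

# split keeps the leading empty field, so each value stays under its column.
fields  = row.split(",")
aligned = Hash[header.split(",").zip(fields)]

# Simulate a split that discards the empty leading column:
# every value shifts one column to the left and the last column gets nil.
shifted = Hash[header.split(",").zip(fields.reject(&:empty?))]

p aligned
p shifted
```

In the first hash "Smith" stays under "name". In the second it ends up under "id", which is the misalignment described above.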

It's also worth noting that this question isn't specific to Ruby. I'm learning that many other languages behave in the same way; I just happened to be using Ruby when I ran into it.


1 Reply


The Ruby 1.9 documentation for String#split says:

If the limit parameter is omitted, trailing null fields are suppressed.

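So if we take your third example:

 "abcabc".split("c") #=> ["ab", "ab"]

And we include a negative limit value:

 "abcabc".split("c", -1)  #=> ["ab", "ab", ""]

You get the trailing empty string you expected. Only trailing null fields are suppressed when the limit is omitted; the leading empty string in your second example is kept either way.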
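
As a quick check, the sketch below puts the cases from the question side by side. The variable names are illustrative, and the final `reject(&:empty?)` line is just one possible way to get the result the question originally expected, not something split does itself.

```ruby
s = "abcabc"

# Nothing precedes the first "a", so the first field is an empty string.
leading = s.split("a")

# Nothing follows the last "c", but the trailing empty field is
# suppressed because no limit is given.
trailing_default = s.split("c")

# A negative limit keeps trailing empty fields.
trailing_kept = s.split("c", -1)

# To drop empty fields entirely, filter them out explicitly.
no_empties = s.split("a").reject(&:empty?)

p leading, trailing_default, trailing_kept, no_empties
```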

