ruby - How to avoid tripping over UTF-8 BOM when reading files

1 Reply

深蓝 · Answer 1 · 2021-10-16T23:07:25+0000

With Ruby 1.9.2 or later, you can open the file with the mode r:bom|utf-8, which skips a leading UTF-8 BOM if there is one:

text_without_bom = nil # define the variable outside the block so the data outlives it
File.open('file.txt', 'r:bom|utf-8') { |file|
  text_without_bom = file.read
}

or

text_without_bom = File.read('file.txt', encoding: 'bom|utf-8')

or

text_without_bom = File.read('file.txt', mode: 'r:bom|utf-8')

It doesn't matter whether the file actually contains a BOM: if one is present it is skipped, otherwise the file is simply read as UTF-8.
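
To check this behaviour yourself, here is a minimal sketch. The helper read_back and the constant BOM are names made up for this illustration, and Tempfile.create with a block assumes Ruby 2.1 or later. It writes the same text with and without a BOM and reads it back:

```ruby
require 'tempfile'

# Hypothetical helper (not part of Ruby): writes raw bytes to a temp file
# and reads them back using the given open mode.
def read_back(bytes, mode)
  Tempfile.create('bom_demo') do |tmp|
    tmp.binmode
    tmp.write(bytes)
    tmp.flush
    File.read(tmp.path, mode: mode)
  end
end

BOM = "\xEF\xBB\xBF".b # the UTF-8 byte order mark as raw bytes

with_bom    = read_back(BOM + "hello\n".b, 'r:bom|utf-8')
without_bom = read_back("hello\n".b,       'r:bom|utf-8')
plain_mode  = read_back(BOM + "hello\n".b, 'r:utf-8')

p with_bom == without_bom          # same text either way
p plain_mode.start_with?("\uFEFF") # plain utf-8 mode keeps the BOM character
```

With plain r:utf-8 the BOM survives as the first character (U+FEFF) of the string, which is what usually trips up later comparisons or parsing.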

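You can also pass the encoding option to other methods. Pass it as a keyword, because the second positional argument of File.readlines is the line separator, not the open mode:

lines_without_bom = File.readlines(@filename, encoding: 'bom|utf-8')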

(This returns an array containing all lines.)

Or with CSV:

require 'csv'
CSV.open(@filename, 'r:bom|utf-8'){|csv|
  csv.each{ |row| p row }
}
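
With CSV the BOM matters most when you use headers, because it ends up glued to the first header name. A small sketch, assuming a made-up two-column file and a hypothetical helper csv_headers (neither is from the original answer):

```ruby
require 'csv'
require 'tempfile'

# Hypothetical helper: writes a small CSV file that starts with a UTF-8 BOM
# and returns the header names CSV sees when opened with the given mode.
def csv_headers(mode)
  Tempfile.create(['bom_demo', '.csv']) do |tmp|
    tmp.binmode
    tmp.write("\xEF\xBB\xBFname,city\nAda,London\n".b)
    tmp.flush
    CSV.open(tmp.path, mode, headers: true) { |csv| csv.first.headers }
  end
end

p csv_headers('r:bom|utf-8') # first header is the plain string "name"
p csv_headers('r:utf-8')     # first header still carries the BOM character
```

In the second case a lookup such as row['name'] would not find the column, since the key actually starts with U+FEFF.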
