I'm writing a program that works with documents in Perl and a lot of the documents have characters such as ?, ?, ü, é, etc
(both capital and lowercase). I'd like to replace them with ASCII counterparts a, o, u, e, etc
. How would I do it in Perl?
One of the solutions I thought of is to have a hash with keys being the umlaut and accent characters, and the values being ASCII counterparts, but that requires me to have a list of all umlaut and accent characters, which I don't have, and if I built a list, I'd certainly miss many as I'm unfamiliar with all the possible characters that could have umlauts, accents and other diacritics.
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…