You can retrieve this information in an automated fashion from the official Unicode data file, UnicodeData.txt
, which is published here:
This is a file with semicolon-separated values in each line. The third column tells you the character class of each character.
The benefit of this is that you can get the character name for each character, so you have a better idea of what it is than by just looking at the character itself (e.g. would you know what ? is? That’s right, it’s Ban. In Georgian. :-)
)
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…