pcre supports utf8 out of the box, see documentation for the 'u' modifier.
Illustration (xC3xA4 is the utf8 encoding for the german letter "?")
echo preg_replace('~w~', '@', "axC3xA4b");
this echoes "@@¤@" because "xC3" and "xA4" were treated as distinct symbols
echo preg_replace('~w~u', '@', "axC3xA4b");
(note the 'u') prints "@@@" because "xC3xA4" were treated as a single letter.
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…