I suggest you use start conditions instead.
%x C_COMMENT
"/*" { BEGIN(C_COMMENT); }
<C_COMMENT>"*/" { BEGIN(INITIAL); }
<C_COMMENT>
{ }
<C_COMMENT>. { }
Do note that there must not be any whitespace between the <condition>
and the rule.
%x C_COMMENT
defines the C_COMMENT state, and the rule /*
has it start. Once it's started, */
will have it go back to the initial state (INITIAL
is predefined), and every other characters will just be consumed without any particular action. When two rules match, Flex disambiguates by taking the one that has the longest match, so the dot rule does not prevent */
from matching. The
rule is necessary because a dot matches everything except a newline.
The %x
definition makes C_COMMENT an exclusive state, which means the lexer will only match rules that are "tagged" <C_COMMENT>
once it enters the state.
Here is a tiny example lexer that implements this answer by printing everything except what's inside /* comments */
.
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…