Easiest way is to use something like this which the dump (in short is the text version of viewable HTML).
Remote file:
lynx --dump www.google.com > file.txt
links -dump www.google.com
Local file:
lynx --dump ./1.html > file.txt
links -dump ./1.htm
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…