I'm processing some English texts in a Java application, and I need to stem them.
For example, from the text "amenities/amenity" I need to get "amenit".
The function looks like:
String stemTerm(String term){
...
}
I've found the Lucene Analyzer, but it looks way too complicated for what I need.
http://lucene.apache.org/java/2_2_0/api/org/apache/lucene/analysis/PorterStemFilter.html
Is there a way to use it to stem words without building an Analyzer? I don't understand all the Analyzer business...
EDIT: I actually need a stemming + lemmatization. Can Lucene do this?
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…