Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


0 votes
293 views
in Technique by (71.8m points)

weka - Training and test data have different numbers of attributes, which gives the error "Train and test set are not compatible"

I use WEKA for text classification. I have a training data set, to which I apply the StringToWordVector and NumericToNominal filters, and a test data set, to which I applied the same filters. When I try to apply my model to the test data, it gives me the following error: "Train and test set are not compatible". I searched for a solution and found that the error occurs because the number of attributes differs between the two sets. It will always differ, because the texts in the two sets are different, so the filter produces different word attributes for each.

How can I solve this error?



1 Reply

0 votes
by (71.8m points)

The best thing you can do is combine your training and test sets into one file and apply the filter to the whole file in one go. Then split the result back into the two sets and copy the @attribute declarations from the combined file into both the training and test files. This way the attributes will be consistent across both files.
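To illustrate why filtering the combined data helps, here is a minimal sketch in plain Java. It does not use WEKA's API; the class `SharedVocabulary` and its methods are hypothetical names made up for this example, and the tokenization is a simplifying assumption. It only shows the idea: when the word list is built once from all texts, every document maps onto the same attributes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

// Hypothetical illustration, not WEKA code: one vocabulary shared by both sets.
public class SharedVocabulary {

    // Collect every distinct lowercase word from all documents, in sorted order.
    static List<String> buildVocabulary(List<String> docs) {
        TreeSet<String> vocab = new TreeSet<>();
        for (String doc : docs) {
            for (String token : doc.toLowerCase().split("\\W+")) {
                if (!token.isEmpty()) {
                    vocab.add(token);
                }
            }
        }
        return new ArrayList<>(vocab);
    }

    // Map one document onto the fixed vocabulary as word counts.
    // Words that are not in the vocabulary are ignored.
    static int[] vectorize(String doc, List<String> vocab) {
        int[] counts = new int[vocab.size()];
        for (String token : doc.toLowerCase().split("\\W+")) {
            int index = vocab.indexOf(token);
            if (index >= 0) {
                counts[index]++;
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        List<String> train = List.of("good movie", "bad plot");
        List<String> test = List.of("good acting");

        // Build the attribute list once, from training and test texts together.
        List<String> combined = new ArrayList<>(train);
        combined.addAll(test);
        List<String> vocab = buildVocabulary(combined);

        // Both sets now yield vectors of the same length and attribute order.
        System.out.println(vocab);
        System.out.println(vectorize(train.get(0), vocab).length);
        System.out.println(vectorize(test.get(0), vocab).length);
    }
}
```

If the vocabulary were built separately for each set, the two word lists (and so the vector lengths) would differ, which is the situation the error message describes.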

