reinforcement learning - Validation worse than training - OGeek

In a classic-style OpenAI Gym environment, the model trains well and completes the task. Evaluating it on unseen data gives considerably worse results. What I have tried so far:

Added random noise (within +/- 2%) to the observations during both training and validation, to keep the model from memorizing them.
Applied the same normalization to both sets (fit on the training data, then used to transform both training and validation data), and also tried no normalization at all.

None of this changed the results. Any idea what the cause could be, or what else I can try?
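
For reference, the two preprocessing steps described above could look roughly like the sketch below. It is only an illustration and not the asker's code: the helper names (fit_normalizer, normalize, add_relative_noise) are hypothetical, the arrays are random placeholder data, and the "+/- 2%" noise is assumed to be multiplicative (each value scaled by a uniform factor between 0.98 and 1.02).

```python
import numpy as np

def fit_normalizer(train_obs):
    """Compute per-feature mean and std from the training observations only."""
    mean = train_obs.mean(axis=0)
    std = train_obs.std(axis=0)
    # Guard against division by zero for constant features.
    std = np.where(std < 1e-8, 1.0, std)
    return mean, std

def normalize(obs, mean, std):
    """Apply statistics that were fitted on the training set."""
    return (obs - mean) / std

def add_relative_noise(obs, rng, scale=0.02):
    """Scale each value by a uniform random factor in [1 - scale, 1 + scale]."""
    factor = rng.uniform(1.0 - scale, 1.0 + scale, size=obs.shape)
    return obs * factor

rng = np.random.default_rng(0)

# Placeholder observations standing in for the real training/validation data.
train_obs = rng.normal(loc=5.0, scale=2.0, size=(1000, 4))
val_obs = rng.normal(loc=5.0, scale=2.0, size=(200, 4))

# Fit on training data only, then transform both sets with the same statistics.
mean, std = fit_normalizer(train_obs)
train_in = normalize(add_relative_noise(train_obs, rng), mean, std)
val_in = normalize(add_relative_noise(val_obs, rng), mean, std)
```

Fitting the statistics on the training set alone keeps validation data from leaking into preprocessing, which matches the "fit to training, transform both" setup in the question.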

Question from: https://stackoverflow.com/questions/66060521/validation-worse-than-training

