python - How can I reduce memory usage of Scikit-Learn Vectorizers?

1 Reply

深蓝 · Answer 1 · 2021-10-23T19:32:09+0000

I would strongly recommend using the HashingVectorizer when fitting models on large datasets.

The HashingVectorizer is data independent: it stores no vocabulary, so only the parameters from vectorizer.get_params() matter. Hence (un)pickling a `HashingVectorizer` instance should be very fast.
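To make the point above concrete, here is a minimal sketch. The toy corpus and the `n_features=2**18` setting are my own assumptions for illustration, not something from the question. It shows that the vectorizer can transform text without being fitted and that a pickle round trip preserves both the parameters and the output.

```python
import pickle

from sklearn.feature_extraction.text import HashingVectorizer

# Hypothetical toy corpus, only for illustration.
docs = [
    "memory usage of text vectorizers",
    "hashing avoids storing a vocabulary",
]

# n_features fixes the output width up front; 2**18 is an arbitrary choice here.
vectorizer = HashingVectorizer(n_features=2**18)

# No fit step is needed: transform() hashes tokens directly to column indices.
X = vectorizer.transform(docs)

# The pickle holds only the constructor parameters, nothing learned from docs.
payload = pickle.dumps(vectorizer)
restored = pickle.loads(payload)

# Both checks should hold: same parameters, identical sparse output.
same_params = restored.get_params() == vectorizer.get_params()
same_output = (restored.transform(docs) != X).nnz == 0
print(X.shape, same_params, same_output)
```

Since nothing is learned from the data, the size of the pickle does not depend on how much text you push through the vectorizer.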

The vocabulary-based vectorizers (such as CountVectorizer and TfidfVectorizer) are better suited for exploratory analysis on small datasets.
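As a rough comparison, the sketch below fits a CountVectorizer on a synthetic corpus and compares its pickled size with an unfitted HashingVectorizer. The corpus is made up and the exact byte counts will vary with your scikit-learn version, so treat it as an illustration rather than a benchmark.

```python
import pickle

from sklearn.feature_extraction.text import CountVectorizer, HashingVectorizer

# Hypothetical synthetic corpus: 5,000 documents, each adding one unique token.
docs = [f"shared token{i}" for i in range(5000)]

count_vec = CountVectorizer().fit(docs)
hash_vec = HashingVectorizer()  # stateless, nothing to fit

# The fitted vocabulary maps each token to a column index, which is what
# makes this kind of vectorizer convenient for inspecting features.
vocab_size = len(count_vec.vocabulary_)

# The vocabulary lives on the instance, so the pickle grows with it.
count_bytes = len(pickle.dumps(count_vec))
hash_bytes = len(pickle.dumps(hash_vec))
print(vocab_size, count_bytes, hash_bytes)
```

The trade-off is that with hashing you cannot map a column back to its token, which is why the vocabulary-based classes remain handy for exploration.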
