We know that BERT has a maximum input length of 512 tokens. So if an article is much longer than that, say 10,000 tokens of text, how can BERT be used?
You have basically three options:
I would suggest trying option 1 first, and only considering the other options if that is not good enough.
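For illustration, one common way to fit a long article into the 512-token limit is to split its tokens into overlapping windows and run BERT on each window separately. The sketch below is only that, a sketch: the function name `split_into_windows`, the `stride` of 256, and the assumption that two positions are reserved for `[CLS]` and `[SEP]` are all illustrative choices, and it may not correspond to any of the options meant above. It works on plain lists of token ids, so no particular tokenizer is assumed.

```python
def split_into_windows(token_ids, max_len=512, stride=256, num_special=2):
    """Split a long list of token ids into overlapping windows.

    Each window holds at most max_len - num_special tokens, leaving room
    for the special tokens (assumed here to be [CLS] and [SEP]).
    """
    body = max_len - num_special
    if not 0 < stride <= body:
        raise ValueError("stride must be between 1 and max_len - num_special")
    windows = []
    start = 0
    while True:
        windows.append(token_ids[start:start + body])
        # Stop once the current window reaches the end of the sequence.
        if start + body >= len(token_ids):
            break
        start += stride
    return windows


# Stand-in for a tokenized 10,000-token article (hypothetical ids).
tokens = list(range(10000))
windows = split_into_windows(tokens)
```

How the per-window outputs are combined afterwards (for example averaging or taking the maximum over window predictions) is a separate design choice that depends on the task.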