Hey Guys, I read a paper about Rumor detection and they used BERT as an unsupervised language representation, fine-tuning it using a small dataset, and combining it with a supervised learning model to provide an enriched text representation of the content of the rumor, Now I achieved the same accuracy using sentence-Bert, what are the differences between these two models and I want to know if their model is slower than mine or not, they did not mention the speed on their paper. they also have another paper with lower accuracy can I mention the paper that has lower accuracy to compare it with mine?