Certainty factor model in paraphrase detection


Kumova Metin S., Karaoğlan B. , Kışla T.

Pamukkale University Journal of Engineering Sciences, vol.27, no.2, pp.139-150, 2021 (Journal Indexed in ESCI) identifier

  • Publication Type: Article / Article
  • Volume: 27 Issue: 2
  • Publication Date: 2021
  • Doi Number: 10.5505/pajes.2020.75350
  • Title of Journal : Pamukkale University Journal of Engineering Sciences
  • Page Numbers: pp.139-150
  • Keywords: Paraphrase, Paraphrase detection, Certainty factor, Evidence, Evidence selection

Abstract

In this paper, we address the problem of uncertainty management in identification of paraphrase sentence pairs. Paraphrase sentences are simply sets/pairs of sentences that express the same facts and/or opinions using different words or order of words. We propose the use of certainty factor (CF) model in paraphrase detection. A set of succeeding paraphrase detection features (generic and distance based features) is built by filtering and this set is used as evidences in CF model. The CF model is evaluated by F1 and accuracy measures on Microsoft Research Paraphrase corpus. The results are compared to the well-known Bayesian reasoning. The experimental results showed that CF model is an alternating paraphrase detection method to Bayes model.