DeFacto - Temporal and multilingual deep fact validation

Daniel Gerber; Diego Esteves; Jens Lehmann; Lorenz Bühmann; Ricardo Usbeck; Axel Cyrille Ngonga Ngomo; René Speck

doi:10.1016/j.websem.2015.08.001

DeFacto - Temporal and multilingual deep fact validation

Research output: Journal contributions › Journal articles › Research › peer-review

Standard

DeFacto - Temporal and multilingual deep fact validation. / Gerber, Daniel; Esteves, Diego; Lehmann, Jens et al.
In: Journal of Web Semantics, Vol. 35, 01.12.2015, p. 85-101.

Research output: Journal contributions › Journal articles › Research › peer-review

Harvard

Gerber, D, Esteves, D, Lehmann, J, Bühmann, L, Usbeck, R, Ngonga Ngomo, AC & Speck, R 2015, 'DeFacto - Temporal and multilingual deep fact validation', Journal of Web Semantics, vol. 35, pp. 85-101. https://doi.org/10.1016/j.websem.2015.08.001

APA

Gerber, D., Esteves, D., Lehmann, J., Bühmann, L., Usbeck, R., Ngonga Ngomo, A. C., & Speck, R. (2015). DeFacto - Temporal and multilingual deep fact validation. Journal of Web Semantics, 35, 85-101. https://doi.org/10.1016/j.websem.2015.08.001

Vancouver

Gerber D, Esteves D, Lehmann J, Bühmann L, Usbeck R, Ngonga Ngomo AC et al. DeFacto - Temporal and multilingual deep fact validation. Journal of Web Semantics. 2015 Dec 1;35:85-101. doi: 10.1016/j.websem.2015.08.001

Bibtex

@article{8019491d57554740b356920374c134b6,

title = "DeFacto - Temporal and multilingual deep fact validation",

abstract = "One of the main tasks when creating and maintaining knowledge bases is to validate facts and provide sources for them in order to ensure correctness and traceability of the provided knowledge. So far, this task is often addressed by human curators in a three-step process: issuing appropriate keyword queries for the statement to check using standard search engines, retrieving potentially relevant documents and screening those documents for relevant content. The drawbacks of this process are manifold. Most importantly, it is very time-consuming as the experts have to carry out several search processes and must often read several documents. In this article, we present DeFacto (Deep Fact Validation) - an algorithm able to validate facts by finding trustworthy sources for them on the Web. DeFacto aims to provide an effective way of validating facts by supplying the user with relevant excerpts of web pages as well as useful additional information including a score for the confidence DeFacto has in the correctness of the input fact. To achieve this goal, DeFacto collects and combines evidence from web pages written in several languages. In addition, DeFacto provides support for facts with a temporal scope, i.e., it can estimate in which time frame a fact was valid. Given that the automatic evaluation of facts has not been paid much attention to so far, generic benchmarks for evaluating these frameworks were not previously available. We thus also present a generic evaluation framework for fact checking and make it publicly available.",

keywords = "Fact validation, NLP, Provenance, Web of Data, Informatics, Business informatics",

author = "Daniel Gerber and Diego Esteves and Jens Lehmann and Lorenz B{\"u}hmann and Ricardo Usbeck and {Ngonga Ngomo}, {Axel Cyrille} and Ren{\'e} Speck",

note = "Publisher Copyright: {\textcopyright} 2015 Elsevier B.V.",

year = "2015",

month = dec,

day = "1",

doi = "10.1016/j.websem.2015.08.001",

language = "English",

volume = "35",

pages = "85--101",

journal = "Journal of Web Semantics",

issn = "1570-8268",

publisher = "Elsevier B.V.",

}

RIS

TY - JOUR

T1 - DeFacto - Temporal and multilingual deep fact validation

AU - Gerber, Daniel

AU - Esteves, Diego

AU - Lehmann, Jens

AU - Bühmann, Lorenz

AU - Usbeck, Ricardo

AU - Ngonga Ngomo, Axel Cyrille

AU - Speck, René

PY - 2015/12/1

Y1 - 2015/12/1

N2 - One of the main tasks when creating and maintaining knowledge bases is to validate facts and provide sources for them in order to ensure correctness and traceability of the provided knowledge. So far, this task is often addressed by human curators in a three-step process: issuing appropriate keyword queries for the statement to check using standard search engines, retrieving potentially relevant documents and screening those documents for relevant content. The drawbacks of this process are manifold. Most importantly, it is very time-consuming as the experts have to carry out several search processes and must often read several documents. In this article, we present DeFacto (Deep Fact Validation) - an algorithm able to validate facts by finding trustworthy sources for them on the Web. DeFacto aims to provide an effective way of validating facts by supplying the user with relevant excerpts of web pages as well as useful additional information including a score for the confidence DeFacto has in the correctness of the input fact. To achieve this goal, DeFacto collects and combines evidence from web pages written in several languages. In addition, DeFacto provides support for facts with a temporal scope, i.e., it can estimate in which time frame a fact was valid. Given that the automatic evaluation of facts has not been paid much attention to so far, generic benchmarks for evaluating these frameworks were not previously available. We thus also present a generic evaluation framework for fact checking and make it publicly available.

AB - One of the main tasks when creating and maintaining knowledge bases is to validate facts and provide sources for them in order to ensure correctness and traceability of the provided knowledge. So far, this task is often addressed by human curators in a three-step process: issuing appropriate keyword queries for the statement to check using standard search engines, retrieving potentially relevant documents and screening those documents for relevant content. The drawbacks of this process are manifold. Most importantly, it is very time-consuming as the experts have to carry out several search processes and must often read several documents. In this article, we present DeFacto (Deep Fact Validation) - an algorithm able to validate facts by finding trustworthy sources for them on the Web. DeFacto aims to provide an effective way of validating facts by supplying the user with relevant excerpts of web pages as well as useful additional information including a score for the confidence DeFacto has in the correctness of the input fact. To achieve this goal, DeFacto collects and combines evidence from web pages written in several languages. In addition, DeFacto provides support for facts with a temporal scope, i.e., it can estimate in which time frame a fact was valid. Given that the automatic evaluation of facts has not been paid much attention to so far, generic benchmarks for evaluating these frameworks were not previously available. We thus also present a generic evaluation framework for fact checking and make it publicly available.

KW - Fact validation

KW - NLP

KW - Provenance

KW - Web of Data

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84948698827&partnerID=8YFLogxK

U2 - 10.1016/j.websem.2015.08.001

DO - 10.1016/j.websem.2015.08.001

M3 - Journal articles

AN - SCOPUS:84948698827

VL - 35

SP - 85

EP - 101

JO - Journal of Web Semantics

JF - Journal of Web Semantics

SN - 1570-8268

ER -

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Salnikov, M., Sakhovskiy, A., Nikishina, I., Usmanova, A., Kraft, A., Möller, C., Banerjee, D., Huang, J., Jiang, L., Abdullah, R., Yan, X., Tutubalina, E., Usbeck, R. & Panchenko, A., 2026, Natural Language Processing and Information Systems: 30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, Proceedings. Ichise, R. (ed.). Springer Science and Business Media Deutschland, p. 95-110 16 p. (Lecture Notes in Computer Science; vol. 15836 LNCS).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Analyzing the Influence of Knowledge Graph Information on Relation Extraction.

Möller, C. & Usbeck, R., 2025

Research output: other publications › Other › Research

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Möller, C. & Usbeck, R., 2025, The Semantic Web: 22nd European Semantic Web Conference, ESWC 2025 Portoroz, Slovenia, June 1–5, 2025 Proceedings, Part I. Curry, E., Acosta, M., Poveda-Villalón, M., van Erp, M., Ojo, A., Hose, K., Shimizu, C. & Lisena, P. (eds.). Cham: Springer Nature Switzerland AG, Vol. 1. p. 460-480 21 p. (Lecture Notes in Computer Science ; vol. 15718).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

ASK-DBLP: Answering Questions over DBLP

Taffa, T., Neises, P., Ollinger, S., Westphal, P., Ackermann, M. R., Banerjee, D. & Usbeck, R., 02.11.2025, ISWC-C 2025, Industry, Doctoral Consortium, Posters and Demos at ISWC 2025: Joint Proceedings of Industry, Doctoral Consortium, Posters and Demos of the 24th International Semantic Web Conference (ISWC-C 2025), ISWC 2025 Companion Volume. Celino, I., Hassanzadeh, O., Bernstein, A., Noy, N., Cheng, G., Wang, S., Ferrada, S., Soulard, T., Kozaki, K., Takeda, H. & Gentile, A. L. (eds.). Aachen: Sun Site Central Europe (RWTH Aachen University), p. 435-440 6 p. D13. (CEUR Workshop Proceedings; vol. 4085).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Automating SPARQL Query Translations between DBpedia and Wikidata

Bartels, M. C., Banerjee, D. & Usbeck, R., 14.07.2025, Linking Meaning: Semantic Technologies Shaping the Future of AI: Cover 74617 Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Spahiu, B., Vahdati, S., Salatino, A., Pellegrini, T. & Havur, G. (eds.). IOS Press BV, p. 176-193 18 p. (Studies on the Semantic Web; vol. 62).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research

DOI

https://doi.org/10.1016/j.websem.2015.08.001
Final published version

DeFacto - Temporal and multilingual deep fact validation

Standard

Harvard

APA

Vancouver

Bibtex

RIS

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Analyzing the Influence of Knowledge Graph Information on Relation Extraction.

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

ASK-DBLP: Answering Questions over DBLP

Automating SPARQL Query Translations between DBpedia and Wikidata

Links

DOI

Recently viewed

Projects

Activities

Publications