{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,17]],"date-time":"2026-06-17T19:46:14Z","timestamp":1781725574612,"version":"3.54.5"},"reference-count":23,"publisher":"Emerald","issue":"1","license":[{"start":{"date-parts":[[2019,2,11]],"date-time":"2019-02-11T00:00:00Z","timestamp":1549843200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["OIR"],"published-print":{"date-parts":[[2019,2,11]]},"abstract":"<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title>\n<jats:p>As user-generated content (UGC) is entering the news cycle alongside content captured by news professionals, it is important to detect misleading content as early as possible and avoid disseminating it. The purpose of this paper is to present an annotated dataset of 380 user-generated videos (UGVs), 200 debunked and 180 verified, along with 5,195 near-duplicate reposted versions of them, and a set of automatic verification experiments aimed to serve as a baseline for future comparisons.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title>\n<jats:p>The dataset was formed using a systematic process combining text search and near-duplicate video retrieval, followed by manual annotation using a set of journalism-inspired guidelines. Following the formation of the dataset, the automatic verification step was carried out using machine learning over a set of well-established features.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Findings<\/jats:title>\n<jats:p>Analysis of the dataset shows distinctive patterns in the spread of verified vs debunked videos, and the application of state-of-the-art machine learning models shows that the dataset poses a particularly challenging problem to automatic methods.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Research limitations\/implications<\/jats:title>\n<jats:p>Practical limitations constrained the current collection to three platforms: YouTube, Facebook and Twitter. Furthermore, there exists a wealth of information that can be drawn from the dataset analysis, which goes beyond the constraints of a single paper. Extension to other platforms and further analysis will be the object of subsequent research.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Practical implications<\/jats:title>\n<jats:p>The dataset analysis indicates directions for future automatic video verification algorithms, and the dataset itself provides a challenging benchmark.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Social implications<\/jats:title>\n<jats:p>Having a carefully collected and labelled dataset of debunked and verified videos is an important resource both for developing effective disinformation-countering tools and for supporting media literacy activities.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title>\n<jats:p>Besides its importance as a unique benchmark for research in automatic verification, the analysis also allows a glimpse into the dissemination patterns of UGC, and possible telltale differences between fake and real content.<\/jats:p>\n<\/jats:sec>","DOI":"10.1108\/oir-03-2018-0101","type":"journal-article","created":{"date-parts":[[2018,11,12]],"date-time":"2018-11-12T10:05:26Z","timestamp":1542017126000},"page":"72-88","source":"Crossref","is-referenced-by-count":63,"title":["A corpus of debunked and verified user-generated videos"],"prefix":"10.1108","volume":"43","author":[{"given":"Olga","family":"Papadopoulou","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7296-5942","authenticated-orcid":false,"given":"Markos","family":"Zampoglou","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Symeon","family":"Papadopoulos","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ioannis","family":"Kompatsiaris","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"140","reference":[{"issue":"1","key":"key2020092507150807200_ref001","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1007\/s13735-017-0143-x","article-title":"Detection and visualization of misleading content on Twitter","volume":"7","year":"2018","journal-title":"International Journal of Multimedia Information Retrieval"},{"key":"key2020092507150807200_ref002","article-title":"Verifying multimedia use at mediaeval 2015","year":"2015"},{"key":"key2020092507150807200_ref003","article-title":"Verifying multimedia use at mediaeval 2016","year":"2016"},{"issue":"7","key":"key2020092507150807200_ref004","doi-asserted-by":"crossref","first-page":"1905","DOI":"10.1007\/s00500-014-1373-y","article-title":"Fragile watermarking using Karhunen\u2013Lo\u00e8ve transform: the KLT-F approach","volume":"19","year":"2015","journal-title":"Soft Computing"},{"key":"key2020092507150807200_ref005","first-page":"675","article-title":"Information credibility on twitter","year":"2011"},{"issue":"2","key":"key2020092507150807200_ref006","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1037\/h0076540","article-title":"A computer readability formula designed for machine scoring","volume":"60","year":"1975","journal-title":"Journal of Applied Psychology"},{"issue":"10","key":"key2020092507150807200_ref007","first-page":"1197","article-title":"An effective SVD-based image tampering detection and self-recovery using active watermarking","volume":"29","year":"2014","journal-title":"Signal Processing: Image Communication"},{"issue":"10","key":"key2020092507150807200_ref008","doi-asserted-by":"crossref","first-page":"4729","DOI":"10.1109\/TIP.2016.2593583","article-title":"Behavior knowledge space-based fusion for copy\u2013move forgery detection","volume":"25","year":"2016","journal-title":"IEEE Transactions on Image Processing"},{"key":"key2020092507150807200_ref009","first-page":"729","article-title":"Faking sandy: characterizing and identifying fake images on twitter during hurricane sandy","year":"2013"},{"key":"key2020092507150807200_ref010","first-page":"1","article-title":"The quest to automate fact-checking","year":"2015"},{"issue":"3","key":"key2020092507150807200_ref011","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1080\/17512780802054538","article-title":"A clash of cultures: the integration of user-generated content within professional journalistic frameworks at British newspaper websites","volume":"2","year":"2008","journal-title":"Journalism Practice"},{"key":"key2020092507150807200_ref012","volume-title":"Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted Personnel","year":"1975"},{"key":"key2020092507150807200_ref013","first-page":"347","article-title":"Near-duplicate video retrieval with deep metric learning","year":"2017"},{"key":"key2020092507150807200_ref014","article-title":"False information on web and social media: a survey","year":"2018"},{"key":"key2020092507150807200_ref015","first-page":"6","article-title":"Web video verification using contextual cues","year":"2017"},{"key":"key2020092507150807200_ref016","first-page":"23","article-title":"The InVID plug-in: web video verification on the browser","year":"2017"},{"issue":"4","key":"key2020092507150807200_ref017","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3070644","article-title":"Rumor gauge: predicting the veracity of rumors on twitter","volume":"11","year":"2017","journal-title":"ACM Transactions on Knowledge Discovery from Data (TKDD)"},{"key":"key2020092507150807200_ref018","first-page":"651","article-title":"False rumors detection on Sina Weibo by propagation structures","year":"2015"},{"key":"key2020092507150807200_ref019","first-page":"53","article-title":"Visual memes in social media: tracking real-world news in YouTube videos","year":"2011"},{"issue":"4","key":"key2020092507150807200_ref020","doi-asserted-by":"crossref","first-page":"4801","DOI":"10.1007\/s11042-016-3795-2","article-title":"Large-scale evaluation of splicing localization algorithms for web images","volume":"76","year":"2017","journal-title":"Multimedia Tools and Applications"},{"issue":"11","key":"key2020092507150807200_ref021","doi-asserted-by":"crossref","first-page":"2499","DOI":"10.1109\/TIFS.2016.2585118","article-title":"Iterative copy-move forgery detection based on a new interest point detector","volume":"11","year":"2016","journal-title":"IEEE Transactions on Information Forensics and Security"},{"issue":"2","key":"key2020092507150807200_ref022","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3161603","article-title":"Detection and resolution of rumours in social media: a survey","volume":"51","year":"2018","journal-title":"ACM Computing Surveys"},{"key":"key2020092507150807200_ref023","first-page":"1589","article-title":"Rumor has it: identifying misinformation in microblogs","year":"2011"}],"container-title":["Online Information Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/OIR-03-2018-0101\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/OIR-03-2018-0101\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:42:27Z","timestamp":1753396947000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/oir\/article\/43\/1\/72-88\/314459"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,2,11]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,2,11]]}},"alternative-id":["10.1108\/OIR-03-2018-0101"],"URL":"https:\/\/doi.org\/10.1108\/oir-03-2018-0101","relation":{},"ISSN":["1468-4527"],"issn-type":[{"value":"1468-4527","type":"print"}],"subject":[],"published":{"date-parts":[[2019,2,11]]}}}