{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:31:25Z","timestamp":1772166685493,"version":"3.50.1"},"reference-count":54,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,8,11]],"date-time":"2020-08-11T00:00:00Z","timestamp":1597104000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,8,11]],"date-time":"2020-08-11T00:00:00Z","timestamp":1597104000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    In this work we develop a series of techniques and tools to determine and quantify the presence of bias and censorship in newspapers. These algorithms are tested analyzing the occurrence of keywords \u2018killed\u2019 and \u2018suicide\u2019 (\u2018\n                    <jats:italic>morti\u2019<\/jats:italic>\n                    \u2019, \u2018\n                    <jats:italic>suicidio<\/jats:italic>\n                    \u2019 in Italian) and their changes over time, gender and reported location on the complete online archives (42 million records) of the major US newspaper (\n                    <jats:italic>The New York Times<\/jats:italic>\n                    ) and the three major Italian ones (\n                    <jats:italic>Il Corriere della Sera, La Repubblica, La Stampa<\/jats:italic>\n                    ). 
Using these tools, since the Italian language distinguishes between the female and male cases, we find the presence of gender bias in all Italian newspapers, with reported single female deaths being about one-third of those involving single men. Analyzing the historical trends, we show evidence of censorship in Italian newspapers both during World War 1 and during the Italian Fascist regime. Censorship in all countries during World Wars and in Italy during the Fascist period is a historically ascertained fact, but so far there has been no estimate of the amount of censorship in newspaper reporting: in this work we estimate that about 75% of domestic deaths and suicides were not reported. This is also confirmed by statistical analysis of the distribution of the least significant digit of the number of reported deaths. We also find that the distribution function of the number of articles vs. the number of deaths reported in articles follows a power law, which is broken (with fewer articles being written) when reporting on few deaths occurring in foreign countries. The lack of articles is found to grow with geographical distance from the nation where the newspaper is being printed. Whereas the assessment of the truth of a single article or the debunking of what are now called \u2018fake news\u2019 requires specific fact-checking and becomes more difficult as time goes by, these methods can be used in historical analysis and to evaluate quantitatively the amount of bias and censorship present in other printed or online publications and can thus contribute to quantitatively assessing the freedom of the press in a given country. 
Furthermore, they can be applied in wider contexts such as the evaluation of bias toward specific ethnic groups or specific accidents.\n                  <\/jats:p>","DOI":"10.1186\/s40537-020-00338-1","type":"journal-article","created":{"date-parts":[[2020,8,11]],"date-time":"2020-08-11T10:06:05Z","timestamp":1597140365000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Large scale analysis of violent death count in daily newspapers to quantify bias and censorship"],"prefix":"10.1186","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6067-5104","authenticated-orcid":false,"given":"Marco","family":"Casolino","sequence":"first","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,8,11]]},"reference":[{"key":"338_CR1","doi-asserted-by":"crossref","unstructured":"Ari\u00e8s P, Ranum P. Western attitudes toward death: from the middle ages to the present. Western attitudes toward death. Johns Hopkins University Press; 1975. https:\/\/books.google.it\/books?id=3sZJN3wojesC.","DOI":"10.1353\/book.20658"},{"key":"338_CR2","first-page":"837","volume":"61","author":"B Combs","year":"1979","unstructured":"Combs B, Slovic P. Newspaper coverage of causes of death. J Q. 1979;61:837\u201349.","journal-title":"J Q"},{"key":"338_CR3","unstructured":"Altheide DL. Creating reality: How TV news distorts events. A SageMark edition. Sage Publications; 1977. https:\/\/books.google.it\/books?id=ml21yAEACAAJ."},{"issue":"3","key":"338_CR4","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1080\/14616700801997281","volume":"9","author":"F Hanusch","year":"2008","unstructured":"Hanusch F. Valuing those close to us. J Stud. 2008;9(3):341\u201356. 
https:\/\/doi.org\/10.1080\/14616700801997281.","journal-title":"J Stud"},{"issue":"1","key":"338_CR5","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1177\/0267323188003001005","volume":"3","author":"KJ Burdach","year":"1988","unstructured":"Burdach KJ. Reporting on deaths: the perspective coverage of accident news in a German tabloid. Eur J Commun. 1988;3(1):81\u20139. https:\/\/doi.org\/10.1177\/0267323188003001005.","journal-title":"Eur J Commun"},{"issue":"2","key":"338_CR6","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1214\/ss\/1177012580","volume":"4","author":"SM Stigler","year":"1989","unstructured":"Stigler SM. Francis Galton\u2019s account of the invention of correlation. Statist Sci. 1989;4(2):73\u20139. https:\/\/doi.org\/10.1214\/ss\/1177012580.","journal-title":"Statist Sci"},{"key":"338_CR7","volume-title":"Psycology of perception","author":"WN Dember","year":"1979","unstructured":"Dember WN, Warm JS. Psycology of perception. New York: Holt, Rinehart and Winston; 1979."},{"issue":"723","key":"338_CR8","first-page":"578","volume":"56","author":"K jen Tsang","year":"1984","unstructured":"jen Tsang K. News photos in time and newsweek. J Q. 1984;56(723):578\u201384.","journal-title":"J Q"},{"issue":"4","key":"338_CR9","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1080\/13576270802383840","volume":"13","author":"F Hanusch","year":"2008","unstructured":"Mortality. Graphic death in the news media: present or absent? 2008;13(4):301\u201317. https:\/\/doi.org\/10.1080\/13576270802383840.","journal-title":"Mortality"},{"key":"338_CR10","unstructured":"Michel JB, Shen YK, Aiden AP, Veres A, Gray MK, , et\u00a0al. Quantitative analysis of culture using millions of digitized books. Science. 2011;331(6014):176\u2013182. http:\/\/science.sciencemag.org\/content\/331\/6014\/176."},{"key":"338_CR11","doi-asserted-by":"publisher","unstructured":"Smith R, Antonova D, Lee DS. 
Adapting the tesseract open source OCR engine for multilingual OCR. In: Proceedings of the international workshop on multilingual OCR. MOCR \u201909. New York, NY, USA: Association for Computing Machinery; 2009. https:\/\/doi.org\/10.1145\/1577802.1577804.","DOI":"10.1145\/1577802.1577804"},{"key":"338_CR12","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-5459-2","volume-title":"The informational complexity of learning: perspectives on neural networks and generative grammar","author":"P Niyogi","year":"1998","unstructured":"Niyogi P. The informational complexity of learning: perspectives on neural networks and generative grammar. Berlin: Springer; 1998."},{"key":"338_CR13","doi-asserted-by":"crossref","unstructured":"Brun R, Rademakers F. ROOT\u2014an object oriented data analysis framework. Nuclear instruments and methods in physics research section A: accelerators, spectrometers, detectors and associated equipment. 1997;389(1):81 \u2013 86. New Computing Techniques in Physics Research V. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S016890029700048X.","DOI":"10.1016\/S0168-9002(97)00048-X"},{"key":"338_CR14","unstructured":"Smith DM. Storia di cento anni di vita italiana visti attraverso il Corriere della sera. Rizzoli; 1978."},{"key":"338_CR15","unstructured":"Melograni P. Il Corriere della sera (1919\u20131943). Cappelli Editore. 1965."},{"key":"338_CR16","unstructured":"Nicola\u00a0Tranfaglia ML Paolo\u00a0Murialdi. La stampa italiana nell\u2019et\u00e0 fascista. Editori Laterza. 1980."},{"key":"338_CR17","unstructured":"Murialdi P. Storia del giornalismo italiano. Il Mulino. 2014."},{"key":"338_CR18","unstructured":"James D\u00a0Ciment TR. The home front encyclopedia: United States, Britain, and Canada in World Wars I and II. vol. vol 1\u20133. 1st ed. ABC-CLIO; 2006."},{"key":"338_CR19","doi-asserted-by":"crossref","unstructured":"Rojcewicz SJ. War and suicide. Suicide and life-threatening behavior. 1(1):46\u201354. 
https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1111\/j.1943-278X.1971.tb00598.x.","DOI":"10.1111\/j.1943-278X.1971.tb00598.x"},{"key":"338_CR20","volume-title":"Il suicidio in Italia 1864\u20131962","author":"S Somogyi","year":"1967","unstructured":"Somogyi S. Il suicidio in Italia 1864\u20131962. Analisi statistica. Milano: Giuffr\u00e8 edit; 1967."},{"key":"338_CR21","doi-asserted-by":"crossref","unstructured":"O\u2019Malley P. SUICIDE AND WAR: A case study and theoretical appraisal. Br J Criminol. 1975;15(4):348\u2013359. http:\/\/www.jstor.org\/stable\/23636204.","DOI":"10.1093\/oxfordjournals.bjc.a046667"},{"key":"338_CR22","unstructured":"Felice RD. Mussolini e il fascismo. Mussolini il duce. Lo stato totalitario 1936\u20131940. vol. 5. Einaudi. 1996."},{"key":"338_CR23","unstructured":"Aquarone A. L\u2019organizzazione dello Stato totalitario. Einaudi. 1995."},{"issue":"22","key":"338_CR24","first-page":"45","volume":"1","author":"G Alessandri","year":"1971","unstructured":"Alessandri G. Il suicidio in Italia. Aggiornamenti sociali. 1971;1(22):45\u201356.","journal-title":"Aggiornamenti sociali."},{"key":"338_CR25","unstructured":"Sweeney MS. Secrets of victory: The office of censorship and the American Press and radio in World War II. 1st ed. The University of North Carolina Press; 2001."},{"key":"338_CR26","unstructured":"ISTAT. La percezione della sicurezza. Statistical Report. 2018. https:\/\/www.istat.it\/it\/archivio\/217502."},{"key":"338_CR27","unstructured":"Eurostat;. 2018. https:\/\/ec.europa.eu\/eurostat\/web\/health\/causes-death."},{"issue":"9727","key":"338_CR28","doi-asserted-by":"publisher","first-page":"1704","DOI":"10.1016\/S0140-6736(10)60517-X","volume":"375","author":"JK Rajaratnam","year":"2010","unstructured":"Rajaratnam JK, Marcus JR, Levin-Rector A, Chalupka AN, Wang H, Dwyer L, et al. Worldwide mortality in men and women aged 15\u201359 years from 1970 to 2010: a systematic analysis. Lancet. 2010;375(9727):1704\u201320. 
https:\/\/doi.org\/10.1016\/S0140-6736(10)60517-X.","journal-title":"Lancet"},{"key":"338_CR29","unstructured":"of\u00a0labor statistics USB. 2018. https:\/\/www.bls.gov\/iif\/oshwc\/cfoi\/cfch0015.pdf."},{"key":"338_CR30","unstructured":"James F. MINUIT Function minimization and error analysis: reference manual version 94.1. CERN-D-506. 1994."},{"key":"338_CR31","doi-asserted-by":"crossref","unstructured":"Adriani O, Barbarino GC, Bazilevskaya GA, Bellotti R, Boezio M, Bogomolov EA, et\u00a0al. The PAMELA Mission: Heralding a new era in precision cosmic ray physics. Physics Reports. 2014;544(4):323 \u2013 370. The PAMELA Mission: Heralding a new era in precision cosmic ray physics.","DOI":"10.1016\/j.physrep.2014.06.003"},{"key":"338_CR32","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1126\/science.1199172","volume":"332","author":"O Adriani","year":"2011","unstructured":"Adriani O, Barbarino GC, Bazilevskaya GA, Bellotti R, Boezio M, Bogomolov EA, et al. PAMELA measurements of cosmic-ray proton and helium spectra. Science. 2011;332:69.","journal-title":"Science"},{"key":"338_CR33","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1038\/nature07942","volume":"458","author":"O Adriani","year":"2009","unstructured":"Adriani O, Barbarino GC, Bazilevskaya GA, Bellotti R, Boezio M, Bogomolov EA, et al. An anomalous positron abundance in cosmic rays with energies 1.5\u2013100 GeV. Nature. 2009;458:607\u20139.","journal-title":"Nature"},{"key":"338_CR34","doi-asserted-by":"crossref","unstructured":"Bak P, Tang C. Earthquakes as a self-organized critical phenomenon. J Geophys Res Solid Earth;94(B11):15635\u201315637. https:\/\/agupubs.onlinelibrary.wiley.com\/doi\/abs\/10.1029\/JB094iB11p15635.","DOI":"10.1029\/JB094iB11p15635"},{"key":"338_CR35","unstructured":"Malamud BD, Morein G, Turcotte DL. Forest fires: an example of self-organized critical behavior. Science. 1998;281(5384):1840\u20132. 
http:\/\/science.sciencemag.org\/content\/281\/5384\/1840."},{"key":"338_CR36","doi-asserted-by":"publisher","first-page":"1326","DOI":"10.1016\/j.cnsns.2005.12.003","volume":"12","author":"L Telesca","year":"2007","unstructured":"Telesca L, Amatucci G, Lasaponara R, Lovallo M, Rodrigues MJ. Space time fractal properties of the forest-fire series in central Italy. Commun Nonlinear Sci Numer Simul. 2007;12:1326\u201333.","journal-title":"Commun Nonlinear Sci Numer Simul"},{"key":"338_CR37","unstructured":"Condit R, Ashton PS, Baker P, Bunyavejchewin S, Gunatilleke S, Gunatilleke N, et\u00a0al. Spatial patterns in the distribution of tropical tree species. Science. 2000;288(5470):1414\u20131418. http:\/\/science.sciencemag.org\/content\/288\/5470\/1414."},{"key":"338_CR38","doi-asserted-by":"publisher","first-page":"209","DOI":"10.1038\/nature06060","volume":"449","author":"TM Scanlon","year":"2007","unstructured":"Scanlon TM, Caylor KK, Levin SA, Rodriguez-Iturbe I. Positive feedbacks promote power-law clustering of Kalahari vegetation. Nature. 2007;449:209. https:\/\/doi.org\/10.1038\/nature06060.","journal-title":"Nature"},{"key":"338_CR39","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1111\/j.1365-246X.2005.02717.x","volume":"163","author":"G Yakovlev","year":"2005","unstructured":"Yakovlev G, Newman WI, Turcotte DL, Gabrielov A. An inverse cascade model for self-organized complexity and natural hazards. Geophys J Int. 2005;163:433\u201342.","journal-title":"Geophys J Int"},{"issue":"4","key":"338_CR40","doi-asserted-by":"publisher","first-page":"661","DOI":"10.1137\/070710111","volume":"51","author":"A Clauset","year":"2009","unstructured":"Clauset A, Shalizi CR, Newman MEJ. Power-law distributions in empirical data. SIAM Rev. 2009;51(4):661\u2013703. 
https:\/\/doi.org\/10.1137\/070710111.","journal-title":"SIAM Rev"},{"issue":"5","key":"338_CR41","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1080\/00107510500052444","volume":"46","author":"M Newman","year":"2005","unstructured":"Newman M. Power laws, Pareto distributions and Zipf\u2019s law. Contemp Phys. 2005;46(5):323\u201351. https:\/\/doi.org\/10.1080\/00107510500052444.","journal-title":"Contemp Phys"},{"issue":"1","key":"338_CR42","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0147073","volume":"11","author":"I Moreno-Sanchez","year":"2016","unstructured":"Moreno-Sanchez I, Font-Clos F. Large-scale Corral A, analysis of Zipf\u2019s law in english texts. PLOS ONE. 2016;11(1):1\u201319. https:\/\/doi.org\/10.1371\/journal.pone.0147073.","journal-title":"PLOS ONE"},{"key":"338_CR43","doi-asserted-by":"crossref","unstructured":"Richardson LF. Variation of the frequency of fatal quarrels with magnitude. J Am Stat Assoc. 1948;43(244):523\u201346. https:\/\/www.tandfonline.com\/doi\/abs\/10.1080\/01621459.1948.10483278.","DOI":"10.1080\/01621459.1948.10483278"},{"issue":"1","key":"338_CR44","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1017\/S0003055403000571","volume":"97","author":"LE Cederman","year":"2003","unstructured":"Cederman LE. Modeling the size of wars: from billiard balls to sandpiles. Am Polit Sci Rev. 2003;97(1):135\u201350.","journal-title":"Am Polit Sci Rev"},{"key":"338_CR45","unstructured":"Lim M, Metzler R, Bar-Yam Y. Global pattern formation and ethnic\/cultural violence. Science. 2007;317(5844):1540\u20134. http:\/\/science.sciencemag.org\/content\/317\/5844\/1540."},{"key":"338_CR46","doi-asserted-by":"publisher","first-page":"207 EP","DOI":"10.1038\/nature03459","volume":"435","author":"AL Barab\u00e1si","year":"2005","unstructured":"Barab\u00e1si AL. The origin of bursts and heavy tails in human dynamics. Nature. 2005;435:207 EP. 
https:\/\/doi.org\/10.1038\/nature03459.","journal-title":"Nature"},{"key":"338_CR47","doi-asserted-by":"publisher","first-page":"911 EP","DOI":"10.1038\/nature08631","volume":"462","author":"JC Bohorquez","year":"2009","unstructured":"Bohorquez JC, Gourley S, Dixon AR, Spagat M, Johnson NF. Common ecology quantifies human insurgency. Nature. 2009;462:911 EP. https:\/\/doi.org\/10.1038\/nature08631.","journal-title":"Nature"},{"key":"338_CR48","doi-asserted-by":"publisher","first-page":"39","DOI":"10.2307\/2369148","volume":"4","author":"S Newcomb","year":"1881","unstructured":"Newcomb S. Note on the frequency of use of the different digits in natural numbers. Am J Math. 1881;4:39\u201340.","journal-title":"Am J Math"},{"key":"338_CR49","unstructured":"Benford F. The law of anomalous numbers. Proc Am Philos Soc. 1938;78(4):551\u2013572. http:\/\/www.jstor.org\/stable\/984802."},{"key":"338_CR50","doi-asserted-by":"publisher","first-page":"34917 EP","DOI":"10.1038\/srep34917","volume":"6","author":"M Morzy","year":"2016","unstructured":"Morzy M, Kajdanowicz T, Benford\u2019s Szymanski BK. Distribution in complex networks. Sci Rep. 2016;6:34917 EP. https:\/\/doi.org\/10.1038\/srep34917.","journal-title":"Sci Rep"},{"key":"338_CR51","first-page":"5","volume":"01","author":"C Durtschi","year":"2004","unstructured":"Durtschi C, Hillison W, Pacini C. The effective use of Benford\u2019s law to assist in detecting fraud in accounting data. J Forensic Acc. 2004;01:5.","journal-title":"J Forensic Acc"},{"key":"338_CR52","doi-asserted-by":"crossref","unstructured":"Jimenez R, Hidalgo M, Klimek P. Testing for voter rigging in small polling stations. Sci Adv. 2017;3(6). http:\/\/advances.sciencemag.org\/content\/3\/6\/e1602363.","DOI":"10.1126\/sciadv.1602363"},{"key":"338_CR53","unstructured":"Dunn HL. Vital Statistics of the United states. Government of the United States of America; 1945."},{"key":"338_CR54","unstructured":"OECD2018. 
Suicide rates (indicator), 10.1787\/a82f3459-en;. https:\/\/www.oecd-ilibrary.org\/social-issues-migration-health\/suicide-rates\/indicator\/english_a82f3459-en."}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00338-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-020-00338-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00338-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,11]],"date-time":"2024-08-11T13:04:06Z","timestamp":1723381446000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-020-00338-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,8,11]]},"references-count":54,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["338"],"URL":"https:\/\/doi.org\/10.1186\/s40537-020-00338-1","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-17213\/v1","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-17213\/v2","asserted-by":"object"}]},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,8,11]]},"assertion":[{"value":"10 March 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 July 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 August 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article 
History"}},{"value":"The author declares no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"60"}}