{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T14:51:19Z","timestamp":1774536679138,"version":"3.50.1"},"reference-count":15,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T00:00:00Z","timestamp":1772150400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T00:00:00Z","timestamp":1774483200000},"content-version":"vor","delay-in-days":27,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100009827","name":"Alexandria University","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100009827","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>In some GWAS studies, particularly those involving biobank data, linear regression is employed to obtain summary statistics on binary traits, while others report the log odds or odds ratios from the logistic regression of the genomic variants. However, some studies applied a transformation equation between logistic regression to linear regression.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Aim<\/jats:title>\n                    <jats:p>The current study aims to assess the performance of the Wald ratio using logistic regression, linear probability models (LPM), and transformation approaches in comparison with structural equation modelling (SEM) and Two Stage Predictor Substitution (TSPS), Two Stage Residual (TSRI) via simulation and real data analysis.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>Simulation data based on a bivariate Bernoulli distribution were applied within an instrumental variable framework to estimate empirical bias. Four sensitivity analysis scenarios were considered, varying the sample size, IV prevalence, exposure and outcome prevalence, and confounder effect. Additionally, real data from the Golden Retriever Lifetime Study were analyzed to estimate the potential causal effect of activity level on cancer risk.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>In the simulation data, for the positive effect size with a low confounder effect and a weak instrumental variable scenario, the median (Q1\u2013Q3) biases of the Wald ratio were as follows: under SEM, the bias for logistic regression was 0.77 (\u22120.20\u20131.74), for LPM it was 0.03 (\u22123.78\u20133.95), and for the transformation method it was 0.03 (\u22123.79\u20133.94). Under TSPS, the bias for logistic regression was 0.72 (\u22122.24\u20133.61), for LPM it was 0.00 (\u22120.04\u20130.05), and for the transformation method it was 0.00 (\u22120.01\u20130.01). Under TSRI, the bias for logistic regression was 0.75 (\u22122.47\u20133.87), for LPM it was 0.02 (\u22120.21\u20130.26), and for the transformation method it was 0.02 (\u22120.22\u20130.26). In the real data analysis, for the SNP Affx-205724246_A, the biases of the Wald ratio were as follows: under SEM, \u22120.14 for logistic regression, \u22121.30 for LPM, and \u22121.45 for the transformation method. Under TSPS and TSRI, the bias for logistic regression was 1.24, for LPM 0.08, and for the transformation method \u22120.07.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>The findings indicated that increasing the strength of the instrumental variable led to a reduction in bias, with the best performance observed at instrumental strengths of 0.5 and 0.7. The LPM and transformation approaches produced relatively lower bias in the TSPS framework when the confounder effect was below 0.1 and the prevalence of the outcome or the exposure is within 0.5 to 0.62. When the prevalence of the exposure and outcome ranged from 0.67 to 0.84 and the instrumental strength was high (0.7), the bias of the LPM and transformation methods were slightly higher but comparable to that observed when the prevalence ranged from 0.5 to 0.62. Furthermore, low prevalence of exposure or outcome (ranged from 0.12 to 0.23) produced larger bias than those observed at higher prevalence values (ranged from 0.67 to 0.84). In the real data analysis, the bias of the Wald ratio using LPM and transformation methods were lower under TSPS and TSRI and higher under the SEM method, while the bias of the Wald ratio using logistic regression was lower under SEM compared to the other methods.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-026-06388-1","type":"journal-article","created":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T05:04:02Z","timestamp":1772168642000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Transformation to estimate the causal effect in Mendelian randomization study with binary risk factor and outcome"],"prefix":"10.1186","volume":"27","author":[{"given":"Nesma","family":"Lotfy","sequence":"first","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2026,2,27]]},"reference":[{"issue":"1","key":"6388_CR1","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1038\/s43586-021-00092-5","volume":"2","author":"E Sanderson","year":"2022","unstructured":"Sanderson E, Glymour MM, Holmes MV, Kang H, Morrison J, Munaf\u00f2 MR, et al. Mendelian randomization. Nature Rev Methods Primers. 2022;2(1):6.","journal-title":"Nature Rev Methods Primers"},{"issue":"1","key":"6388_CR2","doi-asserted-by":"publisher","first-page":"a040501","DOI":"10.1101\/cshperspect.a040501","volume":"12","author":"RC Richmond","year":"2022","unstructured":"Richmond RC, Smith GD. Mendelian randomization: concepts and scope. Cold Spring Harb Perspect Med. 2022;12(1):a040501.","journal-title":"Cold Spring Harb Perspect Med"},{"key":"6388_CR3","doi-asserted-by":"crossref","unstructured":"Hartwig FP, Davies NM, Hemani G, Davey Smith G. Two-sample Mendelian randomization: avoiding the downsides of a powerful, widely applicable but potentially fallible technique. Oxford University Press; 2016. p. 1717\u201326.","DOI":"10.1093\/ije\/dyx028"},{"issue":"7","key":"6388_CR4","doi-asserted-by":"publisher","first-page":"658","DOI":"10.1002\/gepi.21758","volume":"37","author":"S Burgess","year":"2013","unstructured":"Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658\u201365.","journal-title":"Genet Epidemiol"},{"issue":"1","key":"6388_CR5","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1214\/ss\/1009211805","volume":"14","author":"S Greenland","year":"1999","unstructured":"Greenland S, Pearl J, Robins JM. Confounding and collapsibility in causal inference. Stat Sci. 1999;14(1):29\u201346.","journal-title":"Stat Sci"},{"issue":"5","key":"6388_CR6","doi-asserted-by":"publisher","first-page":"1925","DOI":"10.1177\/0962280213505804","volume":"25","author":"M Pang","year":"2016","unstructured":"Pang M, Kaufman JS, Platt RW. Studying noncollapsibility of the odds ratio with marginal structural and logistic regression models. Stat Methods Med Res. 2016;25(5):1925\u201337.","journal-title":"Stat Methods Med Res"},{"issue":"5","key":"6388_CR7","doi-asserted-by":"publisher","first-page":"549","DOI":"10.1002\/gepi.22387","volume":"45","author":"PH Allman","year":"2021","unstructured":"Allman PH, Aban I, Long DM, Bridges SL Jr, Srinivasasainagendra V, MacKenzie T, et al. A novel Mendelian randomization method with binary risk factor and outcome. Genet Epidemiol. 2021;45(5):549\u201360.","journal-title":"Genet Epidemiol"},{"issue":"7","key":"6388_CR8","doi-asserted-by":"publisher","first-page":"1148","DOI":"10.1111\/2041-210X.13600","volume":"12","author":"JB Grace","year":"2021","unstructured":"Grace JB. Instrumental variable methods in structural equation models. Methods Ecol Evol. 2021;12(7):1148\u201357.","journal-title":"Methods Ecol Evol"},{"key":"6388_CR9","first-page":"252","volume":"12","author":"AS Goldberger","year":"1964","unstructured":"Goldberger AS. Econometric Theory, John Willey and Sons. Inc, New York. 1964;12:252\u20133.","journal-title":"Inc, New York"},{"key":"6388_CR10","doi-asserted-by":"publisher","first-page":"100505","DOI":"10.1016\/j.jocm.2024.100505","volume":"52","author":"P Delle Site","year":"2024","unstructured":"Delle Site P, Parmar J. On the Linear Probability Model as binary choice random utility model. J Choice Modell. 2024;52:100505.","journal-title":"J Choice Modell"},{"issue":"10","key":"6388_CR11","doi-asserted-by":"publisher","first-page":"1875","DOI":"10.1016\/j.jpain.2023.05.012","volume":"24","author":"EE Elgaeva","year":"2023","unstructured":"Elgaeva EE, Williams FMK, Zaytseva OO, Freidin MB, Aulchenko YS, Suri P, et al. Bidirectional mendelian randomization study of personality traits reveals a positive feedback loop between neuroticism and back pain. J Pain. 2023;24(10):1875\u201385.","journal-title":"J Pain"},{"issue":"6","key":"6388_CR12","doi-asserted-by":"publisher","first-page":"e0269425","DOI":"10.1371\/journal.pone.0269425","volume":"17","author":"J Labadie","year":"2022","unstructured":"Labadie J, Swafford B, DePena M, Tietje K, Page R, Patterson-Kane J. Cohort profile: the golden retriever lifetime study (GRLS). PLoS ONE. 2022;17(6):e0269425.","journal-title":"PLoS ONE"},{"key":"6388_CR13","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1007\/978-1-62703-447-0_5","volume-title":"Genome-Wide Association Studies and Genomic Prediction","author":"C Gondro","year":"2013","unstructured":"Gondro C, Lee SH, Lee HK, Porto-Neto LR. Quality control for genome-wide association studies. In: Gondro C, van der Werf J, Hayes B, editors. Genome-Wide Association Studies and Genomic Prediction. Totowa, NJ: Humana Press; 2013. p. 129\u201347."},{"issue":"3","key":"6388_CR14","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1086\/519795","volume":"81","author":"S Purcell","year":"2007","unstructured":"Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559\u201375.","journal-title":"Am J Hum Genet"},{"key":"6388_CR15","doi-asserted-by":"publisher","first-page":"947","DOI":"10.1007\/s10654-018-0424-6","volume":"33","author":"S Burgess","year":"2018","unstructured":"Burgess S, Labrecque JA. Mendelian randomization with a binary exposure variable: interpretation and presentation of causal estimates. Eur J Epidemiol. 2018;33:947\u201352.","journal-title":"Eur J Epidemiol"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-026-06388-1","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-026-06388-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-026-06388-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T13:54:16Z","timestamp":1774533256000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1186\/s12859-026-06388-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,27]]},"references-count":15,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2026,12]]}},"alternative-id":["6388"],"URL":"https:\/\/doi.org\/10.1186\/s12859-026-06388-1","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,27]]},"assertion":[{"value":"29 June 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 January 2026","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 February 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"72"}}