{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T13:12:56Z","timestamp":1769087576770,"version":"3.49.0"},"reference-count":56,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2026,1,11]],"date-time":"2026-01-11T00:00:00Z","timestamp":1768089600000},"content-version":"vor","delay-in-days":10,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"publisher","award":["2023YFF1205101"],"award-info":[{"award-number":["2023YFF1205101"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,1,7]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Causal inference is an essential approach for understanding biological processes. Traditional causal inference methods assume a linear relationship between different biological traits, whereas their true causal relationship may be nonlinear, such as U-shaped. Moreover, when the instrument set includes weak and pleiotropic genetic instruments, accurately capturing the shape of these relationships becomes challenging. To address these issues, we propose model-averaged control function-based instrumental variable regression, a two-stage framework based on a model-averaged control function approach to estimate the marginal effect function, which represents the derivative of the causal relationship. In the first stage, a model averaging technique is employed to estimate the control function, thereby reducing weak genetic instrument bias. In the second stage, B-spline approximation is applied to estimate the marginal effect function, while SCAD penalization is used to minimize pleiotropic instrument bias. We establish the asymptotic properties of the proposed estimator and demonstrate its robust performance through simulations. Application to the Atherosclerosis Risk in Communities dataset highlights a nonlinear causal relationship between body mass index and hypertension, with the proposed method effectively estimating the specific shape and trend of the relationship.<\/jats:p>","DOI":"10.1093\/bib\/bbaf714","type":"journal-article","created":{"date-parts":[[2025,12,23]],"date-time":"2025-12-23T12:51:52Z","timestamp":1766494312000},"source":"Crossref","is-referenced-by-count":0,"title":["MACFIV: a novel framework for nonlinear causal inference in the body mass index\u2013hypertension relationship with many weak and pleiotropic genetic instruments"],"prefix":"10.1093","volume":"27","author":[{"given":"Dong","family":"Chen","sequence":"first","affiliation":[{"name":"State Key Laboratory of Genetics and Development of Complex Phenotypes, Institute of Biostatistics, School of Life Sciences, Fudan University , 2005 Songhu Road, Yangpu District, Shanghai 200438 ,","place":["China"]}]},{"given":"Yuquan","family":"Wang","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Genetics and Development of Complex Phenotypes, Institute of Biostatistics, School of Life Sciences, Fudan University , 2005 Songhu Road, Yangpu District, Shanghai 200438 ,","place":["China"]}]},{"given":"Dapeng","family":"Shi","sequence":"additional","affiliation":[{"name":"Shanghai Center for Mathematical Sciences, Fudan University , 2005 Songhu Road, Yangpu District, Shanghai 200438 ,","place":["China"]}]},{"given":"Yunlong","family":"Cao","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Genetics and Development of Complex Phenotypes, Institute of Biostatistics, School of Life Sciences, Fudan University , 2005 Songhu Road, Yangpu District, Shanghai 200438 ,","place":["China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5730-0317","authenticated-orcid":false,"given":"Yue-Qing","family":"Hu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Genetics and Development of Complex Phenotypes, Institute of Biostatistics, School of Life Sciences, Fudan University , 2005 Songhu Road, Yangpu District, Shanghai 200438 ,","place":["China"]},{"name":"Shanghai Center for Mathematical Sciences, Fudan University , 2005 Songhu Road, Yangpu District, Shanghai 200438 ,","place":["China"]}]}],"member":"286","published-online":{"date-parts":[[2026,1,11]]},"reference":[{"key":"2026011104410896300_ref1","doi-asserted-by":"publisher","first-page":"1673","DOI":"10.1111\/j.1468-0262.2005.00632.x","article-title":"Consistent estimation with a large number of weak instruments","volume":"73","author":"Chao","year":"2005","journal-title":"Econometrica"},{"key":"2026011104410896300_ref2","doi-asserted-by":"publisher","first-page":"398","DOI":"10.1198\/073500108000000024","article-title":"Estimation with many instrumental variables","volume":"26","author":"Hansen","year":"2008","journal-title":"J Bus Econ Stat"},{"key":"2026011104410896300_ref3","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1007\/s40484-017-0124-3","article-title":"On the use of kernel machines for mendelian randomization","volume":"5","author":"Zhang","year":"2017","journal-title":"Quant Biol"},{"key":"2026011104410896300_ref4","doi-asserted-by":"publisher","first-page":"100019","DOI":"10.1016\/j.xhgg.2020.100019","article-title":"Transcriptome prediction performance across machine learning models and diverse ancestries","volume":"2","author":"Okoro","year":"2021","journal-title":"Hum Genet Genomics Adv"},{"key":"2026011104410896300_ref5","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1016\/j.jhealeco.2007.09.009","article-title":"Two-stage residual inclusion estimation: addressing endogeneity in health econometric modeling","volume":"27","author":"Terza","year":"2008","journal-title":"J Health Econ"},{"key":"2026011104410896300_ref6","doi-asserted-by":"publisher","first-page":"877","DOI":"10.1097\/EDE.0000000000000161","article-title":"Instrumental variable analysis with a nonlinear exposure\u2013outcome relationship","volume":"25","author":"Burgess","year":"2014","journal-title":"Epidemiology"},{"key":"2026011104410896300_ref7","first-page":"793","article-title":"Inference of nonlinear causal effects with application to TWAS with GWAS summary data","volume-title":"Proceedings of the Third Conference on Causal Learning and Reasoning","author":"Dai","year":"2024"},{"key":"2026011104410896300_ref8","article-title":"Inference for nonlinear endogenous treatment effects accounting for high-dimensional covariate complexity","author":"Fan","year":"2024"},{"key":"2026011104410896300_ref9","doi-asserted-by":"publisher","first-page":"454","DOI":"10.1002\/sim.6358","article-title":"The many weak instruments problem and mendelian randomization","volume":"34","author":"Davies","year":"2015","journal-title":"Stat Med"},{"key":"2026011104410896300_ref10","doi-asserted-by":"publisher","first-page":"793","DOI":"10.1111\/rssb.12275","article-title":"Confidence intervals for causal effects with invalid instruments by using two-stage hard thresholding with voting","volume":"80","author":"Guo","year":"2018","journal-title":"J. R. Stat. Soc. B"},{"key":"2026011104410896300_ref11","article-title":"Causal inference for nonlinear outcome models with possibly invalid instrumental variables","author":"Li","year":"2022"},{"key":"2026011104410896300_ref12","article-title":"Endogenous treatment effect estimation with some invalid and irrelevant instruments","author":"Fan","year":"2020"},{"key":"2026011104410896300_ref13","doi-asserted-by":"publisher","first-page":"1655","DOI":"10.1162\/rest_a_01230","article-title":"Endogenous treatment effect estimation with a large and mixed set of instruments and control variables","volume":"106","author":"Fan","year":"2024","journal-title":"Rev Econ Stat"},{"key":"2026011104410896300_ref14","doi-asserted-by":"publisher","first-page":"815","DOI":"10.1080\/07350015.2020.1870479","article-title":"Structural equation model averaging: methodology and application","volume":"40","author":"Seng","year":"2022","journal-title":"J Bus Econ Stat"},{"key":"2026011104410896300_ref15","doi-asserted-by":"publisher","first-page":"3547","DOI":"10.1002\/sim.9819","article-title":"Instrumental variable model average with applications in mendelian randomization","volume":"42","author":"Seng","year":"2023","journal-title":"Stat Med"},{"key":"2026011104410896300_ref16","doi-asserted-by":"publisher","first-page":"905","DOI":"10.1080\/10485252.2023.2215339","article-title":"Nonparametric instrument model averaging","volume":"35","author":"Chen","year":"2023","journal-title":"J Nonparamet Stat"},{"key":"2026011104410896300_ref17","doi-asserted-by":"publisher","first-page":"879","DOI":"10.1198\/016214503000000828","article-title":"Frequentist model average estimators","volume":"98","author":"Hjort","year":"2003","journal-title":"J Am Stat Assoc"},{"key":"2026011104410896300_ref18","doi-asserted-by":"publisher","first-page":"1175","DOI":"10.1111\/j.1468-0262.2007.00785.x","article-title":"Least squares model averaging","volume":"75","author":"Hansen","year":"2007","journal-title":"Econometrica."},{"key":"2026011104410896300_ref19","doi-asserted-by":"publisher","first-page":"1053","DOI":"10.1198\/jasa.2011.tm09478","article-title":"Optimal weight choice for frequentist model average estimators","volume":"106","author":"Liang","year":"2011","journal-title":"J Am Stat Assoc"},{"key":"2026011104410896300_ref20","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1016\/j.jeconom.2011.06.019","article-title":"Jackknife model averaging","volume":"167","author":"Hansen","year":"2012","journal-title":"J Econ"},{"key":"2026011104410896300_ref21","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1080\/01621459.2013.838168","article-title":"A model-averaging approach for high-dimensional regression","volume":"109","author":"Ando","year":"2014","journal-title":"J Am Stat Assoc"},{"key":"2026011104410896300_ref22","doi-asserted-by":"publisher","first-page":"1583","DOI":"10.5705\/ss.2013.326","article-title":"Model averaging based on Kullback-Leibler distance","volume":"25","author":"Zhang","year":"2015","journal-title":"Stat Sin"},{"key":"2026011104410896300_ref23","doi-asserted-by":"publisher","first-page":"106902","DOI":"10.1016\/j.csda.2019.106902","article-title":"Corrected mallows criterion for model averaging","volume":"144","author":"Liao","year":"2020","journal-title":"Comput Stat Data Anal"},{"key":"2026011104410896300_ref24","doi-asserted-by":"publisher","first-page":"697","DOI":"10.3982\/ECTA7444","article-title":"Constructing optimal instruments by first-stage prediction averaging","volume":"78","author":"Kuersteiner","year":"2010","journal-title":"Econometrica"},{"key":"2026011104410896300_ref25","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1177\/0962280210394459","article-title":"Using multiple genetic variants as instrumental variables for modifiable risk factors","volume":"21","author":"Palmer","year":"2012","journal-title":"Stat Methods Med Res"},{"key":"2026011104410896300_ref26","doi-asserted-by":"publisher","first-page":"1134","DOI":"10.1093\/ije\/dyt093","article-title":"Use of allele scores as instrumental variables for mendelian randomization","volume":"42","author":"Burgess","year":"2013","journal-title":"Int J Epidemiol"},{"key":"2026011104410896300_ref27","doi-asserted-by":"publisher","first-page":"474","DOI":"10.1080\/07350015.2014.978175","article-title":"Identification and inference with many invalid instruments","volume":"33","author":"Koles\u00e1r","year":"2015","journal-title":"J"},{"key":"2026011104410896300_ref28","doi-asserted-by":"publisher","first-page":"857","DOI":"10.1017\/S0266466612000783","article-title":"Adaptive GMM shrinkage estimation with consistent moment selection","volume":"29","author":"Liao","year":"2013","journal-title":"Economet Theor"},{"key":"2026011104410896300_ref29","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1016\/j.jeconom.2015.02.019","article-title":"Select the valid and relevant moments: an information-based LASSO for GMM with many moments","volume":"186","author":"Cheng","year":"2015","journal-title":"Journal of Econometrics"},{"key":"2026011104410896300_ref30","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1080\/07350015.2015.1129344","article-title":"Adaptive elastic net GMM estimation with many invalid moment conditions: Simultaneous model and moment selection","volume":"36","author":"Caner","year":"2018","journal-title":"J Bus Econ Stat"},{"key":"2026011104410896300_ref31","doi-asserted-by":"publisher","first-page":"132","DOI":"10.1080\/01621459.2014.994705","article-title":"Instrumental variables estimation with some invalid instruments and its application to mendelian randomization","volume":"111","author":"Kang","year":"2016","journal-title":"J Am Stat Assoc"},{"key":"2026011104410896300_ref32","doi-asserted-by":"publisher","first-page":"1339","DOI":"10.1080\/01621459.2018.1498346","article-title":"On the use of the lasso for instrumental variables estimation with some invalid instruments","volume":"114","author":"Windmeijer","year":"2019","journal-title":"J Am Stat Assoc"},{"key":"2026011104410896300_ref33","doi-asserted-by":"publisher","first-page":"752","DOI":"10.1111\/rssb.12449","article-title":"The confidence interval method for selecting valid instrumental variables","volume":"83","author":"Windmeijer","year":"2021","journal-title":"J R Stat Soc Series B"},{"key":"2026011104410896300_ref34","doi-asserted-by":"publisher","first-page":"1068","DOI":"10.1093\/jrsssb\/qkae025","article-title":"On the instrumental variable estimation with many weak and invalid instruments","volume":"86","author":"Lin","year":"2024","journal-title":"J R Stat Soc Series B"},{"key":"2026011104410896300_ref35","article-title":"Two stage curvature identification with machine learning: causal inference with possibly invalid instrumental variables","author":"Guo","year":"2022"},{"key":"2026011104410896300_ref36","first-page":"1414","article-title":"Deep IV: a flexible approach for counterfactual prediction","volume-title":"International Conference on Machine Learning","author":"Hartford","year":"2017"},{"key":"2026011104410896300_ref37","doi-asserted-by":"publisher","first-page":"468","DOI":"10.1093\/biostatistics\/kxac051","article-title":"DeLIVR: A deep learning approach to IV regression for testing nonlinear causal effects in transcriptome-wide association studies","volume":"25","author":"He","year":"2024","journal-title":"Biostatistics"},{"key":"2026011104410896300_ref38","doi-asserted-by":"publisher","first-page":"1344","DOI":"10.1016\/j.ajhg.2025.04.010","article-title":"A flexible machine learning mendelian randomization estimator applied to predict the safety and efficacy of sclerostin inhibition","volume":"112","author":"Legault","year":"2025","journal-title":"Am J Hum Genet"},{"key":"2026011104410896300_ref39","first-page":"1","article-title":"Control function instrumental variable estimation of nonlinear causal effect models","volume":"17","author":"Guo","year":"2016","journal-title":"J Mach Learn Res"},{"key":"2026011104410896300_ref40","article-title":"Non-linear mendelian randomization with two-stage prediction estimation and control function estimation","author":"Wang","year":"2024"},{"key":"2026011104410896300_ref41","doi-asserted-by":"publisher","first-page":"162","DOI":"10.1093\/restud\/rdae025","article-title":"Adaptive estimation and uniform confidence bands for nonparametric structural functions and elasticities","volume":"92","author":"Chen","year":"2024","journal-title":"Rev Econ Stud"},{"key":"2026011104410896300_ref42","doi-asserted-by":"publisher","first-page":"100124","DOI":"10.1016\/j.xhgg.2022.100124","article-title":"Polynomial mendelian randomization reveals non-linear causal effects for obesity-related traits","volume":"3","author":"Sulc","year":"2022","journal-title":"Hum Genet Genomics Adv"},{"key":"2026011104410896300_ref43","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1111\/1468-0262.00037","article-title":"Nonparametric estimation of triangular simultaneous equations models","volume":"67","author":"Newey","year":"1999","journal-title":"Econometrica"},{"key":"2026011104410896300_ref44","doi-asserted-by":"publisher","first-page":"495","DOI":"10.3982\/QE332","article-title":"Model averaging, asymptotic risk, and regressor groups","volume":"5","author":"Hansen","year":"2014","journal-title":"Quant Econ"},{"key":"2026011104410896300_ref45","doi-asserted-by":"publisher","first-page":"1348","DOI":"10.1198\/016214501753382273","article-title":"Variable selection via nonconcave penalized likelihood and its oracle properties","volume":"96","author":"Fan","year":"2001","journal-title":"J Am Stat Assoc"},{"key":"2026011104410896300_ref46","doi-asserted-by":"publisher","first-page":"2084","DOI":"10.1080\/01621459.2019.1689984","article-title":"Kernel meets sieve: post-regularization confidence bands for sparse additive model","volume":"115","author":"Lu","year":"2020","journal-title":"J Am Stat Assoc"},{"key":"2026011104410896300_ref47","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1016\/S0091-7435(02)00017-8","article-title":"The association between high blood pressure, physical fitness, and body mass index in adolescents","volume":"36","author":"Nielsen","year":"2003","journal-title":"Prev Med"},{"key":"2026011104410896300_ref48","first-page":"126","article-title":"The relation of body mass index and blood pressure in Iranian children and adolescents aged 7\u201318 years old","volume":"39","author":"Hosseini","year":"2010","journal-title":"Iran J Public Health"},{"key":"2026011104410896300_ref49","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1002\/gepi.22041","article-title":"Semiparametric methods for estimation of a nonlinear exposure-outcome relationship using instrumental variables with application to mendelian randomization","volume":"41","author":"Staley","year":"2017","journal-title":"Genet Epidemiol"},{"key":"2026011104410896300_ref50","doi-asserted-by":"publisher","first-page":"5814","DOI":"10.1002\/sim.10269","article-title":"Instrumental variable model average with applications in nonlinear causal inference","volume":"43","author":"Chen","year":"2024","journal-title":"Stat Med"},{"key":"2026011104410896300_ref51","doi-asserted-by":"publisher","first-page":"512","DOI":"10.1093\/ije\/dyv080","article-title":"Mendelian randomization with invalid instruments: effect estimation and bias detection through egger regression","volume":"44","author":"Bowden","year":"2015","journal-title":"Int J Epidemiol"},{"key":"2026011104410896300_ref52","doi-asserted-by":"publisher","first-page":"304","DOI":"10.1002\/gepi.21965","article-title":"Consistent estimation in mendelian randomization with some invalid instruments using a weighted median estimator","volume":"40","author":"Bowden","year":"2016","journal-title":"Genet Epidemiol"},{"key":"2026011104410896300_ref53","doi-asserted-by":"publisher","first-page":"693","DOI":"10.1038\/s41588-018-0099-7","article-title":"Detection of widespread horizontal pleiotropy in causal relationships inferred from mendelian randomization between complex traits and diseases","volume":"50","author":"Verbanck","year":"2018","journal-title":"Nat Genet"},{"key":"2026011104410896300_ref54","doi-asserted-by":"publisher","first-page":"740","DOI":"10.1038\/s41588-020-0631-4","article-title":"Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics","volume":"52","author":"Morrison","year":"2020","journal-title":"Nat Genet"},{"key":"2026011104410896300_ref55","doi-asserted-by":"publisher","first-page":"602","DOI":"10.1161\/01.HYP.0000158261.86674.8e","article-title":"Impact of obesity on 24-hour ambulatory blood pressure and hypertension","volume":"45","author":"Kotsis","year":"2005","journal-title":"Hypertension"},{"key":"2026011104410896300_ref56","doi-asserted-by":"publisher","first-page":"991","DOI":"10.1161\/CIRCRESAHA.116.305697","article-title":"Obesity-induced hypertension: interaction of neurohumoral and renal mechanisms","volume":"116","author":"Hall","year":"2015","journal-title":"Circ Res"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/27\/1\/bbaf714\/66342085\/bbaf714.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/27\/1\/bbaf714\/66342085\/bbaf714.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,11]],"date-time":"2026-01-11T09:41:16Z","timestamp":1768124476000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf714\/8419941"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1]]},"references-count":56,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,1,7]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf714","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,1]]},"published":{"date-parts":[[2026,1]]},"article-number":"bbaf714"}}