{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T15:30:24Z","timestamp":1759332624737},"reference-count":25,"publisher":"Springer Science and Business Media LLC","issue":"S16","license":[{"start":{"date-parts":[[2019,12,1]],"date-time":"2019-12-01T00:00:00Z","timestamp":1575158400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2019,12,2]],"date-time":"2019-12-02T00:00:00Z","timestamp":1575244800000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n<jats:title>Background<\/jats:title>\n<jats:p>Predicting disease-related genes is helpful for understanding the disease pathology and the molecular mechanisms during the disease progression. However, traditional methods are not suitable for screening genes related to the disease development, because there are some samples with weak label information in the disease dataset and a small number of genes are known disease-related genes.<\/jats:p>\n<\/jats:sec><jats:sec>\n<jats:title>Results<\/jats:title>\n<jats:p>We designed a disease-related gene mining method based on the weakly supervised learning model in this paper. The method is separated into two steps. Firstly, the differentially expressed genes are screened based on the weakly supervised learning model. In the model, the strong and weak label information at different stages of the disease progression is fully utilized. The obtained differentially expressed gene set is stable and complete after the algorithm converges. Then, we screen disease-related genes in the obtained differentially expressed gene set using transductive support vector machine based on the difference kernel function. The difference kernel function can map the input space of the original Huntington\u2019s disease gene expression dataset to the difference space. The relation between the two genes can be evaluated more accurately in the difference space and the known disease-related gene information can be used effectively.<\/jats:p>\n<\/jats:sec><jats:sec>\n<jats:title>Conclusions<\/jats:title>\n<jats:p>The experimental results show that the disease-related gene mining method based on the weakly supervised learning model can effectively improve the precision of the disease-related gene prediction compared with other excellent methods.<\/jats:p>\n<\/jats:sec>","DOI":"10.1186\/s12859-019-3078-9","type":"journal-article","created":{"date-parts":[[2019,12,2]],"date-time":"2019-12-02T07:00:35Z","timestamp":1575270035000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["A disease-related gene mining method based on weakly supervised learning model"],"prefix":"10.1186","volume":"20","author":[{"given":"Han","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Xueting","family":"Huo","sequence":"additional","affiliation":[]},{"given":"Xia","family":"Guo","sequence":"additional","affiliation":[]},{"given":"Xin","family":"Su","sequence":"additional","affiliation":[]},{"given":"Xiongwen","family":"Quan","sequence":"additional","affiliation":[]},{"given":"Chen","family":"Jin","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,12,2]]},"reference":[{"key":"3078_CR1","first-page":"266","volume":"211","author":"TM Barchet","year":"2009","unstructured":"Barchet TM, Amiji MM. Challenges and opportunities in cns delivery of therapeutics for neurodegenerative diseases. Expert Opin Drug Deliv. 2009; 211:266\u20137.","journal-title":"Expert Opin Drug Deliv"},{"key":"3078_CR2","doi-asserted-by":"publisher","first-page":"540","DOI":"10.1038\/nature.2015.16774","volume":"517","author":"R Sara","year":"2015","unstructured":"Sara R. Precision-medicine plan raises hopes. Nature. 2015; 517:540.","journal-title":"Nature"},{"key":"3078_CR3","doi-asserted-by":"publisher","first-page":"1084","DOI":"10.1093\/bioinformatics\/bti108","volume":"21","author":"YH Yang","year":"2005","unstructured":"Yang YH, Xiao Y, Segal MR. Identifying differentially expressed genes from microarray experiments via statistic synthesis. Bioinformatics. 2005; 21:1084\u201393.","journal-title":"Bioinformatics"},{"key":"3078_CR4","doi-asserted-by":"publisher","first-page":"1431","DOI":"10.1093\/bioinformatics\/btq163","volume":"26","author":"L Zhi-Qiang","year":"2010","unstructured":"Zhi-Qiang L, Yi W, Kenichi M, Takanori H, Takeshi T, Ming-Hua G, Li J, Wei-Min M, Hiroyuki S. Novel statistical framework to identify differentially expressed genes allowing transcriptomic background differences. Bioinformatics. 2010; 26:1431\u20136.","journal-title":"Bioinformatics"},{"key":"3078_CR5","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1186\/1471-2105-14-91","volume":"14","author":"C Soneson","year":"2013","unstructured":"Soneson C, Delorenzi M. A comparison of methods for differential expression analysis of rna-seq data. BMC Bioinformatics. 2013; 14:91.","journal-title":"BMC Bioinformatics"},{"key":"3078_CR6","doi-asserted-by":"publisher","first-page":"2825","DOI":"10.1093\/bioinformatics\/btl476","volume":"22","author":"F Hong","year":"2006","unstructured":"Hong F, Breitling R, Mcentee C, Wittner B, Nemhauser J, Chory J. Rankprod: a bioconductor package for detecting differentially expressed genes in meta-analysis. Bioinformatics. 2006; 22:2825\u20137.","journal-title":"Bioinformatics"},{"key":"3078_CR7","doi-asserted-by":"publisher","first-page":"374","DOI":"10.1093\/bioinformatics\/btm620","volume":"24","author":"H Fangxin","year":"2008","unstructured":"Fangxin H, Rainer B. A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments. Bioinformatics. 2008; 24:374.","journal-title":"Bioinformatics"},{"key":"3078_CR8","doi-asserted-by":"publisher","unstructured":"Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays. In: Encyclopedia of Genetics, Genomics, Proteomics and Informatics: 2008. p. 1755. https:\/\/doi.org\/10.1007\/978-1-4020-6754-9_15015.","DOI":"10.1007\/978-1-4020-6754-9_15015"},{"key":"3078_CR9","doi-asserted-by":"publisher","first-page":"971","DOI":"10.1109\/TCBB.2015.2478454","volume":"13","author":"A Chin","year":"2016","unstructured":"Chin A, Mirzal A, Haron H, Hamed H. Supervised, unsupervised and semi-supervised feature selection: A review on gene selection. IEEE\/ACM Trans Comput Biol Bioinforma. 2016; 13:971\u201389.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinforma"},{"key":"3078_CR10","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1007\/978-3-540-87481-2_21","volume-title":"Machine Learning and Knowledge Discovery in Databases","author":"Yvan Saeys","year":"2008","unstructured":"Saeys Y, Abeel T, Van de Peer Y. Robust feature selection using ensemble feature selection techniques. Mach Learn Knowl Discov Database. 2008:313\u201325. https:\/\/doi.org\/10.1007\/978-3-540-87481-2_21."},{"key":"3078_CR11","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1109\/TCBB.2010.44","volume":"8","author":"LK Luo","year":"2011","unstructured":"Luo LK, Huang DF, Ye LJ, Zhou QF, Shao GF, Peng H. Improving the computational efficiency of recursive cluster elimination for gene selection. IEEE\/ACM Trans Comput Biol Bioinforma. 2011; 8:122\u20139.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinforma"},{"key":"3078_CR12","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1109\/TNB.2009.2035284","volume":"9","author":"PA Mundra","year":"2010","unstructured":"Mundra PA, Rajapakse JC. Svm-rfe with mrmr filter for gene selection. IEEE Trans Nanobioscience. 2010; 9:31\u20137.","journal-title":"IEEE Trans Nanobioscience"},{"key":"3078_CR13","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1023\/A:1012487302797","volume":"46","author":"I Guyon","year":"2002","unstructured":"Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. 2002; 46:389\u2013422.","journal-title":"Mach Learn"},{"key":"3078_CR14","doi-asserted-by":"publisher","first-page":"660","DOI":"10.1109\/21.97458","volume":"21","author":"SR Safavian","year":"2002","unstructured":"Safavian SR, Landgrebe D. A survey of decision tree classifier methodology. IEEE Trans Syst Man Cybern. 2002; 21:660\u201374.","journal-title":"IEEE Trans Syst Man Cybern"},{"key":"3078_CR15","doi-asserted-by":"publisher","unstructured":"Sandri M, Zuccolotto P. Variable selection using random forests. In: Studies in Classification, Data Analysis, and Knowledge Organization: 2012. p. 263\u201370. https:\/\/doi.org\/10.1007\/3-540-35978-8_30.","DOI":"10.1007\/3-540-35978-8_30"},{"key":"3078_CR16","doi-asserted-by":"publisher","first-page":"605","DOI":"10.1109\/TCBB.2007.70257","volume":"6","author":"S Niijima","year":"2009","unstructured":"Niijima S, Okuno Y. Laplacian linear discriminant analysis approach to unsupervised feature selection. IEEE\/ACM Trans Comput Biol Bioinforma. 2009; 6:605\u2013614.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinforma"},{"key":"3078_CR17","doi-asserted-by":"publisher","first-page":"1374","DOI":"10.1109\/TCBB.2015.2415790","volume":"12","author":"B Liao","year":"2015","unstructured":"Liao B, Jiang Y, Liang W, L P, Peng L, Hanyurwimfura D, Li Z, Chen M. On efficient feature ranking methods for high-throughput data analysis. IEEE\/ACM Trans Comput Biol Bioinforma. 2015; 12:1374\u201384.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinforma"},{"key":"3078_CR18","doi-asserted-by":"publisher","unstructured":"Lu X, Dan C. Cancer classification through filtering progressive transductive support vector machine based on gene expression data. In: American Institute of Physics Conference Series: 2017. https:\/\/doi.org\/10.1063\/1.4992918.","DOI":"10.1063\/1.4992918"},{"key":"3078_CR19","unstructured":"Ke G, Meng Q, Finely T, Chen W, Ma W, Ye Q, Liu T-Y. Lightgbm: A highly efficient gradient boosting decision tree. In: 31st Conference on Neural Information Processing Systems (NIPS 2017). Long Beach: 2017. p. 3149\u20133157."},{"key":"3078_CR20","doi-asserted-by":"publisher","first-page":"623","DOI":"10.1038\/nn.4256","volume":"19","author":"P Langfelder","year":"2016","unstructured":"Langfelder P, Cantle JP, Chatzopoulou D, Wang N, Gao F, Al-Ramahi I, Lu XH, Ramos EM, El-Zein K, Zhao Y. Integrated genomics and proteomics define huntingtin cag length-dependent networks in mice. Nat Neurosci. 2016; 19:623\u201333.","journal-title":"Nat Neurosci"},{"key":"3078_CR21","doi-asserted-by":"publisher","unstructured":"Schuldt C, Laptev I, Caputo B. Recognizing human actions: A local svm approach. In: Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004: 2004. https:\/\/doi.org\/10.1109\/icpr.2004.1334462.","DOI":"10.1109\/icpr.2004.1334462"},{"key":"3078_CR22","doi-asserted-by":"publisher","unstructured":"Davis J, Goadrich M. The relationship between precision-recall and roc curves. In: Proceedings of the 23rd International Conference on Machine Learning. ACM: 2006. https:\/\/doi.org\/10.1145\/1143844.1143874.","DOI":"10.1145\/1143844.1143874"},{"key":"3078_CR23","doi-asserted-by":"publisher","first-page":"507","DOI":"10.1007\/BF02296192","volume":"66","author":"A Satorra","year":"2001","unstructured":"Satorra A, Bentler PM. A scaled difference chi-square test statistic for moment structure analysis. Psychometrika. 2001; 66:507\u201314.","journal-title":"Psychometrika"},{"key":"3078_CR24","unstructured":"Joachims T. Svm-light support vector machine, Developed at: University of Dortmund, Informatik, AIUnit Collaborative Research Center on \u2019Complexity Reduction in Multivariate Data\u2019 (SFB475). Psychometrika. 2008."},{"key":"3078_CR25","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1109\/TKDE.2007.190672","volume":"20","author":"F Wang","year":"2007","unstructured":"Wang F, Zhang C. Label propagation through linear neighborhoods. IEEE Trans Knowl Data Eng. 2007; 20:55\u201367.","journal-title":"IEEE Trans Knowl Data Eng"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3078-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-019-3078-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3078-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,30]],"date-time":"2020-11-30T19:20:37Z","timestamp":1606764037000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-019-3078-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12]]},"references-count":25,"journal-issue":{"issue":"S16","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["3078"],"URL":"https:\/\/doi.org\/10.1186\/s12859-019-3078-9","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12]]},"assertion":[{"value":"2 December 2019","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"589"}}