{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T10:22:53Z","timestamp":1771064573100,"version":"3.50.1"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,7,18]],"date-time":"2022-07-18T00:00:00Z","timestamp":1658102400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,7,18]],"date-time":"2022-07-18T00:00:00Z","timestamp":1658102400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"foundation of Yun'nan Educational Committee","award":["2022J0635"],"award-info":[{"award-number":["2022J0635"]}]},{"DOI":"10.13039\/501100001809","name":"national natural science foundation of china","doi-asserted-by":"publisher","award":["61973180"],"award-info":[{"award-number":["61973180"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100007129","name":"natural science foundation of shandong province","doi-asserted-by":"publisher","award":["ZR2019MF033"],"award-info":[{"award-number":["ZR2019MF033"]}],"id":[{"id":"10.13039\/501100007129","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100007129","name":"natural science foundation of shandong province","doi-asserted-by":"publisher","award":["ZR2021MF092"],"award-info":[{"award-number":["ZR2021MF092"]}],"id":[{"id":"10.13039\/501100007129","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015286","name":"Hebei Provincial Key Research Projects","doi-asserted-by":"publisher","award":["2021RKY02037"],"award-info":[{"award-number":["2021RKY02037"]}],"id":[{"id":"10.13039\/501100015286","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cloud Comp"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Web API is a popular way to organize network services in cloud computing environment. However, it is a challenge to find an appropriate service for the requestor from massive Web API services. Service clustering can improve the efficiency of service discovery for its ability of reducing search space. Latent Dirichlet Allocation (LDA) is the most frequently used topic model in service clustering. To further improve the topic representation ability of LDA, we propose a new variant model of LDA with probability incremental correction factor (PICF-LDA) to generate the high-quality service representation vectors (SRVs) for Web API services. We first compute the words\u2019 topic contribution degree (TCD) in the service description text by its context weight and part-of-speech (POS) weight. Then the probability incremental correction factor (PICF) for a word is designed based on TCD and the word\u2019s maximum topic probability value. PICF is used to correct the probability distributions in SRVs. Experiments show that PICF-LDA has a better performance than LDA, the variant LDA models and other state-of-the-art topic models in service clustering.<\/jats:p>","DOI":"10.1186\/s13677-022-00291-9","type":"journal-article","created":{"date-parts":[[2022,7,18]],"date-time":"2022-07-18T09:02:49Z","timestamp":1658134969000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["PICF-LDA: a topic enhanced LDA with probability incremental correction factor for Web API service clustering"],"prefix":"10.1186","volume":"11","author":[{"given":"Jiaji","family":"Shen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wen","family":"Huang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7642-5660","authenticated-orcid":false,"given":"Qiang","family":"Hu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,7,18]]},"reference":[{"issue":"2","key":"291_CR1","first-page":"421","volume":"7","author":"A Rashid","year":"2019","unstructured":"Rashid A, Chaturvedi A (2019) Cloud computing characteristics and services: a brief review. Int J Comput Sci Eng 7(2):421\u2013426","journal-title":"Int J Comput Sci Eng"},{"key":"291_CR2","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1186\/s13677-021-00264-4","volume":"10","author":"Z Movahedi","year":"2021","unstructured":"Movahedi Z, Defude B (2021) An efficient population-based multi-objective task scheduling approach in fog computing systems. J Cloud Comput 10:53","journal-title":"J Cloud Comput"},{"key":"291_CR3","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1186\/s13677-021-00251-9","volume":"10","author":"J Silva","year":"2021","unstructured":"Silva J, Marques ER, Lopes L, Silva F (2021) Energy-aware adaptive offloading of soft real-time jobs in mobile edge clouds. J Cloud Comput 10:51","journal-title":"J Cloud Comput"},{"key":"291_CR4","doi-asserted-by":"publisher","DOI":"10.1109\/TBDATA.2020.2975587(earlyaccess)","author":"L Qi","year":"2020","unstructured":"Qi L, He Q, Chen F, Zhang X, Dou W, Ni Q (2020) Data-driven web APIs recommendation for building web applications. IEEE Trans Big Data. https:\/\/doi.org\/10.1109\/TBDATA.2020.2975587(earlyaccess)","journal-title":"IEEE Trans Big Data"},{"key":"291_CR5","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1186\/s13677-021-00251-9","volume":"10","author":"A Tsagkaropoulos","year":"2021","unstructured":"Tsagkaropoulos A, Verginadis Y, Papageorgiou N, Paraskevopoulos F, Apostolou D, Mentzas G (2021) Severity: a QoS-aware approach to cloud application elasticity. J Cloud Comput 10:38","journal-title":"J Cloud Comput"},{"issue":"1","key":"291_CR6","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1109\/TSC.2017.2686390","volume":"13","author":"B Cao","year":"2017","unstructured":"Cao B, Liu XF, Rahman MM, Li B, Liu J, Tang M (2017) Integrated content and network-based service clustering and web apis recommendation for mashup development. IEEE Trans Serv Comput 13(1):99\u2013113","journal-title":"IEEE Trans Serv Comput"},{"issue":"1","key":"291_CR7","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1007\/s10115-013-0623-0","volume":"38","author":"J Wu","year":"2014","unstructured":"Wu J, Chen L, Zheng Z, Lyu MR, Wu Z (2014) Clustering web services to facilitate service discovery. Knowl Inf Syst 38(1):207\u2013229","journal-title":"Knowl Inf Syst"},{"issue":"3","key":"291_CR8","doi-asserted-by":"publisher","first-page":"669","DOI":"10.1007\/s10115-018-1171-4","volume":"58","author":"N Zhang","year":"2019","unstructured":"Zhang N, Wang J, He K, Li Z, Huang Y (2019) Mining and clustering service goals for RESTful service discovery. Knowl Inf Syst 58(3):669\u2013700","journal-title":"Knowl Inf Syst"},{"key":"291_CR9","first-page":"23","volume-title":"The International Conference on Recent Innovations in Computing","author":"N Agarwal","year":"2020","unstructured":"Agarwal N, Sikka G, Awasthi LK (2020) Web service clustering approaches to enhance service discovery: a review. The International Conference on Recent Innovations in Computing. Springer, Singapore, pp 23\u201335"},{"key":"291_CR10","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1016\/j.jpdc.2018.04.002","volume":"132","author":"B Cao","year":"2019","unstructured":"Cao B, Liu J, Wen Y, Li H, Xiao Q, Chen J (2019) QoS-aware service recommendation based on relational topic model and factorization machines for IoT Mashup applications. J Parallel Distrib Comput 132:177\u2013189","journal-title":"J Parallel Distrib Comput"},{"key":"291_CR11","doi-asserted-by":"crossref","unstructured":"Yang D, He D (2021) Web service clustering method based on word vector and biterm topic model. 2021 IEEE 6th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA). IEEE,\u00a0Surabaya, pp 299\u2013304","DOI":"10.1109\/ICCCBDA51879.2021.9442496"},{"key":"291_CR12","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1016\/j.future.2019.02.063","volume":"97","author":"Y Jiang","year":"2019","unstructured":"Jiang Y, Tao D, Liu Y, Sun J, Ling H (2019) Cloud service recommendation based on unstructured textual information. Futur Gener Comput Syst 97:387\u2013396","journal-title":"Futur Gener Comput Syst"},{"issue":"4","key":"291_CR13","doi-asserted-by":"publisher","first-page":"102238","DOI":"10.1016\/j.ipm.2020.102238","volume":"57","author":"N Agarwal","year":"2020","unstructured":"Agarwal N, Sikka G, Awasthi LK (2020) Evaluation of web service clustering using Dirichlet Multinomial Mixture model based approach for dimensionality reduction in service representation. Inf Process Manage 57(4):102238","journal-title":"Inf Process Manage"},{"key":"291_CR14","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1016\/j.infsof.2017.05.001","volume":"90","author":"B Cao","year":"2017","unstructured":"Cao B, Liu XF, Liu J, Tang M (2017) Domain-aware Mashup service clustering based on LDA topic model from multiple data sources. Inf Softw Technol 90:40\u201354","journal-title":"Inf Softw Technol"},{"issue":"11","key":"291_CR15","doi-asserted-by":"publisher","first-page":"15169","DOI":"10.1007\/s11042-018-6894-4","volume":"78","author":"H Jelodar","year":"2019","unstructured":"Jelodar H, Wang Y, Yuan C, Feng X, Jiang X, Li Y, Zhao L (2019) Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey. Multimed Tools Appl 78(11):15169\u201315211","journal-title":"Multimed Tools Appl"},{"key":"291_CR16","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1109\/SOCA.2014.27","volume-title":"2014 IEEE 7th International Conference on Service-Oriented Computing and Applications","author":"T Liang","year":"2014","unstructured":"Liang T, Chen L, Ying H, Wu J (2014) Co-clustering WSDL documents to bootstrap service discovery. 2014 IEEE 7th International Conference on Service-Oriented Computing and Applications. pp 215\u2013222"},{"key":"291_CR17","doi-asserted-by":"publisher","first-page":"340","DOI":"10.1109\/ICWS.2013.53","volume-title":"2013 IEEE 20th International Conference on Web Services","author":"BTGS Kumara","year":"2013","unstructured":"Kumara BTGS, Paik I, Chen W (2013) Web-service clustering with a hybrid of ontology learning and information-retrieval-based term similarity. 2013 IEEE 20th International Conference on Web Services. pp 340\u2013347. https:\/\/doi.org\/10.1109\/ICWS.2013.53"},{"issue":"2020","key":"291_CR18","doi-asserted-by":"publisher","first-page":"113682","DOI":"10.1016\/j.eswa.2020.113682","volume":"161","author":"N Agarwal","year":"2020","unstructured":"Agarwal N, Sikka G, Awasthi LK (2020) Enhancing web service clustering using length feature weight method for service description document vector space representation. Expert Syst Appl 161(2020):113682","journal-title":"Expert Syst Appl"},{"key":"291_CR19","doi-asserted-by":"crossref","unstructured":"Hu Q, Shen J, Wang K, et al (2022) A Web service clustering method based on topic enhanced Gibbs sampling algorithm\u00a0for the Dirichlet Multinomial Mixture model and service collaboration graph[J]. Inf Sci 586:239-260.","DOI":"10.1016\/j.ins.2021.11.087"},{"key":"291_CR20","doi-asserted-by":"publisher","first-page":"162","DOI":"10.1007\/978-3-642-45005-1_12","volume-title":"International conference on service-oriented computing","author":"L Chen","year":"2013","unstructured":"Chen L, Wang Y, Yu Q, Zheng Z, Wu J (2013) WT-LDA: user tagging augmented LDA for web service clustering. International conference on service-oriented computing. Springer, Berlin, Heidelberg, pp 162\u2013176"},{"key":"291_CR21","doi-asserted-by":"publisher","first-page":"239","DOI":"10.1007\/978-3-030-30952-7_25","volume-title":"International Conference on Web Information Systems and Applications","author":"H Zhao","year":"2019","unstructured":"Zhao H, Chen J, Xu L (2019) Semantic web service discovery based on LDA clustering. International Conference on Web Information Systems and Applications. Springer, Cham, pp 239\u2013250"},{"key":"291_CR22","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1109\/ICWS.2017.9","volume-title":"2017 ieee international conference on web services","author":"M Shi","year":"2017","unstructured":"Shi M, Liu J, Zhou D, Tang M, Cao B (2017) WE-LDA: a word embeddings augmented LDA model for web services clustering. 2017 ieee international conference on web services. pp 9\u201316"},{"issue":"2","key":"291_CR23","doi-asserted-by":"publisher","first-page":"169","DOI":"10.1007\/s11761-018-0232-6","volume":"12","author":"A Bukhari","year":"2018","unstructured":"Bukhari A, Liu X (2018) A Web service search engine for large-scale web service discovery based on the probabilistic topic modeling and clustering. SOCA 12(2):169\u2013182","journal-title":"SOCA"},{"key":"291_CR24","doi-asserted-by":"publisher","first-page":"46","DOI":"10.1007\/978-3-030-05054-2_4","volume-title":"International conference on algorithms` and architectures for parallel processing","author":"Y Zhao","year":"2018","unstructured":"Zhao Y, He K, Qiao Y (2018) ST-LDA: high quality similar words augmented LDA for service clustering. International conference on algorithms` and architectures for parallel processing. Springer, Cham, pp 46\u201359"},{"issue":"3","key":"291_CR25","doi-asserted-by":"publisher","first-page":"59","DOI":"10.4018\/IJWSR.2019070104","volume":"16","author":"Y Zhao","year":"2019","unstructured":"Zhao Y, Qiao Y, He K (2019) A novel tagging augmented LDA model for clustering. Int J Web Serv Res 16(3):59\u201377","journal-title":"Int J Web Serv Res"},{"key":"291_CR26","doi-asserted-by":"crossref","unstructured":"Baskara AR, Sarno R (2017) Web service discovery using combined bi-term topic model and WDAG similarity. 2017 11th International Conference on Information & Communication Technology and System (ICTS). IEEE,\u00a0Chengdu, pp 235\u2013240","DOI":"10.1109\/ICTS.2017.8265676"},{"key":"291_CR27","doi-asserted-by":"crossref","unstructured":"Hu R, Liu J, Wen Y (2020) SP-BTM: A Specific Part-of-speech BTM for Service Clustering. 2020 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking. IEEE,\u00a0Exeter, pp 1050\u20131057","DOI":"10.1109\/ISPA-BDCloud-SocialCom-SustainCom51426.2020.00158"},{"issue":"42","key":"291_CR28","first-page":"1367","volume":"6","author":"BQ Cao","year":"2019","unstructured":"Cao BQ, Xiao QX, Zhang XP, Liu JX (2019) An API service recommendation method via combining self-organization map-based functionality clustering and deep factorization machine-based quality prediction. Chin J Comput 6(42):1367\u20131383","journal-title":"Chin J Comput"},{"key":"291_CR29","first-page":"153","volume-title":"International conference on services computing","author":"KK Fletcher","year":"2018","unstructured":"Fletcher KK (2018) A quality-based web api selection for mashup development using affinity propagation. International conference on services computing. Springer, Cham, pp 153\u2013165"},{"issue":"2","key":"291_CR30","doi-asserted-by":"publisher","first-page":"131","DOI":"10.26599\/BDMA.2019.9020022","volume":"3","author":"J Li","year":"2020","unstructured":"Li J, Jiao H, Wang J, Liu Z, Wu J (2020) Online real-time trajectory analysis based on adaptive time interval clustering algorithm. Big Data Mining Analytics 3(2):131\u2013142","journal-title":"Big Data Mining Analytics"},{"issue":"4","key":"291_CR31","doi-asserted-by":"publisher","first-page":"536","DOI":"10.26599\/TST.2020.9010024","volume":"26","author":"X Zhao","year":"2021","unstructured":"Zhao X, Wang Z, Gao L, Li Y, Wang S (2021) Incremental face clustering with optimal summary learning via graph convolutional network. Tsinghua Sci Technol 26(4):536\u2013547","journal-title":"Tsinghua Sci Technol"},{"issue":"3","key":"291_CR32","doi-asserted-by":"publisher","first-page":"183","DOI":"10.26599\/BDMA.2021.9020001","volume":"4","author":"Z Xue","year":"2021","unstructured":"Xue Z, Wang H (2021) Effective density-based clustering algorithms for incomplete data. Big Data Mining Analytics 4(3):183\u2013194","journal-title":"Big Data Mining Analytics"},{"issue":"5","key":"291_CR33","doi-asserted-by":"publisher","first-page":"772","DOI":"10.26599\/TST.2020.9010028","volume":"26","author":"Y Tian","year":"2021","unstructured":"Tian Y, Zheng R, Liang Z, Li S, Wu FX, Li M (2021) A data-driven clustering recommendation method for single-cell RNA-sequencing data. Tsinghua Sci Technol 26(5):772\u2013789","journal-title":"Tsinghua Sci Technol"},{"issue":"2","key":"291_CR34","doi-asserted-by":"publisher","first-page":"185","DOI":"10.26599\/TST.2019.9010078","volume":"26","author":"J Hu","year":"2020","unstructured":"Hu J, Pan Y, Li T, Yang Y (2020) TW-Co-MFC: Two-level weighted collaborative fuzzy clustering based on maximum entropy for multi-view data. Tsinghua Sci Technol 26(2):185\u2013198","journal-title":"Tsinghua Sci Technol"},{"issue":"6","key":"291_CR35","doi-asserted-by":"publisher","first-page":"886","DOI":"10.26599\/TST.2020.9010051","volume":"26","author":"A Xiong","year":"2021","unstructured":"Xiong A, Liu D, Tian H, Liu Z, Yu P, Kadoch M (2021) News keyword extraction algorithm based on semantic clustering and word graph model. Tsinghua Sci Technol 26(6):886\u2013893","journal-title":"Tsinghua Sci Technol"}],"container-title":["Journal of Cloud Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-022-00291-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13677-022-00291-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-022-00291-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,18]],"date-time":"2022-07-18T09:16:15Z","timestamp":1658135775000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofcloudcomputing.springeropen.com\/articles\/10.1186\/s13677-022-00291-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,18]]},"references-count":35,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["291"],"URL":"https:\/\/doi.org\/10.1186\/s13677-022-00291-9","relation":{},"ISSN":["2192-113X"],"issn-type":[{"value":"2192-113X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,18]]},"assertion":[{"value":"3 August 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 June 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 July 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"19"}}