{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T00:32:07Z","timestamp":1771461127781,"version":"3.50.1"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2014,6,1]],"date-time":"2014-06-01T00:00:00Z","timestamp":1401580800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001457","name":"Media Development Authority - Singapore","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001457","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002855","name":"Ministry of Science and Technology of the People's Republic of China","doi-asserted-by":"publisher","award":["2013DFG12870"],"award-info":[{"award-number":["2013DFG12870"]}],"id":[{"id":"10.13039\/501100002855","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002855","name":"Ministry of Science and Technology of the People's Republic of China","doi-asserted-by":"publisher","award":["2011CB302206"],"award-info":[{"award-number":["2011CB302206"]}],"id":[{"id":"10.13039\/501100002855","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National \u201c1000 People Plan\u201d"},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61370022, 61003097, 60933013, and 61210008"],"award-info":[{"award-number":["61370022, 61003097, 60933013, and 61210008"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2014,6]]},"abstract":"<jats:p>Nowadays, the amount of multimedia contents in microblogs is growing significantly. More than 20% of microblogs link to a picture or video in certain large systems. The rich semantics in microblogs provides an opportunity to endow images with higher-level semantics beyond object labels. However, this raises new challenges for understanding the association between multimodal multimedia contents in multimedia-rich microblogs. Disobeying the fundamental assumptions of traditional annotation, tagging, and retrieval systems, pictures and words in multimedia-rich microblogs are loosely associated and a correspondence between pictures and words cannot be established. To address the aforementioned challenges, we present the first study analyzing and modeling the associations between multimodal contents in microblog streams, aiming to discover multimodal topics from microblogs by establishing correspondences between pictures and words in microblogs. We first use a data-driven approach to analyze the new characteristics of the words, pictures, and their association types in microblogs. We then propose a novel generative model called the Bilateral Correspondence Latent Dirichlet Allocation (BC-LDA) model. Our BC-LDA model can assign flexible associations between pictures and words and is able to not only allow picture-word co-occurrence with bilateral directions, but also single modal association. This flexible association can best fit the data distribution, so that the model can discover various types of joint topics and generate pictures and words with the topics accordingly. We evaluate this model extensively on a large-scale real multimedia-rich microblogs dataset. We demonstrate the advantages of the proposed model in several application scenarios, including image tagging, text illustration, and topic discovery. The experimental results demonstrate that our proposed model can significantly and consistently outperform traditional approaches.<\/jats:p>","DOI":"10.1145\/2611388","type":"journal-article","created":{"date-parts":[[2014,7,28]],"date-time":"2014-07-28T13:21:33Z","timestamp":1406553693000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Bilateral Correspondence Model for Words-and-Pictures Association in Multimedia-Rich Microblogs"],"prefix":"10.1145","volume":"10","author":[{"given":"Zhiyu","family":"Wang","sequence":"first","affiliation":[{"name":"Tsinghua University, Beijing, China"}]},{"given":"Peng","family":"Cui","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}]},{"given":"Lexing","family":"Xie","sequence":"additional","affiliation":[{"name":"Australian National University and NICTA, Australia and NICTA"}]},{"given":"Wenwu","family":"Zhu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}]},{"given":"Yong","family":"Rui","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}]},{"given":"Shiqiang","family":"Yang","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2014,7,4]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944965"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860460"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1214\/07-AOAS114"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937"},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1080\/00031305.1992.10475878","article-title":"Explaining the Gibbs sampler","volume":"46","author":"Casella G.","year":"1992","journal-title":"Amer. Statist."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2502081.2502203"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063770"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871552"},{"key":"e_1_2_1_9_1","unstructured":"China Internet Watch Team Staff. 2011. Total WEIBO users: Sina v.s. Tencent. http:\/\/www.chinainternetwatch.com\/1296\/total-weibo-users-sina-tencent. China Internet Watch Team Staff. 2011. Total WEIBO users: Sina v.s. Tencent. http:\/\/www.chinainternetwatch.com\/1296\/total-weibo-users-sina-tencent."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1646396.1646452"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835449.1835553"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'09)","author":"Deng J."},{"key":"e_1_2_1_13_1","unstructured":"M. Everingham L. Van Gool C. K. I. Williams J. Winn and A. Zisserman. 2011. The Pascal visual object classes challenge 2011 (voc2011) results. http:\/\/www.pascalnetwork.org\/challenges\/VOC. M. Everingham L. Van Gool C. K. I. Williams J. Winn and A. Zisserman. 2011. The Pascal visual object classes challenge 2011 (voc2011) results. http:\/\/www.pascalnetwork.org\/challenges\/VOC."},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the 14th Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 28--36","author":"Fagin R."},{"key":"e_1_2_1_15_1","first-page":"1295","article-title":"Symmetric correspondence topic models for multilingual text analysis","volume":"25","author":"Fukumasu K.","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860459"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2396771"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1126004.1126008"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10)","author":"Li L. J."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2013.06.011"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/850924.851523"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/219717.219748"},{"key":"e_1_2_1_23_1","unstructured":"R. Miller. 2010. Twitter unveils new website with picture and video content embedded on site. http:\/\/www.engadget.com\/2010\/09\/14\/twitter-relaunches-main-site-with-content-embedded-on-site. R. Miller. 2010. Twitter unveils new website with picture and video content embedded on site. http:\/\/www.engadget.com\/2010\/09\/14\/twitter-relaunches-main-site-with-content-embedded-on-site."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the 20th Annual Conference on Neural Information Processing Systems. 985--992","author":"Moosmann F."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2010010"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1571941.1572026"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487575.2487668"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1963405.1963449"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393347.2393416"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1958824.1958830"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the AAAI International Conference on Weblogs and Social Media. The AAAI Press.","author":"Ramage D."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2187836.2187896"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393347.2393364"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1367497.1367542"},{"key":"e_1_2_1_35_1","volume-title":"Proceedings of the 9th IEEE International Conference on Computer Vision. 1470--1477","author":"Sivic J."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.3115\/1119250.1119269"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2124295.2124299"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393347.2396484"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/1935826.1935865"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2502081.2502204"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2611388","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2611388","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:56:05Z","timestamp":1750229765000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2611388"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,6]]},"references-count":40,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2014,6]]}},"alternative-id":["10.1145\/2611388"],"URL":"https:\/\/doi.org\/10.1145\/2611388","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,6]]},"assertion":[{"value":"2013-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-02-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-07-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}