{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:30:14Z","timestamp":1750221014546,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":66,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,1,20]],"date-time":"2020-01-20T00:00:00Z","timestamp":1579478400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,1,20]]},"DOI":"10.1145\/3336191.3371768","type":"proceedings-article","created":{"date-parts":[[2020,1,22]],"date-time":"2020-01-22T19:08:16Z","timestamp":1579720096000},"page":"456-464","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Extreme Regression for Dynamic Search Advertising"],"prefix":"10.1145","author":[{"given":"Yashoteja","family":"Prabhu","sequence":"first","affiliation":[{"name":"Microsoft Research India & Indian Institute of Technology Delhi, Bengaluru, India"}]},{"given":"Aditya","family":"Kusupati","sequence":"additional","affiliation":[{"name":"University of Washington, Seattle, WA, USA"}]},{"given":"Nilesh","family":"Gupta","sequence":"additional","affiliation":[{"name":"Microsoft Research India, Bengaluru, India"}]},{"given":"Manik","family":"Varma","sequence":"additional","affiliation":[{"name":"Microsoft Research India & Indian Institute of Technology Delhi, Bengaluru, India"}]}],"member":"320","published-online":{"date-parts":[[2020,1,22]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"R. Agrawal A. Gupta Y. Prabhu and M. Varma. 2013. Multi-label learning with millions of labels: Recommending advertiser bid phrases for web pages. In WWW .  R. Agrawal A. Gupta Y. Prabhu and M. Varma. 2013. 
Multi-label learning with millions of labels: Recommending advertiser bid phrases for web pages. In WWW .","DOI":"10.1145\/2488388.2488391"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3018661.3018741"},{"key":"e_1_3_2_1_3_1","unstructured":"R. Babbar and B. Sch\u00f6lkopf. 2018. Adversarial Extreme Multi-label Classification. arXiv preprint arXiv:1803.01570 (2018).  R. Babbar and B. Sch\u00f6lkopf. 2018. Adversarial Extreme Multi-label Classification. arXiv preprint arXiv:1803.01570 (2018)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"R. Bekkerman M. Bilenko and J. Langford. 2011. Scaling up machine learning: Parallel and distributed approaches .Cambridge University Press.  R. Bekkerman M. Bilenko and J. Langford. 2011. Scaling up machine learning: Parallel and distributed approaches .Cambridge University Press.","DOI":"10.1017\/CBO9781139042918"},{"key":"e_1_3_2_1_5_1","unstructured":"K. Bhatia K. Dahiya H. Jain Y. Prabhu and M. Varma. 2019. The Extreme Classification Repository: Multi-label Datasets & Code . http:\/\/manikvarma.org\/downloads\/XC\/XMLRepository.html  K. Bhatia K. Dahiya H. Jain Y. Prabhu and M. Varma. 2019. The Extreme Classification Repository: Multi-label Datasets & Code . http:\/\/manikvarma.org\/downloads\/XC\/XMLRepository.html"},{"key":"e_1_3_2_1_6_1","unstructured":"K. Bhatia H. Jain P. Kar M. Varma and P. Jain. 2015. Sparse Local Embeddings for Extreme Multi-label Classification. In NeurIPS .  K. Bhatia H. Jain P. Kar M. Varma and P. Jain. 2015. Sparse Local Embeddings for Extreme Multi-label Classification. In NeurIPS ."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"T. Chai and R. R. Draxler. 2014. Root mean square error (RMSE) or mean absolute error (MAE).  T. Chai and R. R. Draxler. 2014. Root mean square error (RMSE) or mean absolute error (MAE).","DOI":"10.5194\/gmdd-7-1525-2014"},{"key":"e_1_3_2_1_8_1","unstructured":"Y. Chen and H. Lin. 2012. 
Feature-aware label space dimension reduction for multi-label classification. In NeurIPS .  Y. Chen and H. Lin. 2012. Feature-aware label space dimension reduction for multi-label classification. In NeurIPS ."},{"key":"e_1_3_2_1_9_1","unstructured":"Minhao Cheng Ian Davidson and Cho-Jui Hsieh. 2018. Extreme Learning to Rank via Low Rank Assumption. In ICML .  Minhao Cheng Ian Davidson and Cho-Jui Hsieh. 2018. Extreme Learning to Rank via Low Rank Assumption. In ICML ."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Y. Choi M. Fontoura E. Gabrilovich V. Josifovski M. R. Mediano and B. Pang. 2010. Using landing pages for sponsored search ad selection. In WWW .  Y. Choi M. Fontoura E. Gabrilovich V. Josifovski M. R. Mediano and B. Pang. 2010. Using landing pages for sponsored search ad selection. In WWW .","DOI":"10.1145\/1772690.1772717"},{"key":"e_1_3_2_1_11_1","unstructured":"M. Ciss\u00e9 N. Usunier T. Arti\u00e8res and P. Gallinari. 2013. Robust Bloom Filters for Large MultiLabel Classification Tasks. In NeurIPS .  M. Ciss\u00e9 N. Usunier T. Arti\u00e8res and P. Gallinari. 2013. Robust Bloom Filters for Large MultiLabel Classification Tasks. In NeurIPS ."},{"key":"e_1_3_2_1_12_1","volume-title":"W. Kot\u0142owski, W. Waegeman, R. Busa-Fekete, and E. H\u00fcllermeier.","author":"K.","year":"2016","unstructured":"K. Dembczy\u0144ski , W. Kot\u0142owski, W. Waegeman, R. Busa-Fekete, and E. H\u00fcllermeier. 2016 . Consistency of Probabilistic Classifier Trees. In Machine Learning and Knowledge Discovery in Databases . K. Dembczy\u0144ski, W. Kot\u0142owski, W. Waegeman, R. Busa-Fekete, and E. H\u00fcllermeier. 2016. Consistency of Probabilistic Classifier Trees. In Machine Learning and Knowledge Discovery in Databases ."},{"key":"e_1_3_2_1_13_1","volume-title":"Cl\u00e9men\u00e7on","author":"Dhanjal C.","year":"2015","unstructured":"C. Dhanjal , R. Gaudel , and S. Cl\u00e9men\u00e7on . 2015 . Collaborative filtering with localised ranking. In AAAI . C. 
Dhanjal, R. Gaudel, and S. Cl\u00e9men\u00e7on. 2015. Collaborative filtering with localised ranking. In AAAI ."},{"key":"e_1_3_2_1_14_1","volume-title":"Polytomous logistic regression. Statistica Neerlandica","author":"Engel J","year":"1988","unstructured":"J Engel . 1988. Polytomous logistic regression. Statistica Neerlandica ( 1988 ). J Engel. 1988. Polytomous logistic regression. Statistica Neerlandica (1988)."},{"key":"e_1_3_2_1_15_1","volume-title":"LIBLINEAR: A library for large linear classification. JMLR","author":"Fan R. E.","year":"2008","unstructured":"R. E. Fan , K. W. Chang , C. J. Hsieh , X. R. Wang , and C. J. Lin . 2008 . LIBLINEAR: A library for large linear classification. JMLR (2008). R. E. Fan, K. W. Chang, C. J. Hsieh, X. R. Wang, and C. J. Lin. 2008. LIBLINEAR: A library for large linear classification. JMLR (2008)."},{"key":"e_1_3_2_1_16_1","unstructured":"C. Guo A. Mousavi X. Wu D. N. Holtmann-Rice S. Kale S. Reddi and S. Kumar. 2019. Breaking the Glass Ceiling for Embedding-Based Classifiers for Large Output Spaces. In NeurIPS .  C. Guo A. Mousavi X. Wu D. N. Holtmann-Rice S. Kale S. Reddi and S. Kumar. 2019. Breaking the Glass Ceiling for Embedding-Based Classifiers for Large Output Spaces. In NeurIPS ."},{"key":"e_1_3_2_1_17_1","volume-title":"Konstan","author":"Maxwell Harper F.","year":"2015","unstructured":"F. Maxwell Harper and Joseph A . Konstan . 2015 . The MovieLens Datasets: History and Context. ACM Trans. Interact. Intell. Syst . (2015), 19:1--19:19. F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM Trans. Interact. Intell. Syst. (2015), 19:1--19:19."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"R. Herbrich T. Graepel and K. Obermayer. 1999. Support vector learning for ordinal regression. (1999).  R. Herbrich T. Graepel and K. Obermayer. 1999. Support vector learning for ordinal regression. 
(1999).","DOI":"10.1049\/cp:19991091"},{"key":"e_1_3_2_1_19_1","unstructured":"D. Hsu S. Kakade J. Langford and T. Zhang. 2009. Multi-Label Prediction via Compressed Sensing. In NeurIPS .  D. Hsu S. Kakade J. Langford and T. Zhang. 2009. Multi-Label Prediction via Compressed Sensing. In NeurIPS ."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"crossref","unstructured":"J. Hu and P. Li. 2018. Collaborative Multi-objective Ranking. In PCIKM .  J. Hu and P. Li. 2018. Collaborative Multi-objective Ranking. In PCIKM .","DOI":"10.1145\/3269206.3271785"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"crossref","unstructured":"P. S. Huang X. He J. Gao L. Deng A. Acero and L. P. Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In CIKM .  P. S. Huang X. He J. Gao L. Deng A. Acero and L. P. Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In CIKM .","DOI":"10.1145\/2505515.2505665"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3289600.3290979"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"crossref","unstructured":"H. Jain Y. Prabhu and M. Varma. 2016. Extreme Multi-label Loss Functions for Recommendation Tagging Ranking and Other Missing Label Applications. In KDD .  H. Jain Y. Prabhu and M. Varma. 2016. Extreme Multi-label Loss Functions for Recommendation Tagging Ranking and Other Missing Label Applications. In KDD .","DOI":"10.1145\/2939672.2939756"},{"key":"e_1_3_2_1_24_1","volume-title":"ACM Transactions on Information Systems (TOIS)","volume":"20","author":"K.","year":"2002","unstructured":"K. J\u00e4rvelin and J. Kek\u00e4l\u00e4inen. 2002 . Cumulated gain-based evaluation of IR techniques . ACM Transactions on Information Systems (TOIS) , Vol. 20 , 4 (2002). K. J\u00e4rvelin and J. Kek\u00e4l\u00e4inen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems (TOIS) , Vol. 20, 4 (2002)."},{"key":"e_1_3_2_1_25_1","unstructured":"K. Jasinska K. 
Dembczynski R. Busa-Fekete K. Pfannschmidt T. Klerx and E. H\u00fcllermeier. 2016. Extreme F-measure Maximization Using Sparse Probability Estimates. In ICML .  K. Jasinska K. Dembczynski R. Busa-Fekete K. Pfannschmidt T. Klerx and E. H\u00fcllermeier. 2016. Extreme F-measure Maximization Using Sparse Probability Estimates. In ICML ."},{"key":"e_1_3_2_1_26_1","unstructured":"K. S. Jones S. Walker and S. E. Robertson. 2000. A probabilistic model of information retrieval: development and comparative experiments. Inf. Process. Manage. (2000).  K. S. Jones S. Walker and S. E. Robertson. 2000. A probabilistic model of information retrieval: development and comparative experiments. Inf. Process. Manage. (2000)."},{"key":"e_1_3_2_1_27_1","unstructured":"S. M. Kakade K. Sridharan and A. Tewari. 2009. On the complexity of linear prediction: Risk bounds margin bounds and regularization. In NeurIPS .  S. M. Kakade K. Sridharan and A. Tewari. 2009. On the complexity of linear prediction: Risk bounds margin bounds and regularization. In NeurIPS ."},{"key":"e_1_3_2_1_28_1","volume-title":"Biometrika","volume":"30","author":"Kendall M. G.","year":"1938","unstructured":"M. G. Kendall . 1938 . A new measure of rank correlation . Biometrika , Vol. 30 (1938). M. G. Kendall. 1938. A new measure of rank correlation. Biometrika , Vol. 30 (1938)."},{"key":"e_1_3_2_1_29_1","volume-title":"D\u017eeroski","author":"Kocev D.","year":"2007","unstructured":"D. Kocev , C. Vens , J. Struyf , and S. D\u017eeroski . 2007 . Ensembles of multi-objective decision trees. In ECML . D. Kocev, C. Vens, J. Struyf, and S. D\u017eeroski. 2007. Ensembles of multi-objective decision trees. In ECML ."},{"key":"e_1_3_2_1_30_1","volume-title":"On information and sufficiency. The annals of mathematical statistics","author":"Kullback Solomon","year":"1951","unstructured":"Solomon Kullback and Richard A Leibler . 1951. On information and sufficiency. The annals of mathematical statistics , Vol. 22 , 1 ( 1951 ), 79--86. 
Solomon Kullback and Richard A Leibler. 1951. On information and sufficiency. The annals of mathematical statistics , Vol. 22, 1 (1951), 79--86."},{"key":"e_1_3_2_1_31_1","unstructured":"A. Kusupati M. Singh K. Bhatia A. Kumar P. Jain and M. Varma. 2018. FastGRNN: A Fast Accurate Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network.. In NeurIPS .  A. Kusupati M. Singh K. Bhatia A. Kumar P. Jain and M. Varma. 2018. FastGRNN: A Fast Accurate Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network.. In NeurIPS ."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"C. Lee and C. Lin. 2014. Large-scale linear ranksvm. Neural computation Vol. 26 4 (2014) 781--817.  C. Lee and C. Lin. 2014. Large-scale linear ranksvm. Neural computation Vol. 26 4 (2014) 781--817.","DOI":"10.1162\/NECO_a_00571"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"crossref","unstructured":"J. Lee S. Bengio S. Kim G. Lebanon and Y. Singer. 2014. Local collaborative ranking. In WWW .  J. Lee S. Bengio S. Kim G. Lebanon and Y. Singer. 2014. Local collaborative ranking. In WWW .","DOI":"10.1145\/2566486.2567970"},{"key":"e_1_3_2_1_34_1","unstructured":"Z. Lin G. Ding M. Hu and J. Wang. 2014. Multi-label Classification via Feature-aware Implicit Label Space Encoding. In ICML .  Z. Lin G. Ding M. Hu and J. Wang. 2014. Multi-label Classification via Feature-aware Implicit Label Space Encoding. In ICML ."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"crossref","unstructured":"J. Liu W. Chang Y. Wu and Y. Yang. 2017. Deep Learning for Extreme Multi-label Text Classification. In SIGIR .  J. Liu W. Chang Y. Wu and Y. Yang. 2017. Deep Learning for Extreme Multi-label Text Classification. In SIGIR .","DOI":"10.1145\/3077136.3080834"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"crossref","unstructured":"J. McAuley and J. Leskovec. 2013. Hidden factors and hidden topics: understanding rating dimensions with review text. In RecSys .  J. McAuley and J. Leskovec. 2013. 
Hidden factors and hidden topics: understanding rating dimensions with review text. In RecSys .","DOI":"10.1145\/2507157.2507163"},{"key":"e_1_3_2_1_37_1","unstructured":"E. L. Mencia and J. F\u00fcrnkranz. 2008. Efficient pairwise multilabel classification for large-scale problems in the legal domain. In ECML .  E. L. Mencia and J. F\u00fcrnkranz. 2008. Efficient pairwise multilabel classification for large-scale problems in the legal domain. In ECML ."},{"key":"e_1_3_2_1_38_1","unstructured":"P. Mineiro and N. Karampatziakis. 2015. Fast Label Embeddings for Extremely Large Output Spaces. In ECML .  P. Mineiro and N. Karampatziakis. 2015. Fast Label Embeddings for Extremely Large Output Spaces. In ECML ."},{"key":"e_1_3_2_1_39_1","unstructured":"A. Niculescu-Mizil and E. Abbasnejad. 2017. Label Filters for Large Scale Multilabel Classification. In AISTATS .  A. Niculescu-Mizil and E. Abbasnejad. 2017. Label Filters for Large Scale Multilabel Classification. In AISTATS ."},{"key":"e_1_3_2_1_40_1","unstructured":"D. Park J. Neeman J. Zhang S. Sanghavi and I. S. Dhillon. 2015. Preference completion: Large-scale collaborative ranking from pairwise comparisons. In ICML .  D. Park J. Neeman J. Zhang S. Sanghavi and I. S. Dhillon. 2015. Preference completion: Large-scale collaborative ranking from pairwise comparisons. In ICML ."},{"key":"e_1_3_2_1_41_1","volume-title":"G. Paliouras, \u00c9. Gaussier, I. Androutsopoulos, M. R. Amini, and P. Gallinari.","author":"Partalas I.","year":"2015","unstructured":"I. Partalas , A. Kosmopoulos , N. Baskiotis , T. Arti\u00e8res , G. Paliouras, \u00c9. Gaussier, I. Androutsopoulos, M. R. Amini, and P. Gallinari. 2015 . LSHTC : A Benchmark for Large-Scale Text Classification . (2015). http:\/\/arxiv.org\/abs\/1503.08581 I. Partalas, A. Kosmopoulos, N. Baskiotis, T. Arti\u00e8res, G. Paliouras, \u00c9. Gaussier, I. Androutsopoulos, M. R. Amini, and P. Gallinari. 2015. LSHTC: A Benchmark for Large-Scale Text Classification. (2015). 
http:\/\/arxiv.org\/abs\/1503.08581"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"crossref","unstructured":"Y. Prabhu A. Kag S. Gopinath K. Dahiya S. Harsola R. Agrawal and M. Varma. 2018a. Extreme multi-label learning with label features for warm-start tagging ranking and recommendation. In WSDM .  Y. Prabhu A. Kag S. Gopinath K. Dahiya S. Harsola R. Agrawal and M. Varma. 2018a. Extreme multi-label learning with label features for warm-start tagging ranking and recommendation. In WSDM .","DOI":"10.1145\/3159652.3159660"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3185998"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"crossref","unstructured":"Y. Prabhu and M. Varma. 2014. FastXML: A fast accurate and stable tree-classifier for extreme multi-label learning. In KDD .  Y. Prabhu and M. Varma. 2014. FastXML: A fast accurate and stable tree-classifier for extreme multi-label learning. In KDD .","DOI":"10.1145\/2623330.2623651"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"crossref","unstructured":"B. Pradel N. Usunier and P. Gallinari. 2012. Ranking With Non-Random Missing Ratings: Influence Of Popularity And Positivity on Evaluation Metrics. In RecSys .  B. Pradel N. Usunier and P. Gallinari. 2012. Ranking With Non-Random Missing Ratings: Influence Of Popularity And Positivity on Evaluation Metrics. In RecSys .","DOI":"10.1145\/2365952.2365982"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"crossref","unstructured":"S. Ravi A. Z. Broder E. Gabrilovich V. Josifovski S. Pandey and B. Pang. 2010. Automatic generation of bid phrases for online advertising. In WSDM .  S. Ravi A. Z. Broder E. Gabrilovich V. Josifovski S. Pandey and B. Pang. 2010. Automatic generation of bid phrases for online advertising. In WSDM .","DOI":"10.1145\/1718487.1718530"},{"key":"e_1_3_2_1_47_1","unstructured":"D. Sculley. 2009. Large scale learning to rank. (2009).  D. Sculley. 2009. Large scale learning to rank. 
(2009)."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Y. Shen X. He J. Gao L. Deng and G. Mesnil. 2014. Learning semantic representations using convolutional neural networks for web search. In WWW.  Y. Shen X. He J. Gao L. Deng and G. Mesnil. 2014. Learning semantic representations using convolutional neural networks for web search. In WWW.","DOI":"10.1145\/2567948.2577348"},{"key":"e_1_3_2_1_49_1","unstructured":"S. Si H. Zhang S. S. Keerthi D. Mahajan I. S. Dhillon and C. J. Hsieh. 2017. Gradient Boosted Decision Trees for High Dimensional Sparse Output. In ICML .  S. Si H. Zhang S. S. Keerthi D. Mahajan I. S. Dhillon and C. J. Hsieh. 2017. Gradient Boosted Decision Trees for High Dimensional Sparse Output. In ICML ."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"crossref","unstructured":"A. J. Smola and B. Sch\u00f6lkopf. 2004. A tutorial on support vector regression. Statistics and computing (2004).  A. J. Smola and B. Sch\u00f6lkopf. 2004. A tutorial on support vector regression. Statistics and computing (2004).","DOI":"10.1002\/0470011815.b2a14038"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"crossref","unstructured":"Y. Tagami. 2017. AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification. In KDD .  Y. Tagami. 2017. AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification. In KDD .","DOI":"10.1145\/3097983.3097987"},{"key":"e_1_3_2_1_52_1","unstructured":"Y. Wang L. Wang Y. Li D. He and T. Liu. 2013. A theoretical analysis of NDCG type ranking measures. In COLT .  Y. Wang L. Wang Y. Li D. He and T. Liu. 2013. A theoretical analysis of NDCG type ranking measures. In COLT ."},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177698603"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"crossref","unstructured":"X. Wei and W. B. Croft. 2006. LDA-based document models for ad-hoc retrieval. In SIGIR.  X. Wei and W. B. Croft. 2006. 
LDA-based document models for ad-hoc retrieval. In SIGIR.","DOI":"10.1145\/1148170.1148204"},{"key":"e_1_3_2_1_55_1","volume-title":"Wsabie: Scaling Up To Large Vocabulary Image Annotation. In IJCAI .","author":"Weston J.","year":"2011","unstructured":"J. Weston , S. Bengio , and N. Usunier . 2011 . Wsabie: Scaling Up To Large Vocabulary Image Annotation. In IJCAI . J. Weston, S. Bengio, and N. Usunier. 2011. Wsabie: Scaling Up To Large Vocabulary Image Annotation. In IJCAI ."},{"key":"e_1_3_2_1_56_1","unstructured":"J. Weston A. Makadia and H. Yee. 2013. Label Partitioning For Sublinear Ranking. In ICML .  J. Weston A. Makadia and H. Yee. 2013. Label Partitioning For Sublinear Ranking. In ICML ."},{"key":"e_1_3_2_1_57_1","unstructured":"L. Wu C. Hsieh and J. Sharpnack. 2018. SQL-Rank: A Listwise Approach to Collaborative Ranking. In ICML .  L. Wu C. Hsieh and J. Sharpnack. 2018. SQL-Rank: A Listwise Approach to Collaborative Ranking. In ICML ."},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"crossref","unstructured":"C. Xu D. Tao and C. Xu. 2016. Robust Extreme Multi-label Learning. In KDD .  C. Xu D. Tao and C. Xu. 2016. Robust Extreme Multi-label Learning. In KDD .","DOI":"10.1145\/2939672.2939798"},{"key":"e_1_3_2_1_59_1","unstructured":"I. E. H. Yen X. Huang W. Dai P. Ravikumar I. Dhillon and E. Xing. 2017. PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification. In KDD .  I. E. H. Yen X. Huang W. Dai P. Ravikumar I. Dhillon and E. Xing. 2017. PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification. In KDD ."},{"key":"e_1_3_2_1_60_1","unstructured":"I. E. H. Yen X. Huang P. Ravikumar K. Zhong and I. S. Dhillon. 2016. PD-Sparse: A primal and dual sparse approach to extreme multiclass and multilabel classification. In ICML .  I. E. H. Yen X. Huang P. Ravikumar K. Zhong and I. S. Dhillon. 2016. PD-Sparse: A primal and dual sparse approach to extreme multiclass and multilabel classification. 
In ICML ."},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"crossref","unstructured":"W. T. Yih J. Goodman and V. R. Carvalho. 2006. Finding advertising keywords on web pages. In WWW .  W. T. Yih J. Goodman and V. R. Carvalho. 2006. Finding advertising keywords on web pages. In WWW .","DOI":"10.1145\/1135777.1135813"},{"key":"e_1_3_2_1_62_1","volume-title":"et al.","author":"Yin D.","year":"2016","unstructured":"D. Yin , Y. Hu , J. Tang , T. Daly , M. Zhou , Hua Ouyang , Jianhui Chen , Changsung Kang , H. Deng , C. Nobata , et al. 2016 . Ranking relevance in yahoo search. In PKDD . D. Yin, Y. Hu, J. Tang, T. Daly, M. Zhou, Hua Ouyang, Jianhui Chen, Changsung Kang, H. Deng, C. Nobata, et al. 2016. Ranking relevance in yahoo search. In PKDD ."},{"key":"e_1_3_2_1_63_1","unstructured":"R. You Z. Zhang Z. Wang S. Dai H. Mamitsuka and S. Zhu. 2019. AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification. In NeurIPS .  R. You Z. Zhang Z. Wang S. Dai H. Mamitsuka and S. Zhu. 2019. AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification. In NeurIPS ."},{"key":"e_1_3_2_1_64_1","unstructured":"H. F. Yu P. Jain P. Kar and I. S. Dhillon. 2014. Large-scale Multi-label Learning with Missing Labels. In ICML .  H. F. Yu P. Jain P. Kar and I. S. Dhillon. 2014. Large-scale Multi-label Learning with Missing Labels. In ICML ."},{"key":"e_1_3_2_1_65_1","doi-asserted-by":"crossref","unstructured":"W. Zhang D. Wang G. Xue and H. Zha. 2012. Advertising Keywords Recommendation for Short-Text Web Pages Using Wikipedia. ACM TIST (2012).  W. Zhang D. Wang G. Xue and H. Zha. 2012. Advertising Keywords Recommendation for Short-Text Web Pages Using Wikipedia. ACM TIST (2012).","DOI":"10.1145\/2089094.2089112"},{"key":"e_1_3_2_1_66_1","unstructured":"M. Zhu. 2004. Recall Precision and Average Precision.  M. Zhu. 2004. 
Recall Precision and Average Precision."}],"event":{"name":"WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data","SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Houston TX USA","acronym":"WSDM '20"},"container-title":["Proceedings of the 13th International Conference on Web Search and Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3336191.3371768","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3336191.3371768","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:26:10Z","timestamp":1750206370000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3336191.3371768"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,1,20]]},"references-count":66,"alternative-id":["10.1145\/3336191.3371768","10.1145\/3336191"],"URL":"https:\/\/doi.org\/10.1145\/3336191.3371768","relation":{},"subject":[],"published":{"date-parts":[[2020,1,20]]},"assertion":[{"value":"2020-01-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}