{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,20]],"date-time":"2025-09-20T20:16:12Z","timestamp":1758399372718,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":38,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,9,26]],"date-time":"2019-09-26T00:00:00Z","timestamp":1569456000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,9,26]]},"DOI":"10.1145\/3341981.3344220","type":"proceedings-article","created":{"date-parts":[[2019,9,27]],"date-time":"2019-09-27T12:34:07Z","timestamp":1569587647000},"page":"19-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Learning a Better Negative Sampling Policy with Deep Neural Networks for Search"],"prefix":"10.1145","author":[{"given":"Daniel","family":"Cohen","sequence":"first","affiliation":[{"name":"University of Massachusetts Amherst, Amherst, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Scott M.","family":"Jordan","sequence":"additional","affiliation":[{"name":"University of Massachusetts Amherst, Amherst, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"W. Bruce","family":"Croft","sequence":"additional","affiliation":[{"name":"University of Massachusetts Amherst, Amherst, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,9,26]]},"reference":[{"volume-title":"SIGIR 2018 .","author":"Ai Qingyao","key":"e_1_3_2_1_1_1","unstructured":"Qingyao Ai , Keping Bi , Jiafeng Guo , and W. Bruce Croft . 2018. Learning a Deep Listwise Context Model for Ranking Refinement . In SIGIR 2018 . Qingyao Ai, Keping Bi, Jiafeng Guo, and W. Bruce Croft. 2018. Learning a Deep Listwise Context Model for Ranking Refinement. In SIGIR 2018 ."},{"volume-title":"Proceedings of ICML 2017 (Proceedings of Machine Learning Research). PMLR .","author":"Bello Irwan","key":"e_1_3_2_1_2_1","unstructured":"Irwan Bello , Barret Zoph , Vijay Vasudevan , and Quoc V. Le . 2017. Neural Optimizer Search with Reinforcement Learning . In Proceedings of ICML 2017 (Proceedings of Machine Learning Research). PMLR . Irwan Bello, Barret Zoph, Vijay Vasudevan, and Quoc V. Le. 2017. Neural Optimizer Search with Reinforcement Learning. In Proceedings of ICML 2017 (Proceedings of Machine Learning Research). PMLR ."},{"key":"e_1_3_2_1_3_1","volume-title":"How to assess and report the performance of a stochastic algorithm on a benchmark problem: mean or best result on a number of runs? Optimization letters","author":"Birattari Mauro","year":"2007","unstructured":"Mauro Birattari and Marco Dorigo . 2007. How to assess and report the performance of a stochastic algorithm on a benchmark problem: mean or best result on a number of runs? Optimization letters , Vol. 1 , 3 ( 2007 ), 309--311. Mauro Birattari and Marco Dorigo. 2007. How to assess and report the performance of a stochastic algorithm on a benchmark problem: mean or best result on a number of runs? Optimization letters , Vol. 1, 3 (2007), 309--311."},{"key":"e_1_3_2_1_4_1","volume-title":"Davide Del Testa","author":"Bojarski Mariusz","year":"2016","unstructured":"Mariusz Bojarski , Davide Del Testa , Daniel Dworakowski, Bernhard Firner , Beat Flepp, Prasoon Goyal, Lawrence D. Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, Xin Zhang, Jake Zhao, and Karol Zieba. 2016 . End to End Learning for Self-Driving Cars. CoRR , Vol. abs\/ 1604 .07316 (2016). arxiv: 1604.07316 Mariusz Bojarski, Davide Del Testa, Daniel Dworakowski, Bernhard Firner, Beat Flepp, Prasoon Goyal, Lawrence D. Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, Xin Zhang, Jake Zhao, and Karol Zieba. 2016. End to End Learning for Self-Driving Cars. CoRR , Vol. abs\/1604.07316 (2016). arxiv: 1604.07316"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3159652.3159695"},{"volume-title":"ICTIR '16 .","author":"Cohen Daniel","key":"e_1_3_2_1_6_1","unstructured":"Daniel Cohen and W. Bruce Croft . [n. d.]. End to End Long Short Term Memory Networks for Non-Factoid Question Answering . In ICTIR '16 . Daniel Cohen and W. Bruce Croft. [n. d.]. End to End Long Short Term Memory Networks for Non-Factoid Question Answering. In ICTIR '16 ."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Mostafa Dehghani Hamed Zamani Aliaksei Severyn Jaap Kamps and W. Bruce Croft. 2017. Neural Ranking Models with Weak Supervision. In SIGIR. ACM .  Mostafa Dehghani Hamed Zamani Aliaksei Severyn Jaap Kamps and W. Bruce Croft. 2017. Neural Ranking Models with Weak Supervision. In SIGIR. ACM .","DOI":"10.1145\/3077136.3080832"},{"key":"e_1_3_2_1_8_1","unstructured":"Yixing Fan Jiafeng Guo Yanyan Lan Jun Xu Chengxiang Zhai and Xueqi Cheng. 2018a. Modeling Diverse Relevance Patterns in Ad-hoc Retrieval. In SIGIR. 375--384.  Yixing Fan Jiafeng Guo Yanyan Lan Jun Xu Chengxiang Zhai and Xueqi Cheng. 2018a. Modeling Diverse Relevance Patterns in Ad-hoc Retrieval. In SIGIR. 375--384."},{"key":"e_1_3_2_1_9_1","unstructured":"Yang Fan Fei Tian Tao Qin Xiang-Yang Li and Tie-Yan Liu. 2018b. Learning to Teach. In ICLR .  Yang Fan Fei Tian Tao Qin Xiang-Yang Li and Tie-Yan Liu. 2018b. Learning to Teach. In ICLR ."},{"key":"e_1_3_2_1_10_1","volume-title":"Daniel M. Roy, and Michael Carbin.","author":"Frankle Jonathan","year":"2019","unstructured":"Jonathan Frankle , Gintare Karolina Dziugaite , Daniel M. Roy, and Michael Carbin. 2019 . The Lottery Ticket Hypothesis at Scale. CoRR , Vol. abs\/ 1903 .01611 (2019). arxiv: 1903.01611 http:\/\/arxiv.org\/abs\/1903.01611 Jonathan Frankle, Gintare Karolina Dziugaite, Daniel M. Roy, and Michael Carbin. 2019. The Lottery Ticket Hypothesis at Scale. CoRR , Vol. abs\/1903.01611 (2019). arxiv: 1903.01611 http:\/\/arxiv.org\/abs\/1903.01611"},{"key":"e_1_3_2_1_11_1","volume-title":"word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method. CoRR","author":"Goldberg Yoav","year":"2014","unstructured":"Yoav Goldberg and Omer Levy . 2014. word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method. CoRR , Vol. abs\/ 1402 .3722 ( 2014 ). Yoav Goldberg and Omer Levy. 2014. word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method. CoRR , Vol. abs\/1402.3722 (2014)."},{"key":"e_1_3_2_1_12_1","unstructured":"Joshua Goodman. 2001. Classes for Fast Maximum Entropy Training. In ICASSP .  Joshua Goodman. 2001. Classes for Fast Maximum Entropy Training. In ICASSP ."},{"key":"e_1_3_2_1_13_1","volume-title":"Efficient softmax approximation for GPUs. CoRR","author":"Grave Edouard","year":"2016","unstructured":"Edouard Grave , Armand Joulin , Moustapha Ciss\u00e9 , David Grangier , and Herv\u00e9 J\u00e9 gou. 2016. Efficient softmax approximation for GPUs. CoRR , Vol. abs\/ 1609 .04309 ( 2016 ). Edouard Grave, Armand Joulin, Moustapha Ciss\u00e9, David Grangier, and Herv\u00e9 J\u00e9 gou. 2016. Efficient softmax approximation for GPUs. CoRR , Vol. abs\/1609.04309 (2016)."},{"key":"e_1_3_2_1_14_1","unstructured":"Alex Graves Marc G. Bellemare Jacob Menick R\u00e9mi Munos and Koray Kavukcuoglu. 2017. Automated Curriculum Learning for Neural Networks. In ICML (Proceedings of Machine Learning Research). PMLR.  Alex Graves Marc G. Bellemare Jacob Menick R\u00e9mi Munos and Koray Kavukcuoglu. 2017. Automated Curriculum Learning for Neural Networks. In ICML (Proceedings of Machine Learning Research). PMLR."},{"key":"e_1_3_2_1_15_1","volume-title":"On using very large target vocabulary for neural machine translation. arXiv preprint arXiv:1412.2007","author":"Jean S\u00e9bastien","year":"2014","unstructured":"S\u00e9bastien Jean , Kyunghyun Cho , Roland Memisevic , and Yoshua Bengio . 2014. On using very large target vocabulary for neural machine translation. arXiv preprint arXiv:1412.2007 ( 2014 ). S\u00e9bastien Jean, Kyunghyun Cho, Roland Memisevic, and Yoshua Bengio. 2014. On using very large target vocabulary for neural machine translation. arXiv preprint arXiv:1412.2007 (2014)."},{"key":"e_1_3_2_1_16_1","unstructured":"Scott Jordan Daniel Cohen and Philip Thomas. 2018. Using Cumulative Distribution Based Performance Analysis to Benchmark Models. In NeurIPS .  Scott Jordan Daniel Cohen and Philip Thomas. 2018. Using Cumulative Distribution Based Performance Analysis to Benchmark Models. In NeurIPS ."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/3239802"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Bhaskar Mitra Fernando Diaz and Nick Craswell. 2017. Learning to match using local and distributed representations of text for web search. In WWW 17. 1291--1299.  Bhaskar Mitra Fernando Diaz and Nick Craswell. 2017. Learning to match using local and distributed representations of text for web search. In WWW 17. 1291--1299.","DOI":"10.1145\/3038912.3052579"},{"key":"e_1_3_2_1_20_1","unstructured":"Andriy Mnih and Koray Kavukcuoglu. 2013. Learning word embeddings efficiently with noise-contrastive estimation. In Advances in NIPS. 2265--2273.  Andriy Mnih and Koray Kavukcuoglu. 2013. Learning word embeddings efficiently with noise-contrastive estimation. In Advances in NIPS. 2265--2273."},{"volume-title":"Advances in NIPS . Curran Associates","author":"Montufar Guido F","key":"e_1_3_2_1_21_1","unstructured":"Guido F Montufar , Razvan Pascanu , Kyunghyun Cho , and Yoshua Bengio . 2014. On the Number of Linear Regions of Deep Neural Networks . In Advances in NIPS . Curran Associates , Inc . Guido F Montufar, Razvan Pascanu, Kyunghyun Cho, and Yoshua Bengio. 2014. On the Number of Linear Regions of Deep Neural Networks. In Advances in NIPS . Curran Associates, Inc."},{"key":"e_1_3_2_1_22_1","volume-title":"Russell","author":"Ng Andrew Y.","year":"1999","unstructured":"Andrew Y. Ng , Daishi Harada , and Stuart J . Russell . 1999 . Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping. In Proceedings of ICML. Morgan Kaufmann . Andrew Y. Ng, Daishi Harada, and Stuart J. Russell. 1999. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping. In Proceedings of ICML. Morgan Kaufmann."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v30i1.10341"},{"volume-title":"SIGIR (SIGIR '98)","author":"Ponte Jay M.","key":"e_1_3_2_1_24_1","unstructured":"Jay M. Ponte and W. Bruce Croft . 1998. A Language Modeling Approach to Information Retrieval . In SIGIR (SIGIR '98) . ACM , New York, NY, USA , 275--281. Jay M. Ponte and W. Bruce Croft. 1998. A Language Modeling Approach to Information Retrieval. In SIGIR (SIGIR '98). ACM, New York, NY, USA, 275--281."},{"key":"e_1_3_2_1_25_1","volume-title":"Singh","author":"Precup Doina","year":"2000","unstructured":"Doina Precup , Richard S. Sutton , and Satinder P . Singh . 2000 . Eligibility Traces for Off-Policy Policy Evaluation. In Proceedings of ICML . Morgan Kaufmann . Doina Precup, Richard S. Sutton, and Satinder P. Singh. 2000. Eligibility Traces for Off-Policy Policy Evaluation. In Proceedings of ICML . Morgan Kaufmann."},{"key":"e_1_3_2_1_26_1","volume-title":"Learning to Selectively Transfer: Reinforced Transfer Learning for Deep Text Matching. arXiv preprint arXiv:1812.11561","author":"Qu Chen","year":"2018","unstructured":"Chen Qu , Feng Ji , Minghui Qiu , Liu Yang , Zhiyu Min , Haiqing Chen , Jun Huang , and W Bruce Croft . 2018. Learning to Selectively Transfer: Reinforced Transfer Learning for Deep Text Matching. arXiv preprint arXiv:1812.11561 ( 2018 ). Chen Qu, Feng Ji, Minghui Qiu, Liu Yang, Zhiyu Min, Haiqing Chen, Jun Huang, and W Bruce Croft. 2018. Learning to Selectively Transfer: Reinforced Transfer Learning for Deep Text Matching. arXiv preprint arXiv:1812.11561 (2018)."},{"key":"e_1_3_2_1_27_1","unstructured":"Nicholas Roy and Andrew Mccallum. 2001. Toward optimal active learning through monte carlo estimation of error reduction. In ICML .  Nicholas Roy and Andrew Mccallum. 2001. Toward optimal active learning through monte carlo estimation of error reduction. In ICML ."},{"key":"e_1_3_2_1_28_1","volume-title":"Proximal Policy Optimization Algorithms. CoRR","author":"Schulman John","year":"2017","unstructured":"John Schulman , Filip Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . 2017. Proximal Policy Optimization Algorithms. CoRR , Vol. abs\/ 1707 .06347 ( 2017 ). John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. CoRR , Vol. abs\/1707.06347 (2017)."},{"key":"e_1_3_2_1_29_1","unstructured":"Richard Sutton and Andrew Barto. 2016. Reinforcement Learning .MIT.  Richard Sutton and Andrew Barto. 2016. Reinforcement Learning .MIT."},{"volume-title":"Advances in NIPS","author":"Sutton Richard S","key":"e_1_3_2_1_30_1","unstructured":"Richard S Sutton , David A. McAllester , Satinder P. Singh , and Yishay Mansour . 2000. Policy Gradient Methods for Reinforcement Learning with Function Approximation . In Advances in NIPS . MIT Press . Richard S Sutton, David A. McAllester, Satinder P. Singh, and Yishay Mansour. 2000. Policy Gradient Methods for Reinforcement Learning with Function Approximation. In Advances in NIPS . MIT Press."},{"key":"e_1_3_2_1_31_1","volume-title":"ACL 2015","volume":"712","author":"Wang Di","year":"2015","unstructured":"Di Wang and Eric Nyberg . 2015 . A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering. In ACL-IJCNLP , ACL 2015 , July 26 --31 , 2015, Beijing, China, Volume 2: Short Papers . ACL, 707-- 712 . Di Wang and Eric Nyberg. 2015. A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering. In ACL-IJCNLP, ACL 2015, July 26--31, 2015, Beijing, China, Volume 2: Short Papers . ACL, 707--712."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080786"},{"volume-title":"Advances in NIPS","author":"Wang Yu-Xiong","key":"e_1_3_2_1_33_1","unstructured":"Yu-Xiong Wang , Deva Ramanan , and Martial Hebert . 2017a. Learning to Model the Tail . In Advances in NIPS , , I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Inc. , 7029--7039. Yu-Xiong Wang, Deva Ramanan, and Martial Hebert. 2017a. Learning to Model the Tail. In Advances in NIPS , , I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Inc., 7029--7039."},{"volume-title":"Advances in NIPS . Curran Associates","author":"Wang Yu-Xiong","key":"e_1_3_2_1_34_1","unstructured":"Yu-Xiong Wang , Deva Ramanan , and Martial Hebert . 2017b. Learning to Model the Tail . In Advances in NIPS . Curran Associates , Inc ., 7029--7039. Yu-Xiong Wang, Deva Ramanan, and Martial Hebert. 2017b. Learning to Model the Tail. In Advances in NIPS . Curran Associates, Inc., 7029--7039."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00992696"},{"key":"e_1_3_2_1_36_1","unstructured":"Jiawei Wu Lei Li and William Yang Wang. 2018a. Reinforced Co-Training. In NAACL-HLT Marilyn A. Walker Heng Ji and Amanda Stent (Eds.). ACL 1252--1262.  Jiawei Wu Lei Li and William Yang Wang. 2018a. Reinforced Co-Training. In NAACL-HLT Marilyn A. Walker Heng Ji and Amanda Stent (Eds.). ACL 1252--1262."},{"key":"e_1_3_2_1_37_1","volume-title":"abs\/1804.06035","author":"Wu Jiawei","year":"2018","unstructured":"Jiawei Wu , Lei Li , and William Yang Wang . 2018b. Reinforced Co-Training . Co RR , Vol. abs\/1804.06035 ( 2018 ). arxiv: 1804.06035 Jiawei Wu, Lei Li, and William Yang Wang. 2018b. Reinforced Co-Training. CoRR , Vol. abs\/1804.06035 (2018). arxiv: 1804.06035"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Weinan Zhang Tianqi Chen Jun Wang and Yong Yu. 2013. Optimizing top-n collaborative filtering via dynamic negative item sampling. In SIGIR. ACM 785--788.  Weinan Zhang Tianqi Chen Jun Wang and Yong Yu. 2013. Optimizing top-n collaborative filtering via dynamic negative item sampling. In SIGIR. ACM 785--788.","DOI":"10.1145\/2484028.2484126"},{"key":"e_1_3_2_1_39_1","volume-title":"Le","author":"Zoph Barret","year":"2016","unstructured":"Barret Zoph and Quoc V . Le . 2016 . Neural Architecture Search with Reinforcement Learning. CoRR , Vol. abs\/ 1611 .01578 (2016). arxiv: 1611.01578 Barret Zoph and Quoc V. Le. 2016. Neural Architecture Search with Reinforcement Learning. CoRR , Vol. abs\/1611.01578 (2016). arxiv: 1611.01578"}],"event":{"name":"ICTIR '19: The 2019 ACM SIGIR International Conference on the Theory of Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Santa Clara CA USA","acronym":"ICTIR '19"},"container-title":["Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3341981.3344220","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3341981.3344220","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:43:24Z","timestamp":1750207404000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3341981.3344220"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,26]]},"references-count":38,"alternative-id":["10.1145\/3341981.3344220","10.1145\/3341981"],"URL":"https:\/\/doi.org\/10.1145\/3341981.3344220","relation":{},"subject":[],"published":{"date-parts":[[2019,9,26]]},"assertion":[{"value":"2019-09-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}