{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T18:07:32Z","timestamp":1775844452174,"version":"3.50.1"},"publisher-location":"Cham","reference-count":32,"publisher":"Springer Nature Switzerland","isbn-type":[{"value":"9783031781711","type":"print"},{"value":"9783031781728","type":"electronic"}],"license":[{"start":{"date-parts":[[2024,12,3]],"date-time":"2024-12-03T00:00:00Z","timestamp":1733184000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2024,12,3]],"date-time":"2024-12-03T00:00:00Z","timestamp":1733184000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025]]},"DOI":"10.1007\/978-3-031-78172-8_16","type":"book-chapter","created":{"date-parts":[[2024,12,2]],"date-time":"2024-12-02T09:46:46Z","timestamp":1733132806000},"page":"239-254","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Beyond Labels: Aligning Large Language Models with\u00a0Human-Like Reasoning"],"prefix":"10.1007","author":[{"given":"Muhammad Rafsan","family":"Kabir","sequence":"first","affiliation":[]},{"given":"Rafeed Mohammad","family":"Sultan","sequence":"additional","affiliation":[]},{"given":"Ihsanul Haque","family":"Asif","sequence":"additional","affiliation":[]},{"given":"Jawad Ibn","family":"Ahad","sequence":"additional","affiliation":[]},{"given":"Fuad","family":"Rahman","sequence":"additional","affiliation":[]},{"given":"Mohammad Ruhul","family":"Amin","sequence":"additional","affiliation":[]},{"given":"Nabeel","family":"Mohammed","sequence":"additional","affiliation":[]},{"given":"Shafin","family":"Rahman","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,12,3]]},"reference":[{"key":"16_CR1","unstructured":"Albrecht, J., Kitanidis, E., Fetterman, A.: Despite \u201dsuper-human\u201d performance, current LLMs are unsuited for decisions about ethics and safety. In: NeurIPS ML Safety Workshop (2022)"},{"key":"16_CR2","doi-asserted-by":"crossref","unstructured":"Awasthi, R., et al.: Humanely: Human evaluation of llm yield, using a novel web based evaluation tool. medRxiv, pp. 2023\u201312 (2023)","DOI":"10.1101\/2023.12.22.23300458"},{"key":"16_CR3","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman, L.: Random forests. Mach. Learn. 45, 5\u201332 (2001)","journal-title":"Mach. Learn."},{"key":"16_CR4","doi-asserted-by":"crossref","unstructured":"Chiang, C.H., Lee, H.y.: Can large language models be an alternative to human evaluations? In: Rogers, A., Boyd-Graber, J., Okazaki, N. (eds.) Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 15607\u201315631. Association for Computational Linguistics, Toronto, Canada (Jul 2023)","DOI":"10.18653\/v1\/2023.acl-long.870"},{"key":"16_CR5","unstructured":"cjadams, Borkan, D., inversion, Sorensen, J., Dixon, L., Vasserman, L., nithum: Jigsaw unintended bias in toxicity classification (2019). https:\/\/kaggle.com\/competitions\/jigsaw-unintended-bias-in-toxicity-classification"},{"key":"16_CR6","unstructured":"Dettmers, T., Pagnoni, A., Holtzman, A., Zettlemoyer, L.: Qlora: efficient finetuning of quantized llms. Adv. Neural Inform. Process. Syst. 36 (2024)"},{"key":"16_CR7","unstructured":"Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171\u20134186. Association for Computational Linguistics, Minneapolis, Minnesota (Jun 2019)"},{"key":"16_CR8","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/j.neunet.2017.12.012","volume":"107","author":"S Elfwing","year":"2018","unstructured":"Elfwing, S., Uchibe, E., Doya, K.: Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. Neural Netw. 107, 3\u201311 (2018)","journal-title":"Neural Netw."},{"key":"16_CR9","doi-asserted-by":"crossref","unstructured":"Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Annals Stat., 1189\u20131232 (2001)","DOI":"10.1214\/aos\/1013203451"},{"issue":"3","key":"16_CR10","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1007\/s11023-020-09539-2","volume":"30","author":"I Gabriel","year":"2020","unstructured":"Gabriel, I.: Artificial intelligence, values, and alignment. Mind. Mach. 30(3), 411\u2013437 (2020)","journal-title":"Mind. Mach."},{"key":"16_CR11","unstructured":"Hendrycks, D., et al.: Aligning ai with shared human values. In: International Conference on Learning Representations (2021)"},{"key":"16_CR12","unstructured":"Hendrycks, D., et al.: Measuring massive multitask language understanding. Proceedings of the International Conference on Learning Representations (ICLR) (2021)"},{"issue":"6245","key":"16_CR13","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1126\/science.aaa8685","volume":"349","author":"J Hirschberg","year":"2015","unstructured":"Hirschberg, J., Manning, C.D.: Advances in natural language processing. Science 349(6245), 261\u2013266 (2015)","journal-title":"Science"},{"key":"16_CR14","unstructured":"Jiang, A.Q., et\u00a0al.: Mistral 7b. arXiv preprint arXiv:2310.06825 (2023)"},{"issue":"1","key":"16_CR15","first-page":"1","volume":"8","author":"BY Kasula","year":"2016","unstructured":"Kasula, B.Y.: Advancements and applications of artificial intelligence: a comprehensive review. Inter. J. Stat. Comput. Simulat. 8(1), 1\u20137 (2016)","journal-title":"Inter. J. Stat. Comput. Simulat."},{"issue":"3","key":"16_CR16","doi-asserted-by":"publisher","first-page":"3713","DOI":"10.1007\/s11042-022-13428-4","volume":"82","author":"D Khurana","year":"2023","unstructured":"Khurana, D., Koli, A., Khatter, K., Singh, S.: Natural language processing: State of the art, current trends and challenges. Multimedia Tools Appli. 82(3), 3713\u20133744 (2023)","journal-title":"Multimedia Tools Appli."},{"key":"16_CR17","unstructured":"Kleinbaum, D.G., Dietz, K., Gail, M., Klein, M., Klein, M.: Logistic regression. Springer (2002)"},{"key":"16_CR18","doi-asserted-by":"crossref","unstructured":"Li, Y., et al.: Making language models better reasoners with step-aware verifier. In: Rogers, A., Boyd-Graber, J., Okazaki, N. (eds.) Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 5315\u20135333. Association for Computational Linguistics, Toronto, Canada (Jul 2023)","DOI":"10.18653\/v1\/2023.acl-long.291"},{"issue":"6","key":"16_CR19","doi-asserted-by":"publisher","first-page":"4663","DOI":"10.1007\/s40747-021-00608-2","volume":"8","author":"I Mollas","year":"2022","unstructured":"Mollas, I., Chrysopoulou, Z., Karlos, S., Tsoumakas, G.: Ethos: a multi-label hate speech detection dataset. Complex Intell. Syst. 8(6), 4663\u20134678 (2022)","journal-title":"Complex Intell. Syst."},{"key":"16_CR20","first-page":"27730","volume":"35","author":"L Ouyang","year":"2022","unstructured":"Ouyang, L., et al.: Training language models to follow instructions with human feedback. Adv. Neural. Inf. Process. Syst. 35, 27730\u201327744 (2022)","journal-title":"Adv. Neural. Inf. Process. Syst."},{"key":"16_CR21","doi-asserted-by":"crossref","unstructured":"Rana, S.: Exploring the advancements and ramifications of artificial intelligence. J. Artifi. Intell. General Sci. (JAIGS) 2(1), 30\u201335 (2024), ISSN: 3006-4023","DOI":"10.60087\/jaigs.v2i1.p35"},{"key":"16_CR22","doi-asserted-by":"crossref","unstructured":"Renze, M., Guven, E.: The effect of sampling temperature on problem solving in large language models. arXiv preprint arXiv:2402.05201 (2024)","DOI":"10.18653\/v1\/2024.findings-emnlp.432"},{"key":"16_CR23","unstructured":"Sanh, V., Debut, L., Chaumond, J., Wolf, T.: Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108 (2019)"},{"key":"16_CR24","unstructured":"Shazeer, N.: Glu variants improve transformer. arXiv preprint arXiv:2002.05202 (2020)"},{"key":"16_CR25","doi-asserted-by":"crossref","unstructured":"Suthaharan, S., Suthaharan, S.: Support vector machine. Machine learning models and algorithms for big data classification: thinking with examples for effective learning, pp. 207\u2013235 (2016)","DOI":"10.1007\/978-1-4899-7641-3_9"},{"key":"16_CR26","unstructured":"Touvron, H., et\u00a0al.: Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288 (2023)"},{"key":"16_CR27","doi-asserted-by":"crossref","unstructured":"Wang, P., et al.: Making large language models better reasoners with alignment (2024)","DOI":"10.18653\/v1\/2023.findings-emnlp.167"},{"key":"16_CR28","unstructured":"Wang, Y., et al.: Aligning large language models with human: A survey. arXiv preprint arXiv:2307.12966 (2023)"},{"key":"16_CR29","first-page":"24824","volume":"35","author":"J Wei","year":"2022","unstructured":"Wei, J., et al.: Chain-of-thought prompting elicits reasoning in large language models. Adv. Neural. Inf. Process. Syst. 35, 24824\u201324837 (2022)","journal-title":"Adv. Neural. Inf. Process. Syst."},{"key":"16_CR30","unstructured":"Weidinger, L., et\u00a0al.: Ethical and social risks of harm from language models. arXiv preprint arXiv:2112.04359 (2021)"},{"key":"16_CR31","unstructured":"Yuan, H., Yuan, Z., Tan, C., Wang, W., Huang, S., Huang, F.: Rrhf: rank responses to align language models with human feedback. Adv. Neural Inform. Process. Syst. 36 (2024)"},{"key":"16_CR32","first-page":"1","volume":"4","author":"E Yudkowsky","year":"2016","unstructured":"Yudkowsky, E.: The ai alignment problem: why it is hard, and where to start. Symbolic Syst. Distinguished Speaker 4, 1 (2016)","journal-title":"Symbolic Syst. Distinguished Speaker"}],"container-title":["Lecture Notes in Computer Science","Pattern Recognition"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-78172-8_16","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,2]],"date-time":"2024-12-02T10:06:58Z","timestamp":1733134018000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-78172-8_16"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,3]]},"ISBN":["9783031781711","9783031781728"],"references-count":32,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-78172-8_16","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"value":"0302-9743","type":"print"},{"value":"1611-3349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,3]]},"assertion":[{"value":"3 December 2024","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"We take ethical considerations very seriously in this study, which involves generating ethical reasoning using LLMs and their evaluations by humans. We recruited five human evaluators from diverse demographics on a voluntary basis. Importantly, no sensitive information was collected from the evaluators; only the necessary details to assess their suitability for the task were collected, with any potentially identifying data deleted post-evaluation. Additionally, we ensured that the work would not cause any harm to the evaluators, either physically or mentally.The data from the publicly available ETHOS dataset [] may contain some abusive language, which could potentially make some evaluators uncomfortable. We implemented strict safety protocols to ensure the LLMs did not produce harmful or abusive content. Moreover, we reject any attempts to insult or demean any race, acknowledging that gender and race are social constructs that warrant respect. Therefore, we believe that our work will not cause any ethical issues.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical Statement"}},{"value":"ICPR","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Conference on Pattern Recognition","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Kolkata","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"India","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2024","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"1 December 2024","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"5 December 2024","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"27","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"icpr2024","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/icpr2024.org\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}}]}}