{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T13:43:46Z","timestamp":1760708626275,"version":"3.41.2"},"reference-count":31,"publisher":"Emerald","issue":"6","license":[{"start":{"date-parts":[[2013,10,14]],"date-time":"2013-10-14T00:00:00Z","timestamp":1381708800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,10,14]]},"abstract":"<jats:sec>\n               <jats:title content-type=\"abstract-heading\">Purpose<\/jats:title>\n               <jats:p> \u2013 This study aims to examine manually formulated queries and automatic query generation in an early phase of a patent \u201cprior art\u201d search. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Design\/methodology\/approach<\/jats:title>\n               <jats:p> \u2013 The study was performed partly within a patent domain setting, involving three professional patent examiners, and partly in the context of the CLEF 2009 Intellectual Property (CLEF-IP) track. For the exploratory study of user-based query formulation, three patent examiners performed the same three simulated real-life patent tasks. For the automatic query generation, a simple term-weighting algorithm based on the RATF formula was used. The manually and automatically created queries were compared to analyse what kinds of keywords and from which parts of the patent documents were selected. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Findings<\/jats:title>\n               <jats:p> \u2013 For user-formulated queries, it was found that patent documents were read in a specific order of importance and that the time varied. Annotations and collaboration were made while reading and selecting\/ranking terms. Ranking terms was experienced to be harder than selecting terms. For the automatic formulated queries, it was found that the term frequencies used in the RATF alone will not quite approximate what terms will be judged as relevant query terms by the users. Simultaneously, the results suggest that developing a query generation tool for generating initial queries based on patent documents is feasible. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Research limitations\/implications<\/jats:title>\n               <jats:p> \u2013 These preliminary but informative results need to be viewed in the light that only three patent experts were observed and that a small set of topics was used. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Originality\/value<\/jats:title>\n               <jats:p> \u2013 It is usually difficult to get access to the setting of the patent domain and the results of the study show that the methodology provided a feasible way to study manual and the manual query formulation of the patent engineer.<\/jats:p>\n            <\/jats:sec>","DOI":"10.1108\/jd-12-2012-0166","type":"journal-article","created":{"date-parts":[[2013,10,8]],"date-time":"2013-10-08T07:32:18Z","timestamp":1381217538000},"page":"873-898","source":"Crossref","is-referenced-by-count":5,"title":["Exploring manual and automatic query formulation in patent IR"],"prefix":"10.1108","volume":"69","author":[{"given":"Preben","family":"Hansen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anni","family":"J\u00e4rvelin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Antti","family":"J\u00e4rvelin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","reference":[{"key":"key2022031020364574000_b1","unstructured":"Aula, A.\n                (2003), \u201cQuery formulation in web information search\u201d, Proceedings of the ICWE 2003 Conference, Algarve, November, 2003, IADIS, pp. 403-410."},{"key":"key2022031020364574000_b2","doi-asserted-by":"crossref","unstructured":"Belkin, N.J.\n               , \n                  Kelly, D.\n               , \n                  Kim, G.\n               , \n                  Kim, J.-Y.\n               , \n                  Lee, H.-J.\n               , \n                  Muresan, G.\n               , \n                  Tang, M.-C.\n               , \n                  Yuan, X.-J.\n                and \n                  Cool, C.\n                (2003), \u201cQuery length in interactive information retrieval\u201d, Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, pp. 205-212.","DOI":"10.1145\/860435.860474"},{"key":"key2022031020364574000_b3","doi-asserted-by":"crossref","unstructured":"Bonino, D.\n               , \n                  Corno, F.\n                and \n                  Ciaramella, A.\n                (2010), \u201cReview of the state-of-the-art in patent information and forthcoming evolutions in intelligent patent informatics\u201d, World Patent Information, Vol. 32 No. 1, pp. 30-38.","DOI":"10.1016\/j.wpi.2009.05.008"},{"key":"key2022031020364574000_b4","unstructured":"EPO\n                (2008), \u201cWhat is prior art?\u201d, available at: www.epo.org\/topics\/innovation-and-economy\/handbook\/novelty\/prior-art.html (accessed 20 January 2010)."},{"key":"key2022031020364574000_b5","doi-asserted-by":"crossref","unstructured":"Fujii, A.\n               , \n                  Iwayama, M.\n                and \n                  Kando, N.\n                (2004), \u201cOverview of patent retrieval task at NTCIR-4\u201d, Proceedings of the NTCIR-4, Tokyo, April 2003-June 2004.","DOI":"10.3115\/1119303.1119306"},{"key":"key2022031020364574000_b7","unstructured":"Hansen, P.\n                (2011), \u201cTask-based information seeking and retrieval in the patent domain, processes and relationships\u201d, PhD thesis, Acta Universitatis Tamperensis 1631."},{"key":"key2022031020364574000_b8","unstructured":"Hansen, P.\n                and \n                  J\u00e4rvelin, K.\n                (2000), \u201cThe information seeking and retrieval process at the Swedish Patent- and Registration Office. Moving from lab-based to real life work-task environment\u201d, Proceedings of the ACM-SIGIR 2000 Workshop on Patent Retrieval, Athens, July 28, 2000, pp. 43-53."},{"key":"key2022031020364574000_b6","doi-asserted-by":"crossref","unstructured":"Hansen, P.\n                and \n                  J\u00e4rvelin, K.\n                (2005), \u201cCollaborative information retrieval in an information-intensive domain\u201d, Information Processing and Management, Vol. 41 No. 5, September, pp. 1101-1119.","DOI":"10.1016\/j.ipm.2004.04.016"},{"key":"key2022031020364574000_b9","doi-asserted-by":"crossref","unstructured":"Hearst, M.\n                (2009), Search User Interfaces, University Press, Cambridge.","DOI":"10.1017\/CBO9781139644082"},{"key":"key2022031020364574000_b10","doi-asserted-by":"crossref","unstructured":"Hsieh-yee, I.\n                (1993), \u201cEffects of search experience and subject knowledge on the search tactics of novice and experienced searchers\u201d, Journal of the American Society for Information Science, Vol. 44 No. 3, pp. 161-174.","DOI":"10.1002\/(SICI)1097-4571(199304)44:3<161::AID-ASI5>3.0.CO;2-8"},{"key":"key2022031020364574000_b11","unstructured":"Ingwersen, P.\n                and \n                  J\u00e4rvelin, K.\n                (2005), The Turn. Integration of Information Seeking and Retrieval in Context, Springer, Dordrecht."},{"key":"key2022031020364574000_b12","doi-asserted-by":"crossref","unstructured":"Joho, H.\n               , \n                  Azzopardi, L.\n                and \n                  Vanderbeuwhede, W.\n                (2010), \u201cA survey of patent users: an analysis of tasks, behaviour, search functionality and system requirements\u201d, in \n                  Belkin, N.\n                and \n                  Kelly, D.\n                (Eds), Proceedings of the Third Information Interaction in Context Symposium (IiiX 2010), New Brunswick, NJ, pp. 13-24.","DOI":"10.1145\/1840784.1840789"},{"key":"key2022031020364574000_b13","doi-asserted-by":"crossref","unstructured":"J\u00e4rvelin, A.\n               , \n                  J\u00e4rvelin, A.\n                and \n                  Hansen, P.\n                (2010), \u201cUTA and SICS at CLEF-IP\u201d, Lecture Notes in Computer Science, 2010, Volume 6241, Multilingual Information Access Evaluation I. Text Retrieval Experiments, pp. 460-467.","DOI":"10.1007\/978-3-642-15754-7_55"},{"key":"key2022031020364574000_b14","doi-asserted-by":"crossref","unstructured":"Kek\u00e4l\u00e4inen, J.\n                and \n                  J\u00e4rvelin, K.\n                (2002), \u201cUsing graded relevance assessments in IR evaluation\u201d, ACM TOIS, Vol. 53 No. 13, pp. 1120-1129.","DOI":"10.1002\/asi.10137"},{"key":"key2022031020364574000_b15","unstructured":"Kuhlthau, C.\n                (1993), Seeking Meaning. A Process Approach to Library and Information Services, Ablex Publications, New York, NY."},{"key":"key2022031020364574000_b16","doi-asserted-by":"crossref","unstructured":"Larkey, L.\n                (1999), \u201cA patent search and classification system\u201d, Proceedings of the Digital Libraries 99 \u2013 The Fourth ACM Conference on Digital Libraries, Berkeley, CA, August 11-14 1999), ACM Press, New York, NY, pp. 79-87.","DOI":"10.1145\/313238.313304"},{"key":"key2022031020364574000_b17","unstructured":"Leong, M-K.\n                and \n                  Kando, N.\n                (2000), ACM-SIGIR Workshop on Patent Retrieval, Athens, Greece, July 2000."},{"key":"key2022031020364574000_b18","doi-asserted-by":"crossref","unstructured":"Lupu, M.\n               , \n                  Huang, J.\n                and \n                  Zhu, J.\n                (2011), \u201cEvaluation of chemical information retrieval tools\u201d, in \n                  Lupu, M.\n                \n               et al. (Eds), Current Challenges in Patent Information Retrieval, Springer, Berlin, pp. 109-124.","DOI":"10.1007\/978-3-642-19231-9_5"},{"key":"key2022031020364574000_b19","doi-asserted-by":"crossref","unstructured":"Mase, H.\n               , \n                  Matsubayashi, T.\n               , \n                  Ogawa, Y.\n                and \n                  Iwayama, M.\n                (2005), \u201cProposal of two-stage patent retrieval method considering the claim structure\u201d, ACM Transactions on Asian Language Information Processing, Vol. 4 No. 2, pp. 186-202.","DOI":"10.1145\/1105696.1105702"},{"key":"key2022031020364574000_b20","unstructured":"Pirkola, A.\n               , \n                  Lepp\u00e4nen, E.\n                and \n                  J\u00e4rvelin, K.\n                (2002), \u201cThe RATF formula (Kwok's formula): exploiting average term frequency in cross-language retrieval\u201d, Information Research, Vol. 7 No. 2."},{"key":"key2022031020364574000_b21","doi-asserted-by":"crossref","unstructured":"Roda, G.\n               , \n                  Tait, J.\n               , \n                  Piroi, F.\n                and \n                  Zenz, V.\n                (2009), \u201cCLEF-IP 2009: retrieval experiments in the Intellectual Property domain\u201d, CLEF working notes 2009, Corfu.","DOI":"10.1007\/978-3-642-15754-7_47"},{"key":"key2022031020364574000_b22","doi-asserted-by":"crossref","unstructured":"Roda, G.\n               , \n                  Tait, J.\n               , \n                  Piroi, F.\n                and \n                  Zenz, V.\n                (2010), \u201cCLEF-IP 2009: Retrieval experiments in the Intellectual Property domains\u201d, Lecture Notes in Computer Science, Vol. 6241, pp. 385-409.","DOI":"10.1007\/978-3-642-15754-7_47"},{"key":"key2022031020364574000_b23","unstructured":"Strohman, T.\n               , \n                  Metzler, D.\n               , \n                  Turtle, H.\n                and \n                  Croft, W.B.\n                (2005), \u201cIndri: a language-model based search engine for complex queries\u201d, Proceedings of the International Conference on Intelligence Analysis."},{"key":"key2022031020364574000_b24","doi-asserted-by":"crossref","unstructured":"Tait, J.L.\n                and \n                  Diallo, B.\n                (2011), \u201cFuture patent search\u201d, in \n                  Lupu, M.\n                \n               et al. (Eds), Current Challenges in Patent Information Retrieval, Springer, Berlin, pp. 389-407.","DOI":"10.1007\/978-3-642-19231-9_20"},{"key":"key2022031020364574000_b25","doi-asserted-by":"crossref","unstructured":"Trippe, A.\n                and \n                  Ruthven, I.\n                (2011), \u201cEvaluating real patent retrieval effectiveness\u201d, in \n                  Lupu, M.\n                \n               et al. (Eds), Current Challenges in Patent Information Retrieval, Springer, Berlin, pp. 125-141.","DOI":"10.1007\/978-3-642-19231-9_6"},{"key":"key2022031020364574000_b28","doi-asserted-by":"crossref","unstructured":"Vakkari, P.\n                (1999), \u201cTask complexity, problem structure and information actions: integrating studies on information seeking and retrieval\u201d, Information Processing & Management, Vol. 35 No. 6, pp. 819-837.","DOI":"10.1016\/S0306-4573(99)00028-X"},{"key":"key2022031020364574000_b27","doi-asserted-by":"crossref","unstructured":"Vakkari, P.\n                (2003), \u201cTask-based information searching\u201d, Annual Review of Information Science and Technology, Vol. 37, pp. 413-464.","DOI":"10.1002\/aris.1440370110"},{"key":"key2022031020364574000_b29","doi-asserted-by":"crossref","unstructured":"Wanner, L.\n               , \n                  Baeza-Yates, R.\n               , \n                  Brugmann, S.\n               , \n                  Codina, J.\n               , \n                  Diallo, B.\n               , \n                  Escorsa, E.\n               , \n                  Giereth, M.\n               , \n                  Kompatsiaris, Y.\n               , \n                  Papadopoulos, S.\n               , \n                  Pianta, E.\n               , \n                  Piella, G.\n               , \n                  Puhlmann, I.\n               , \n                  Rao, G.\n               , \n                  Rotard, M.\n               , \n                  Schoester, P.\n               , \n                  Serafini, L.\n                and \n                  Vasiliki, Z.\n                (2008), \u201cTowards content-oriented patent document processing\u201d, World Patent Information, Vol. 30 No. 1, pp. 21-33.","DOI":"10.1016\/j.wpi.2007.03.008"},{"key":"key2022031020364574000_b30","doi-asserted-by":"crossref","unstructured":"Wilkins, P.\n               , \n                  Ferguson, P.\n                and \n                  Smeaton, A.F.\n                (2006), \u201cUsing score distributions for query-time fusion in multimedia retrieval\u201d, MIR 2006 \u2013 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, ACM Press, New York. NY, pp. 51-60.","DOI":"10.1145\/1178677.1178687"},{"key":"key2022031020364574000_b31","doi-asserted-by":"crossref","unstructured":"Wilson, T.D.\n                (1981), \u201cOn user studies and information needs\u201d, Journal of Documentation, Vol. 37 No. 1, pp. 3-15.","DOI":"10.1108\/eb026702"},{"key":"key2022031020364574000_frg1","unstructured":"WIPO, IFIA & BUD\n                (1998), Patent Information in Support of Inventive and Innovative Activities: General Introduction, WIPO\/IFIABUD\/98\/2, March 1998, available at: www.wipo.int\/edocs\/mdocs\/innovation\/en\/wipo_ifia_bud_98\/wipo_ifia_bud_98_2.doc (accessed 14 December 2011)."}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/JD-12-2012-0166","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-12-2012-0166\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-12-2012-0166\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:35:33Z","timestamp":1753396533000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/69\/6\/873-898\/220081"}},"subtitle":["Initial query construction and query generation process"],"short-title":[],"issued":{"date-parts":[[2013,10,14]]},"references-count":31,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2013,10,14]]}},"alternative-id":["10.1108\/JD-12-2012-0166"],"URL":"https:\/\/doi.org\/10.1108\/jd-12-2012-0166","relation":{},"ISSN":["0022-0418"],"issn-type":[{"type":"print","value":"0022-0418"}],"subject":[],"published":{"date-parts":[[2013,10,14]]}}}