{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,5]],"date-time":"2025-10-05T16:49:08Z","timestamp":1759682948478},"reference-count":34,"publisher":"Association for Computing Machinery (ACM)","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2008,8]]},"abstract":"<jats:p>Content Management Systems (CMS) store enterprise data such as insurance claims, insurance policies, legal documents, patent applications, or archival data, as in the case of digital libraries. Search over content allows for information retrieval, but does not provide users with great insight into the data. A more analytical view is needed through analysis, aggregations, groupings, trends, pivot tables or charts, and so on. Multidimensional Content eXploration (MCX) is about effectively analyzing and exploring large amounts of content by combining keyword search with OLAP-style aggregation, navigation, and reporting. We focus on unstructured data or, generally speaking, documents or content with limited metadata, as is typically encountered in CMS. We formally present how CMS content and metadata should be organized in a well-defined multidimensional structure, so that sophisticated queries can be expressed and evaluated. The CMS metadata provide traditional OLAP static dimensions that are combined with dynamic dimensions discovered from the analyzed keyword search result, as well as measures for document scores based on the link structure between the documents. In addition, we provide means for multidimensional content exploration through traditional OLAP rollup and drilldown operations on the static and dynamic dimensions, solutions for multi-cube analysis and dynamic navigation of the content. 
We present our prototype, called DBPubs, which stores research publications as documents that can be searched and, most importantly, analyzed and explored. Finally, we present experimental results on the efficiency and effectiveness of our approach.<\/jats:p>","DOI":"10.14778\/1453856.1453929","type":"journal-article","created":{"date-parts":[[2014,6,24]],"date-time":"2014-06-24T12:17:57Z","timestamp":1403612277000},"page":"660-671","source":"Crossref","is-referenced-by-count":29,"title":["Multidimensional content eXploration"],"prefix":"10.14778","volume":"1","author":[{"given":"Alkis","family":"Simitsis","sequence":"first","affiliation":[{"name":"IBM Almaden Research Center, San Jose, CA"}]},{"given":"Akanksha","family":"Baid","sequence":"additional","affiliation":[{"name":"Univ. Wisconsin Madison"}]},{"given":"Yannis","family":"Sismanis","sequence":"additional","affiliation":[{"name":"IBM Almaden Research Center, San Jose, CA"}]},{"given":"Berthold","family":"Reinwald","sequence":"additional","affiliation":[{"name":"IBM Almaden Research Center, San Jose, CA"}]}],"member":"320","published-online":{"date-parts":[[2008,8]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11280-006-0222-z"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.14778\/1454159.1454199"},{"key":"e_1_2_1_3_1","first-page":"564","volume-title":"VLDB","author":"Balmin A.","year":"2004","unstructured":"A. Balmin, V. Hristidis, and Y. Papakonstantinou. Objectrank: Authority-based keyword search in databases. In VLDB, pages 564--575, 2004."},{"key":"e_1_2_1_4_1","first-page":"1410","volume-title":"VLDB","author":"Bansal N.","year":"2007","unstructured":"N. Bansal and N. 
Koudas. Blogscope: A system for online analysis of high volume text streams. In VLDB, pages 1410--1413, 2007."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1066157.1066215"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1247480.1247504"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143859"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0169-7552(98)00110-X"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/248603.248616"},{"key":"e_1_2_1_10_1","unstructured":"CiteSeer. http:\/\/citeseer.ist.psu.edu."},{"key":"e_1_2_1_11_1","unstructured":"DBLP. http:\/\/www.informatik.uni-trier.de\/ley\/db."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"e_1_2_1_13_1","volume-title":"VLDB, page 399","author":"DeRose P.","year":"2007","unstructured":"P. DeRose, W. Shen, F. Chen, A. Doan, and R. Ramakrishnan. Building structured web community portals: A top-down, compositional, and incremental approach. In VLDB, page 399, 2007."},{"key":"e_1_2_1_14_1","unstructured":"J. Diederich. Faceted DBLP http:\/\/dblp.13s.de."},{"key":"e_1_2_1_15_1","unstructured":"Eventseer. http:\/\/eventseer.net."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02163027"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/645481.655593"},{"key":"e_1_2_1_18_1","unstructured":"Harzing. 
Publish or Perish http:\/\/www.harzing.com\/pop.htm."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312649"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1093382.1093388"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00799-004-0094-8"},{"key":"e_1_2_1_22_1","volume-title":"The Data Warehouse Lifecycle Toolkit","author":"Kimball R.","year":"1998","unstructured":"R. Kimball, L. Reeves, M. Ross, and W. Thornthwaite. The Data Warehouse Lifecycle Toolkit. Wiley, 1998."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4379(02)00024-8"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/646497.695623"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/646496.695477"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/1287369.1287400"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/347090.347123"},{"key":"e_1_2_1_28_1","unstructured":"Mondial http:\/\/www.dbis.informatik.uni-goettingen.de\/mondial."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4379(01)00023-0"},{"key":"e_1_2_1_30_1","first-page":"14","volume-title":"SSDBM","author":"Rafanelli M.","year":"1990","unstructured":"M. Rafanelli and A. Shoshani. Storm: A statistical object representation model. In SSDBM, pages 14--29, 1990."},{"key":"e_1_2_1_31_1","volume-title":"EDBT, page 269","author":"Shukla A.","year":"2000","unstructured":"A. Shukla, P. Deshpande, and J. F. Naughton. 
Materialized view selection for multi-cube data models. In EDBT, page 269, 2000."},{"key":"e_1_2_1_32_1","volume-title":"RT-0760","author":"Takuma D.","year":"2007","unstructured":"D. Takuma and I. Yoshida. Top-n keyword calculation on dynamically selected documents. IBM Research Report, RT-0760, October 2007."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/276304.276368"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1247480.1247549"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/1453856.1453929","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T10:59:56Z","timestamp":1672225196000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/1453856.1453929"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,8]]},"references-count":34,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,8]]}},"alternative-id":["10.14778\/1453856.1453929"],"URL":"https:\/\/doi.org\/10.14778\/1453856.1453929","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2008,8]]}}}