{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T12:11:04Z","timestamp":1763467864037,"version":"3.41.0"},"reference-count":16,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2008,1,1]],"date-time":"2008-01-01T00:00:00Z","timestamp":1199145600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGOPS Oper. Syst. Rev."],"published-print":{"date-parts":[[2008,1]]},"abstract":"<jats:p>Mambo [4] is IBM's full-system simulator which models PowerPC systems, and provides a complete set of simulation tools to help IBM and its partners in pre-hardware development and performance evaluation for future systems. Currently Mambo simulates target systems on a single host thread. When the number of cores increases in a target system, Mambo's simulation performance for each core goes down. As the so-called \"multi-core era\" approaches, both target and host systems will have more and more cores. It is very important for Mambo to efficiently simulate a multi-core target system on a multi-core host system. Parallelization is a natural method to speed up Mambo under this situation.<\/jats:p>\n          <jats:p>Parallel Mambo (P-Mambo) is a multi-threaded implementation of Mambo. Mambo's simulation engine is implemented as a user-level thread-scheduler. We propose a multi-scheduler method to adapt Mambo's simulation engine to multi-threaded execution. Based on this method a core-based module partition is proposed to achieve both high inter-scheduler parallelism and low inter-scheduler dependency. Protection of shared resources is crucial to both correctness and performance of P-Mambo. 
Since there are two tiers of threads in P-Mambo, protecting shared resources by only OS-level locks possibly introduces deadlocks due to user-level context switch. We propose a new lock mechanism to handle this problem. Since Mambo is an on-going project with many modules currently under development, co-existence with new modules is also important to P-Mambo. We propose a global-lock-based method to guarantee compatibility of P-Mambo with future Mambo modules.<\/jats:p>\n          <jats:p>We have implemented the first version of P-Mambo in functional modes. The performance of P-Mambo has been evaluated on the OpenMP implementation of NAS Parallel Benchmark (NPB) 3.2 [12]. Preliminary experimental results show that P-Mambo achieves an average speedup of 3.4 on a 4-core host machine.<\/jats:p>","DOI":"10.1145\/1341312.1341325","type":"journal-article","created":{"date-parts":[[2008,2,8]],"date-time":"2008-02-08T15:32:16Z","timestamp":1202484736000},"page":"71-76","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":29,"title":["Parallelization of IBM mambo system simulator in functional modes"],"prefix":"10.1145","volume":"42","author":[{"given":"Kun","family":"Wang","sequence":"first","affiliation":[{"name":"IBM China Research Lab"}]},{"given":"Yu","family":"Zhang","sequence":"additional","affiliation":[{"name":"IBM China Research Lab"}]},{"given":"Huayong","family":"Wang","sequence":"additional","affiliation":[{"name":"IBM China Research Lab"}]},{"given":"Xiaowei","family":"Shen","sequence":"additional","affiliation":[{"name":"IBM China Research Lab"}]}],"member":"320","published-online":{"date-parts":[[2008,1]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/1153925.1154586"},{"volume-title":"Fast and Portable Dynamic Translator. 
USENIX 2005 Annual Technical Conference, FREENIX Track","year":"2005","author":"Bellard F.","key":"e_1_2_1_2_1"},{"journal-title":"Multifacet's General Execution-driven Multiprocessor Simulator (GEMS) Toolset. Computer Architecture News (CAN)","year":"2005","author":"Binkert N. L.","key":"e_1_2_1_3_1"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1054907.1054910"},{"volume-title":"Proceedings of the Fourth LCI International Conference on Linux Clusters: The HPC Revolution 2003","year":"2003","author":"Ceze L.","key":"e_1_2_1_6_1"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/1331699.1331723"},{"volume-title":"The PowerPC Architecture: A Specification for a New Family of Processors","year":"1994","author":"I. Corporation","key":"e_1_2_1_8_1"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/264107.264126"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.982916"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/1266366.1266370"},{"key":"e_1_2_1_12_1","unstructured":"NPB. NAS Parallel Benchmarks. http:\/\/www.nas.nasa.gov\/Resources\/Software\/npb.html."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/88.473612"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.475.0641"},{"volume-title":"Proceedings of ACM\/IEEE Conference on Supercomputing","year":"2002","author":"Team T. 
B.","key":"e_1_2_1_15_1"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1216919.1216936"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/233013.233025"}],"container-title":["ACM SIGOPS Operating Systems Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1341312.1341325","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1341312.1341325","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T13:39:14Z","timestamp":1750253954000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1341312.1341325"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,1]]},"references-count":16,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,1]]}},"alternative-id":["10.1145\/1341312.1341325"],"URL":"https:\/\/doi.org\/10.1145\/1341312.1341325","relation":{},"ISSN":["0163-5980"],"issn-type":[{"type":"print","value":"0163-5980"}],"subject":[],"published":{"date-parts":[[2008,1]]},"assertion":[{"value":"2008-01-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}