{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T02:34:27Z","timestamp":1768876467220,"version":"3.49.0"},"reference-count":73,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2006,9,1]],"date-time":"2006-09-01T00:00:00Z","timestamp":1157068800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J Comput Sci Technol"],"published-print":{"date-parts":[[2006,9]]},"DOI":"10.1007\/s11390-006-0674-8","type":"journal-article","created":{"date-parts":[[2006,10,15]],"date-time":"2006-10-15T10:38:59Z","timestamp":1160908739000},"page":"674-681","source":"Crossref","is-referenced-by-count":9,"title":["Progress and Challenges in High Performance Computer Technology"],"prefix":"10.1007","volume":"21","author":[{"given":"Xue-Jun","family":"Yang","sequence":"first","affiliation":[]},{"given":"Yong","family":"Dou","sequence":"additional","affiliation":[]},{"given":"Qing-Feng","family":"Hu","sequence":"additional","affiliation":[]}],"member":"297","reference":[{"key":"674_CR1","unstructured":"Susan L Graham, Marc Snir, Cynthia A Patterson. Getting up to speed: The future of supercomputing. Committee on the Future of Supercomputing, National Research Council."},{"key":"674_CR2","unstructured":"PITAC report. Computational science: Ensuring America\u2019s competitiveness. http:\/\/www.nitrd.gov\/pitac\/reports\/20050609_computational\/computational.pdf"},{"key":"674_CR3","unstructured":"Cray History. http:\/\/www.cray.com\/about_cray\/history.html ."},{"key":"674_CR4","unstructured":"CM-5 at UC Berkeley. http:\/\/www.eecs.berkeley.edu\/Resea-rch\/Projects\/CS\/parallel\/cm5.html ."},{"key":"674_CR5","unstructured":"The development road of Chinese supercomputer. http:\/\/www.dawning.com.cn\/4000A\/test_gx_1.htm . 
(in Chinese)"},{"key":"674_CR6","unstructured":"http:\/\/www.nti.org\/e_research\/profiles\/China\/Chemical\/in-dex.html ."},{"key":"674_CR7","unstructured":"ASCI Red SiteMap. http:\/\/www.sandia.gov\/ASCI\/Red\/Site-Map.htm ."},{"key":"674_CR8","unstructured":"http:\/\/www.top500.org\/ ."},{"key":"674_CR9","unstructured":"The earth simulator center. http:\/\/www.es.jamstec.go.jp\/ ."},{"key":"674_CR10","unstructured":"BlueGene. http:\/\/www.research.ibm.com\/bluegene\/ ."},{"issue":"5","key":"674_CR11","doi-asserted-by":"crossref","first-page":"532","DOI":"10.1145\/42411.42415","volume":"31","author":"John L. Gustafson","year":"May 1988","unstructured":"John L. Gustafson. Reevaluating Amdahl\u2019s law. Communications of the ACM, May 1988, 31(5): 532\u2013533.","journal-title":"Communications of the ACM"},{"key":"674_CR12","doi-asserted-by":"crossref","unstructured":"David Culler, Richard Karp, David Patterson et al. LogP: Towards a realistic model of parallel computation. In Proc. the 4th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, New York, ACM Press, 1993, pp. 1\u201312.","DOI":"10.1145\/173284.155333"},{"key":"674_CR13","volume-title":"Parallel programming in C with MPI and OpenMP","author":"J Quinn Michael","year":"May 2003","unstructured":"Michael J Quinn. Parallel programming in C with MPI and OpenMP. USA: McGraw-Hill, May 2003."},{"key":"674_CR14","volume-title":"CMOS VLSI design: A Circuits and Systems Perspective","author":"Neil H E Weste","year":"May 2004","unstructured":"Neil H E Weste, David Harris. CMOS VLSI design: A Circuits and Systems Perspective. 3rd Edition, USA: Addison-Wesley, May 2004.","edition":"3"},{"key":"674_CR15","unstructured":"Jose Duato, Sudhakar Yalamanchili, Lionel Ni. Interconnection Networks: An Engineering Approach. 2nd Edition, Morgan Kaufmann Publishers, 2002."},{"key":"674_CR16","unstructured":"Rajkumar Buyya. High Performance Cluster Computing Architectures and Systems, Volume 1. 
Prentice Hall, May 1999."},{"key":"674_CR17","unstructured":"Scientific computing and visualization. http:\/\/scv.bu.edu\/ ."},{"key":"674_CR18","doi-asserted-by":"crossref","unstructured":"Francine Berman, Geoffrey C Fox, Tony Hey. Grid Computing: Making the Global Infrastructure a Reality. John Wiley and Sons, May 2003.","DOI":"10.1002\/0470867167"},{"key":"674_CR19","unstructured":"Xubang Shen, Xixin Cao. The selection of models for LS MPP. Chinese Journal of Computers, 1997, 20(5): 385\u2013390. (in Chinese)"},{"key":"674_CR20","unstructured":"Li Li, Xubang Shen. The design of LS SIMD array microprocessor control logic. Chinese Journal of Computers, 2000, 23(5): 557\u2013560. (in Chinese)"},{"key":"674_CR21","unstructured":"Caoyang Chen, Zhong Wang, Xubang Shen et al. The LS MPP parallel image processor. Chinese Journal of Computers, 2002, 25(3): 292\u2013296. (in Chinese)"},{"key":"674_CR22","unstructured":"Ying Zhang, Wei Huang, Qunsheng Ma, Sanli Li. The design and implementation of hierarchical parallel system: MP860 supercomputer. Chinese Journal of Computers, 1998, 21(z1): 230\u2013236. (in Chinese)"},{"key":"674_CR23","unstructured":"Ling Qiao, Zhizhong Tang, Hongbo Rong, Chihong Zhang. The model of instruction level parallel program execution. Chinese Journal of Computers, 1999, 22(5): 476\u2013480. (in Chinese)"},{"key":"674_CR24","unstructured":"Gang Xiao, Xingming Zhou, Ming Xu, Kun Deng. SMA: A speculative multithreaded architecture. Chinese Journal of Computers, 1999, 22(6): 582\u2013590. (in Chinese)"},{"key":"674_CR25","unstructured":"Yunquan Zhang. DRAM(h): A parallel computation model for high performance numerical computing. Chinese Journal of Computers, 2003, 26(12): 1660\u20131670. (in Chinese)"},{"key":"674_CR26","unstructured":"Weiwu Hu, Peisu Xia. Out-of-order execution in sequentially consistent shared memory systems: Principles. Chinese Journal of Computers, 1997, 20(6): 481\u2013490. 
(in Chinese)"},{"key":"674_CR27","unstructured":"Weiwu Hu, Peisu Xia. Out-of-order execution in sequentially consistent shared memory systems: Simulation results. Chinese Journal of Computers, 1997, 20(6): 491\u2013500. (in Chinese)"},{"key":"674_CR28","unstructured":"Xianghui Xie, Chengde Han, Zhimin Tang. Data pre sending technique in distributed shared memory systems. Chinese Journal of Computers, 1999, 22(3): 241\u2013248. (in Chinese)"},{"key":"674_CR29","unstructured":"Weiwu Hu, Weisong Shi, Zhimin Tang. A software DSM system based on a new cache coherence protocol. Chinese Journal of Computers, 1999, 22(5): 467\u2013475. (in Chinese)"},{"issue":"2","key":"674_CR30","first-page":"351","volume":"40","author":"Dai Huadong","year":"2003","unstructured":"Huadong Dai, Xuejun Yang. An operating system-centric memory consistency model \u2014 Thread consistency model. Journal of Computer Research and Development, 2003, 40(2): 351\u2013359.","journal-title":"Journal of Computer Research and Development"},{"issue":"2","key":"674_CR31","first-page":"81","volume":"8","author":"Dou Yong","year":"1997","unstructured":"Yong Dou, Xingming Zhou. A software controlled data prefetching scheme based on weak order consistency model. Journal of Software, 1997, 8(2): 81\u201386.","journal-title":"Journal of Software"},{"key":"674_CR32","unstructured":"Rong Zeng, Xiangjun Dong, Mingfa Zhu. Wormhole routing and its chip design. Chinese Journal of Computers, 1997, 20(5): 404\u2013411. (in Chinese)"},{"key":"674_CR33","unstructured":"Feng Gao, Zhongcheng Li, Yinghua Min, Jie Wu. A fault-tolerant routing strategy based on extended safety vectors in hypercube multicomputers. Chinese Journal of Computers, 2000, 23(3): 248\u2013254. (in Chinese)"},{"key":"674_CR34","unstructured":"Jianfeng Wu, Shanli Li, Yi Ge. Message memory network interface design in network parallel computing. Chinese Journal of Computers, 2000, 23(2): 195\u2013201. 
(in Chinese)"},{"key":"674_CR35","unstructured":"Jun Shen, Weimin Zheng, Dapeng Ju. FMP: A fast message passing for workstation clusters. Chinese Journal of Computers, 1998, 21(7): 595\u2013602. (in Chinese)"},{"issue":"10","key":"674_CR36","first-page":"1562","volume":"12","author":"Chen Zuo-ning","year":"2001","unstructured":"Zuo-ning Chen, Yi-lian Jin. A parallel operating system based on multi-virtual-space and multi-mapping technology. Journal of Software, 2001, 12(10): 1562\u20131568.","journal-title":"Journal of Software"},{"key":"674_CR37","unstructured":"Ning-Hui Sun, Zhi-wei Xu. Design of system software of Dawning\/2000 supercomputer. Chinese Journal of Computers, 2000, 23(1): 9\u201320. (in Chinese)"},{"key":"674_CR38","doi-asserted-by":"crossref","unstructured":"Dan Meng, Jian-feng Zhan, Lei Wang et al. Fully integrated cluster operating system: Phoenix. Journal of Computer Research and Development, 2005, 42(6): 979\u2013986. (in Chinese)","DOI":"10.1360\/crad20050612"},{"key":"674_CR39","unstructured":"Hua-ping Chen, Liu-sheng Huang. Processor selection policy in heuristic task scheduling. Journal of Software, 1999, 10(11): 1194\u20131198. (in Chinese)"},{"key":"674_CR40","unstructured":"Jin-gui Huang, Jian-er Chen, Song-qiao Chen. Parallel-job scheduling on cluster computing systems. Chinese Journal of Computers, 2004, 27(6): 765\u2013771. (in Chinese)"},{"key":"674_CR41","doi-asserted-by":"crossref","unstructured":"Qing-hua Li, Jian-jun Han, Abbas A Essa. A fast and effective static task scheduling algorithm in homogeneous computing environments. Journal of Computer Research and Development, 2005, 42(1): 118\u2013125. (in Chinese)","DOI":"10.1360\/crad20050116"},{"key":"674_CR42","unstructured":"Qiang Fu, Wei-min Zheng. A dynamic task scheduling method in cluster of workstations. Journal of Software, 1999, 10(1): 19\u201323. (in Chinese)"},{"key":"674_CR43","unstructured":"Hao Huang, Jian-cheng Du, Dao-xu Chen, Li Xie. 
Optimum degree of parallelism-based task dependence graph scheduling scheme. Journal of Software, 1999, 10(10): 1038\u20131046. (in Chinese)"},{"key":"674_CR44","unstructured":"Zhou Lei, Zhi-wei Xu, Ming-fa Zhu. A new adaptive processor allocation algorithm for cluster: Limited load balancing allocation (LLBA). Chinese Journal of Computers, 1999, 22(8): 877\u2013881. (in Chinese)"},{"key":"674_CR45","unstructured":"Nong Xiao, Yu-tong Lu, Xi-cheng Lu. A dynamic load distributing algorithm based on a parallel computing network environment. Journal of Computer Research and Development, 1999, 36(2): 238\u2013241. (in Chinese)"},{"key":"674_CR46","unstructured":"Zhi-yan Jin, Ding-xing Wang. Diffusion algorithm of dynamic load balancing for heterogeneous system. Chinese Journal of Computers, 2003, 26(11): 1487\u20131493. (in Chinese)"},{"key":"674_CR47","doi-asserted-by":"crossref","unstructured":"Yan-zhi Wen, Rui-qi Lian, Cheng-yong Wu et al. A micro-scheduling method on directed cyclic graph. Journal of Computer Research and Development, 2005, 42(3): 387\u2013393. (in Chinese)","DOI":"10.1360\/crad20050305"},{"key":"674_CR48","unstructured":"Jin-Wei Hong, Guo-liang Chen, Zhao-qing Zhang, Feng Zhang. Compiling-support communication optimizations for SVMs. Chinese Journal of Computers, 2000, 23(7): 738\u2013743. (in Chinese)"},{"key":"674_CR49","unstructured":"Rui-qi Lian, Zhao-qing Zhang, Ru-liang Qiao. A data prefetching method used in ILP compilers and its optimization. Chinese Journal of Computers, 2000, 23(6): 576\u2013584. (in Chinese)"},{"key":"674_CR50","unstructured":"Rui-qi Lian, Cheng-yong Wu, Zhao-qing Zhang. Integrating code optimization and instruction scheduling. Chinese Journal of Computers, 2001, 24(7): 694\u2013701. (in Chinese)"},{"key":"674_CR51","unstructured":"Yun-zhao Lu, Zhao-qing Zhang, Rui-qi Lian. Predicate analysis techniques in ILP. Chinese Journal of Computers, 2003, 26(10): 1337\u20131342. 
(in Chinese)"},{"key":"674_CR52","unstructured":"Wenlong Li, Haibo Lin, Zhizhong Tang. Cost model and decision framework for software pipelining. Journal of Software, 2004, 15(7): 1005\u20131011. (in Chinese)"},{"key":"674_CR53","unstructured":"Haibo Lin, Wenlong Li, Zhizhong Tang. Research on register requirements of software pipelined loops in the IA-64 architecture. Journal of Computer Research and Development, 2004, 41(1): 22\u201327. (in Chinese)"},{"key":"674_CR54","doi-asserted-by":"crossref","unstructured":"Li Liu, Wenlong Li, Zhenyu Gu, Shengmei Li, Zhizhong Tang. Optimization to prevent cache penalty in modulo scheduling. Journal of Software, 2005, 16(10): 1842\u20131852. (in Chinese)","DOI":"10.1360\/jos161842"},{"key":"674_CR55","unstructured":"Jun Xia, Xuejun Yang, Lifang Zeng, Haifang Zhou. A projection-delamination based approach to optimizing spatial locality in loop nests. Chinese Journal of Computers, 2003, 26(5): 539\u2013551. (in Chinese)"},{"key":"674_CR56","unstructured":"Jun Xia, Huadong Dai, Xuejun Yang. A linear expressing based approach for optimizing locality using non-singular loop transformations. Chinese Journal of Computers, 2003, 26(12): 1609\u20131620. (in Chinese)"},{"key":"674_CR57","unstructured":"Jun Xia, Xuejun Yang. A data space fusion based approach for global computation and data decompositions. Journal of Software, 2004, 15(9): 1311\u20131327. (in Chinese)"},{"key":"674_CR58","unstructured":"Guokai Ma, Xin Wang, Peng Wang et al. Increase parallel granularity and data locality by unimodular metrics. Chinese Journal of Computers, 2004, 27(4): 516\u2013523. (in Chinese)"},{"key":"674_CR59","unstructured":"Lifang Zeng, Xuejun Yang, Jun Xia, Juan Chen. Improving data locality and reducing false-sharing based on data fusion. Chinese Journal of Computers, 2005, 27(1): 32\u201341. (in Chinese)"},{"key":"674_CR60","unstructured":"Yijun Yu, Binyu Zang, Wu Shi, Chuanqi Zhu. 
Automatically computing unimodular transforming matrix to parallelize nested sequential loops. Journal of Software, 1999, 10(4): 366\u2013371. (in Chinese)"},{"key":"674_CR61","unstructured":"Jianping Wang, Xu Cheng, Wenkui Ding et al. The implementation strategy of communication in HPF compiler and related algorithms. Chinese Journal of Computers, 1999, 22(5): 486\u2013496. (in Chinese)"},{"key":"674_CR62","unstructured":"Li Chen, Zhaoqing Zhang, Xiaobing Feng. Redundant computation partitioning in distributed-memory systems. Chinese Journal of Computers, 2003, 26(2): 180\u2013187. (in Chinese)"},{"key":"674_CR63","unstructured":"Bo Yang, Dingxing Wang, Weimin Zheng. An algorithm on task scheduling in structural parallel control mechanism. Journal of Software, 2001, 12(5): 698\u2013705. (in Chinese)"},{"key":"674_CR64","unstructured":"Qiang Liu, Zhaoqing Zhang, Ruliang Qiao. An integrated tool for debugging, monitoring and performance analysis. Journal of Software, 1999, 10(2): 220\u2013224. (in Chinese)"},{"key":"674_CR65","unstructured":"Jian Liu, Hao Wang, Meiming Sheng, Weimin Zheng. A parallel debugger with fast conditional breakpoint. Journal of Software, 2003, 14(11): 1827\u20131833. (in Chinese)"},{"key":"674_CR66","unstructured":"Chao Yan, Taoying Liu, Guoliang Chen. A parallel debugger based on cluster operating system. Journal of Computer Research and Development, 2004, 41(4): 630\u2013636. (in Chinese)"},{"key":"674_CR67","unstructured":"Zhiwei Xu, Wei Li. Research on Vega grid architecture. Journal of Computer Research and Development, 2002, 39(8): 923\u2013929. (in Chinese)"},{"key":"674_CR68","unstructured":"Xicheng Lu, Huaimin Wang, Ji Wang. Internet-based virtual computing environment (iVCE): Concepts and architecture. Science in China, Series E, 2006, 36(10). (To appear)"},{"issue":"4","key":"674_CR69","first-page":"421","volume":"48","author":"Li Dongsheng","year":"2005","unstructured":"Dongsheng Li, Xicheng Lu. 
A novel constant degree and constant congestion DHT scheme for peer-to-peer networks. Science in China, 2005, 48(4): 421\u2013436.","journal-title":"Science in China"},{"issue":"4","key":"674_CR70","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1145\/1053291.1053325","volume":"48","author":"Zhuge Hai","year":"2005","unstructured":"Hai Zhuge. Semantic grid: Scientific issues, infrastructure, and methodology. Communications of the ACM, 2005, 48(4): 117\u2013119.","journal-title":"Communications of the ACM"},{"key":"674_CR71","unstructured":"HPCS program. http:\/\/www.highproductivity.org\/ ."},{"key":"674_CR72","unstructured":"National energy research scientific computing center 2004 annual report. National Energy Research Scientific Computing Center, 2005, http:\/\/www.nersc.gov\/news\/annual_reports\/an-nre-p04\/annrep04.pdf ."},{"issue":"1","key":"674_CR73","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1145\/216585.216588","volume":"23","author":"W Wulf","year":"1995","unstructured":"Wulf W, McKee S. Hitting the memory wall: Implications of the obvious. 
Computer Architecture News, 1995, 23(1): 20\u201324.","journal-title":"Computer Architecture News"}],"container-title":["Journal of Computer Science and Technology"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11390-006-0674-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11390-006-0674-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11390-006-0674-8","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,1]],"date-time":"2019-06-01T14:32:37Z","timestamp":1559399557000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11390-006-0674-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,9]]},"references-count":73,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2006,9]]}},"alternative-id":["674"],"URL":"https:\/\/doi.org\/10.1007\/s11390-006-0674-8","relation":{},"ISSN":["1000-9000","1860-4749"],"issn-type":[{"value":"1000-9000","type":"print"},{"value":"1860-4749","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,9]]}}}