{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T23:41:01Z","timestamp":1768347661335,"version":"3.49.0"},"reference-count":32,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2019,5,4]],"date-time":"2019-05-04T00:00:00Z","timestamp":1556928000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Key R&amp;D Program of China\uff1bNational Natural Science Foundation of China","award":["2017YFC0703903\uff0c2017YFC0704003\uff1b51705341"],"award-info":[{"award-number":["2017YFC0703903\uff0c2017YFC0704003\uff1b51705341"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>During the construction of prefabricated building, there are some problems such as a time consuming, low-level of automation when precast concrete members are assembled and positioned. This paper presents vision-based intelligent assembly alignment guiding technology for columnar precast concrete members. We study the video images of assembly alignment of the hole at the bottom of the precast concrete members and the rebar on the ground. Our goal is to predict the trajectory of the moving target in a future moment and the movement direction at each position during the alignment process by assembly image sequences. However, trajectory prediction is still subject to the following challenges: (1) the effect of external environment (illumination) on image quality; (2) small target detection in complex backgrounds; (3) low accuracy of trajectory prediction results based on the visual context model. In this paper, we use mask and adaptive histogram equalization to improve the quality of the image and improved method to detect the targets. In addition, aiming at the low position precision of trajectory prediction based on the context model, we propose the end point position-matching equation according to the principle of end point pixel matching of the moving target and fixed target, as the constraint term of the loss function to improve the prediction accuracy of the network. In order to evaluate comprehensively the performance of the proposed method on the trajectory prediction in the assembly alignment task, we construct the image dataset, use Hausdorff distance as the evaluation index, and compare with existing prediction methods. The experimental results show that, this framework is better than the existing methods in accuracy and robustness at the prediction of assembly alignment motion trajectory of columnar precast concrete members.<\/jats:p>","DOI":"10.3390\/sym11050629","type":"journal-article","created":{"date-parts":[[2019,5,9]],"date-time":"2019-05-09T11:22:35Z","timestamp":1557400955000},"page":"629","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Trajectory Prediction of Assembly Alignment of Columnar Precast Concrete Members with Deep Learning"],"prefix":"10.3390","volume":"11","author":[{"given":"Ke","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Mechanical Engineering, Shenyang Jianzhu University, Shenyang 110168, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4321-3674","authenticated-orcid":false,"given":"Shenghao","family":"Tong","sequence":"additional","affiliation":[{"name":"School of Mechanical Engineering, Shenyang Jianzhu University, Shenyang 110168, China"}]},{"given":"Huaitao","family":"Shi","sequence":"additional","affiliation":[{"name":"School of Mechanical Engineering, Shenyang Jianzhu University, Shenyang 110168, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,5,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Zhang, X., Zheng, Y., Ota, J., and Huang, Y. (2017). Peg-in-Hole Assembly Based on Two-phase Scheme and F\/T Sensor for Dual-arm Robot. Sensors, 17.","DOI":"10.3390\/s17092004"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"425","DOI":"10.1007\/s12541-014-0353-6","article-title":"Hole detection algorithm for chamferless square peg-in-hole based on shape recognition using F\/T sensor","volume":"15","author":"Kim","year":"2014","journal-title":"Int. J. Precis. Eng. Manuf."},{"key":"ref_3","first-page":"495","article-title":"Visual servoing for low-cost SCARA robots using an RGB-D camera as the only sensor","volume":"58","author":"Cupec","year":"2018","journal-title":"Automatika: \u010casopis za Automatiku, Mjerenje, Elektroniku, Ra\u010dunarstvo i Komunikacije"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.neucom.2017.01.077","article-title":"Teaching Robots to Do Object Assembly using Multi-modal 3D Vision","volume":"259","author":"Wan","year":"2017","journal-title":"Neurocomputing"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1109\/TRO.2016.2596799","article-title":"Surface-Based Detection and 6-DoF Pose Estimation of 3-D Objects in Cluttered Scenes","volume":"32","author":"Teng","year":"2016","journal-title":"IEEE Trans. Robot."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Kitani, K., Ziebart, B., Bagnell, J., and Hebert, M. (2012, January 7\u201313). Activity forecasting. Proceedings of the Computer Vision\u2013ECCV 2012, Florence, Italy.","DOI":"10.1007\/978-3-642-33765-9_15"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Xu, M., Song, Y., Wang, J., Qiao, M., Huo, L., and Wang, Z. (2018). Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach. IEEE Trans. Pattern Anal. Mach. Intell., 1.","DOI":"10.1109\/TPAMI.2018.2858783"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Yoo, Y., Yun, K., Yun, S., Hong, J., Jeong, H., and Young Choi, J. (2016, January 27\u201330). Visual Path Prediction in Complex Scenes with Crowded Moving Objects. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.292"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1016\/j.neunet.2012.02.023","article-title":"Multi-column Deep Neural Network for Traffic Sign Classification","volume":"32","author":"Meier","year":"2012","journal-title":"Neural Netw."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2314","DOI":"10.1109\/TPAMI.2016.2636150","article-title":"STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation","volume":"39","author":"Wei","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1109\/MSP.2017.2742558","article-title":"Weakly Supervised Learning with Deep Convolutional Neural Networks for Semantic Segmentation: Understanding Semantic Layout of Images with Minimum Human Supervision","volume":"34","author":"Hong","year":"2017","journal-title":"IEEE Signal Process. Mag."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1618","DOI":"10.1080\/09500340.2016.1163428","article-title":"Image contrast enhancement using histogram equalization with maximum intensity coverage","volume":"63","author":"Wong","year":"2016","journal-title":"J. Mod. Opt."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1440","DOI":"10.1080\/09500340.2016.1154194","article-title":"Contrast enhancement via texture region based histogram equalization","volume":"63","author":"Singh","year":"2016","journal-title":"J. Mod. Opt."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/TITS.2011.2113340","article-title":"Vehicle Detection and Tracking in Car Video Based on Motion Model","volume":"12","author":"Jazayeri","year":"2011","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"Lecun","year":"2015","journal-title":"Nature"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot MultiBox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Leng, J., and Liu, Y. (2018). An enhanced SSD with feature fusion and visual reasoning for object detection. Neural Comput. Appl.","DOI":"10.1007\/s00521-018-3486-1"},{"key":"ref_19","unstructured":"Fu, C.Y., Liu, W., Ranga, A., Tyagi, A., and Berg, A.C. (2017). DSSD: Deconvolutional Single Shot Detector. arXiv."},{"key":"ref_20","first-page":"3504","article-title":"Pedestrian Movement Direction Recognition Using Convolutional Neural Networks","volume":"18","author":"Cazorla","year":"2017","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1016\/j.ins.2016.08.038","article-title":"Ontology-based Deep Learning for Human Behavior Prediction with Explanations in Health Social Networks","volume":"384","author":"Phan","year":"2017","journal-title":"Inf. Sci."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1401","DOI":"10.1021\/acs.jproteome.6b00618","article-title":"Deep-Learning-Based Drug-Target Interaction Prediction","volume":"16","author":"Wen","year":"2017","journal-title":"J. Proteome Res."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Walker, J., Gupta, A., and Hebert, M. (2014, January 23\u201328). Patch to the Future: Unsupervised Visual Prediction. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.416"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1038\/nature14236","article-title":"Human-level control through deep reinforcement learning","volume":"518","author":"Mnih","year":"2015","journal-title":"Nature"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Pfeiffer, M., Schaeuble, M., Nieto, J., Siegwart, R., and Cadena, C. (June, January 29). From Perception to Decision: A Data-driven Approach to End-to-end Motion Planning for Autonomous Ground Robots. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989182"},{"key":"ref_26","first-page":"865","article-title":"Traffic Flow Prediction With Big Data: A Deep Learning Approach","volume":"16","author":"Lv","year":"2015","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions Computer Vision and Pattern Recognition. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Lin, Y., Dai, X., Li, L., and Wang, F.Y. (2018). Pattern Sensitive Prediction of Traffic Flow Based on Generative Adversarial Framework. IEEE Trans. Intell. Transp. Syst., 1\u20136.","DOI":"10.1109\/TITS.2018.2857224"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"4446","DOI":"10.1109\/TIP.2017.2710620","article-title":"DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations","volume":"26","author":"Kruthiventi","year":"2017","journal-title":"IEEE Trans. Image Process."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1007\/s11263-016-0884-7","article-title":"Visualizing Object Detection Features","volume":"119","author":"Vondrick","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1109\/TIP.2017.2754941","article-title":"Video Salient Object Detection via Fully Convolutional Networks","volume":"27","author":"Wang","year":"2017","journal-title":"IEEE Trans. Image Process."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"5892","DOI":"10.1109\/TIP.2016.2613686","article-title":"Deep Learning Driven Visual Path Prediction from a Single Image","volume":"25","author":"Huang","year":"2016","journal-title":"IEEE Trans. Image Process."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/5\/629\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:49:05Z","timestamp":1760186945000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/5\/629"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,5,4]]},"references-count":32,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2019,5]]}},"alternative-id":["sym11050629"],"URL":"https:\/\/doi.org\/10.3390\/sym11050629","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,5,4]]}}}