{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T04:28:05Z","timestamp":1772252885017,"version":"3.50.1"},"reference-count":49,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2021,11,6]],"date-time":"2021-11-06T00:00:00Z","timestamp":1636156800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>In this paper, we present the idea of Self Supervised learning on the shape completion and classification of point clouds. Most 3D shape completion pipelines utilize AutoEncoders to extract features from point clouds used in downstream tasks such as classification, segmentation, detection, and other related applications. Our idea is to add contrastive learning into AutoEncoders to encourage global feature learning of the point cloud classes. It is performed by optimizing triplet loss. Furthermore, local feature representations learning of point cloud is performed by adding the Chamfer distance function. To evaluate the performance of our approach, we utilize the PointNet classifier. We also extend the number of classes for evaluation from 4 to 10 to show the generalization ability of the learned features. Based on our results, embeddings generated from the contrastive AutoEncoder enhances shape completion and classification performance from 84.2% to 84.9% of point clouds achieving the state-of-the-art results with 10 classes.<\/jats:p>","DOI":"10.3390\/s21217392","type":"journal-article","created":{"date-parts":[[2021,11,7]],"date-time":"2021-11-07T20:42:54Z","timestamp":1636317774000},"page":"7392","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Contrastive Learning for 3D Point Clouds Classification and Shape Completion"],"prefix":"10.3390","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6364-8427","authenticated-orcid":false,"given":"Danish","family":"Nazir","sequence":"first","affiliation":[{"name":"Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany"},{"name":"Mindgarage, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0536-6867","authenticated-orcid":false,"given":"Muhammad Zeshan","family":"Afzal","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany"},{"name":"Mindgarage, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany"},{"name":"German Research Institute for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alain","family":"Pagani","sequence":"additional","affiliation":[{"name":"German Research Institute for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4029-6574","authenticated-orcid":false,"given":"Marcus","family":"Liwicki","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Lule\u00e5 University of Technology, 971 87 Lule\u00e5, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Didier","family":"Stricker","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany"},{"name":"German Research Institute for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,11,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Bello, S.A., Yu, S., Wang, C., Adam, J.M., and Li, J. (2020). Deep learning on 3D point clouds. Remote Sens., 12.","DOI":"10.3390\/rs12111729"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Guo, Y., Wang, H., Hu, Q., Liu, H., Liu, L., and Bennamoun, M. (2020). Deep Learning for 3D Point Clouds: A Survey. IEEE Trans. Pattern Anal. Mach. Intell.","DOI":"10.1109\/TPAMI.2020.3005434"},{"key":"ref_3","unstructured":"Qi, C.R., Su, H., Mo, K., and Guibas, L.J. (2017, January 21\u201326). PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA."},{"key":"ref_4","first-page":"1","article-title":"Dynamic Graph CNN for Learning on Point Clouds","volume":"38","author":"Wang","year":"2019","journal-title":"ACM Trans. Graph."},{"key":"ref_5","unstructured":"Qi, C.R., Yi, L., Su, H., and Guibas, L.J. (December, January ). PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS\u201917, Long Beach, CA, USA."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Zhang, L., and Zhu, Z. (2019, January 16\u201319). Unsupervised Feature Learning for Point Cloud Understanding by Contrasting and Clustering Using Graph Convolutional Neural Networks. Proceedings of the 2019 International Conference on 3D Vision (3DV), Quebec City, QC, Canada.","DOI":"10.1109\/3DV.2019.00051"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Wang, X., Ang, M.H., and Lee, G. (2021). Cascaded Refinement Network for Point Cloud Completion with Self-supervision. IEEE Trans. Pattern Anal. Mach. Intell.","DOI":"10.1109\/TPAMI.2021.3108410"},{"key":"ref_8","first-page":"1109","article-title":"Large Scale Online Learning of Image Similarity Through Ranking","volume":"11","author":"Chechik","year":"2010","journal-title":"J. Mach. Learn. Res. JMLR"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Schroff, F., Kalenichenko, D., and Philbin, J. (2015, January 7\u201312). FaceNet: A Unified Embedding for Face Recognition and Clustering. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1007\/s11263-016-0923-4","article-title":"Learning a distance metric from relative comparisons between quadruplets of images","volume":"121","author":"Law","year":"2017","journal-title":"Int. J. Comput. Vis."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Feragen, A., Pelillo, M., and Loog, M. (2015). Deep Metric Learning Using Triplet Network. Similarity-Based Pattern Recognition, Springer International Publishing.","DOI":"10.1007\/978-3-319-24261-3"},{"key":"ref_12","unstructured":"Hermans, A., Beyer, L., and Leibe, B. (2017). In defense of the triplet loss for person re-identification. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zhou, M., Niu, Z., Wang, L., Gao, Z., Zhang, Q., and Hua, G. (2020, January 7\u201312). Ladder Loss for Coherent Visual-Semantic Embedding. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i07.7006"},{"key":"ref_14","unstructured":"Chang, A.X., Funkhouser, T., Guibas, L., Hanrahan, P., Huang, Q., Li, Z., Savarese, S., Savva, M., Song, S., and Su, H. (2015). ShapeNet: An Information-Rich 3D Model Repository. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Poursaeed, O., Jiang, T., Qiao, Q., Xu, N., and Kim, V.G. (2020). Self-Supervised Learning of Point Clouds via Orientation Estimation. arXiv.","DOI":"10.1109\/3DV50981.2020.00112"},{"key":"ref_16","unstructured":"Sauder, J., and Sievers, B. (2019). Self-Supervised Deep Learning on Point Clouds by Reconstructing Space. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Tchapmi, L.P., Kosaraju, V., Rezatofighi, H., Reid, I., and Savarese, S. (2019, January 15\u201320). TopNet: Structural Point Cloud Decoder. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00047"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Wen, X., Xiang, P., Han, Z., Cao, Y.P., Wan, P., Zheng, W., and Liu, Y.S. (2021, January 19\u201325). PMP-Net: Point Cloud Completion by Learning Multi-step Point Moving Paths. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition 2021, Virtual.","DOI":"10.1109\/CVPR46437.2021.00736"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Aubry, M., Schlickewei, U., and Cremers, D. (2011, January 6\u201313). The wave kernel signature: A quantum mechanical approach to shape analysis. Proceedings of the 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), Barcelona, Spain.","DOI":"10.1109\/ICCVW.2011.6130444"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Bronstein, M.M., and Kokkinos, I. (2010, January 13\u201318). Scale-invariant heat kernel signatures for non-rigid shape recognition. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539838"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Rusu, R.B., Blodow, N., and Beetz, M. (2009, January 12\u201317). Fast Point Feature Histograms (FPFH) for 3D registration. Proceedings of the 2009 IEEE International Conference on Robotics and Automation, Kobe, Japan.","DOI":"10.1109\/ROBOT.2009.5152473"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Rusu, R.B., Blodow, N., Marton, Z.C., and Beetz, M. (2008, January 22\u201326). Aligning point cloud views using persistent feature histograms. Proceedings of the 2008 IEEE\/RSJ International Conference on Intelligent Robots and Systems, Nice, France.","DOI":"10.1109\/IROS.2008.4650967"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Qi, C.R., Su, H., Niessner, M., Dai, A., Yan, M., and Guibas, L.J. (2016, January 27\u201330). Volumetric and Multi-View CNNs for Object Classification on 3D Data. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.609"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.neucom.2018.09.075","article-title":"NormalNet: A voxel-based CNN for 3D object classification and retrieval","volume":"323","author":"Wang","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Maturana, D., and Scherer, S. (2015, January 26\u201330). 3d convolutional neural networks for landing zone detection from lidar. Proceedings of the 2015 IEEE international conference on robotics and automation (ICRA), Seattle, WA, USA.","DOI":"10.1109\/ICRA.2015.7139679"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Maturana, D., and Scherer, S. (October, January 28). Voxnet: A 3d convolutional neural network for real-time object recognition. Proceedings of the 2015 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany.","DOI":"10.1109\/IROS.2015.7353481"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Kalogerakis, E., Averkiou, M., Maji, S., and Chaudhuri, S. (2017, January 21\u201326). 3D Shape Segmentation With Projective Convolutional Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.702"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Cao, Z., Huang, Q., and Karthik, R. (2017, January 10\u201312). 3D Object Classification via Spherical Projections. Proceedings of the 2017 International Conference on 3D Vision (3DV), Qingdao, China.","DOI":"10.1109\/3DV.2017.00070"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Zhang, L., Sun, J., and Zheng, Q. (2018). 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network. Sensors, 18.","DOI":"10.3390\/s18113681"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"4392","DOI":"10.1109\/LRA.2020.2994483","article-title":"ECG: Edge-aware Point Cloud Completion with Graph Convolution","volume":"5","author":"Pan","year":"2020","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Eckart, B., Yuan, W., Liu, C., and Kautz, J. (2021, January 19\u201325). Self-Supervised Learning on 3D Point Clouds by Learning Discrete Generative Models. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Virtual.","DOI":"10.1109\/CVPR46437.2021.00815"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Choy, C., Park, J., and Koltun, V. (November, January 27). Fully Convolutional Geometric Features. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00905"},{"key":"ref_33","unstructured":"Achlioptas, P., Diamanti, O., Mitliagkas, I., and Guibas, L. (2017). Representation learning and adversarial generation of 3d point clouds. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Yang, Y., Feng, C., Shen, Y., and Tian, D. (2018, January 18\u201323). FoldingNet: Point Cloud Auto-Encoder via Deep Grid Deformation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00029"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Dai, A., Ruizhongtai Qi, C., and Nie\u00dfner, M. (2017, January 21\u201326). Shape completion using 3d-encoder-predictor cnns and shape synthesis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.693"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Han, X., Li, Z., Huang, H., Kalogerakis, E., and Yu, Y. (2017, January 22\u201329). High-resolution shape completion using deep neural networks for global structure and local geometry inference. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.19"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Stutz, D., and Geiger, A. (2018, January 18\u201323). Learning 3d shape completion from laser scan data with weak supervision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00209"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Le, T., and Duan, Y. (2018, January 18\u201323). Pointgrid: A deep network for 3D shape understanding. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00959"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Sarmad, M., Lee, H.J., and Kim, Y.M. (2019, January 15\u201320). RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00605"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Yuan, W., Khot, T., Held, D., Mertz, C., and Hebert, M. (2018, January 5\u20138). PCN: Point Completion Network. Proceedings of the 2018 International Conference on 3D Vision (3DV), Verona, Italy.","DOI":"10.1109\/3DV.2018.00088"},{"key":"ref_41","first-page":"40","article-title":"Learning Representations and Generative Models for 3D Point Clouds","volume":"Volume 80","author":"Achlioptas","year":"2018","journal-title":"Proceedings of the 35th International Conference on Machine Learning"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1023\/A:1026543900054","article-title":"The Earth Mover\u2019s Distance as a Metric for Image Retrieval","volume":"40","author":"Rubner","year":"2000","journal-title":"Int. J. Comput. Vis."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Fan, H., Su, H., and Guibas, L.J. (2017, January 21\u201326). A Point Set Generation Network for 3D Object Reconstruction From a Single Image. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.264"},{"key":"ref_44","unstructured":"Wallach, H., Larochelle, H., Beygelzimer, A., d\u2019Alch\u00e9-Buc, F., Fox, E., and Garnett, R. (2019). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems 32, Curran Associates, Inc."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Li, F.-F. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1145\/219717.219748","article-title":"WordNet: A Lexical Database for English","volume":"38","author":"Miller","year":"1995","journal-title":"Commun. ACM"},{"key":"ref_47","first-page":"2579","article-title":"Visualizing High-Dimensional Data Using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.neucom.2012.02.034","article-title":"Stochastic Neighbor Embedding (SNE) for Dimension Reduction and Visualization Using Arbitrary Divergences","volume":"90","author":"Bunte","year":"2012","journal-title":"Neurocomputing"},{"key":"ref_49","first-page":"3221","article-title":"Accelerating T-SNE Using Tree-Based Algorithms","volume":"15","year":"2014","journal-title":"J. Mach. Learn. Res."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/21\/7392\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:26:59Z","timestamp":1760167619000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/21\/7392"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,6]]},"references-count":49,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2021,11]]}},"alternative-id":["s21217392"],"URL":"https:\/\/doi.org\/10.3390\/s21217392","relation":{"has-preprint":[{"id-type":"doi","id":"10.20944\/preprints202109.0112.v1","asserted-by":"object"}]},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11,6]]}}}