{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T12:56:33Z","timestamp":1777640193517,"version":"3.51.4"},"reference-count":50,"publisher":"Emerald","issue":"13","license":[{"start":{"date-parts":[[2024,4,26]],"date-time":"2024-04-26T00:00:00Z","timestamp":1714089600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["ECAM"],"published-print":{"date-parts":[[2024,12,16]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>Factors like bid price, submission time, and number of bidders influence the procurement process in public projects. These factors and the award criteria may impact the project\u2019s financial compliance. Predicting budget compliance in construction projects has been traditionally challenging, but Machine Learning (ML) techniques have revolutionised estimations.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>In this study, Portuguese Public Procurement Data (PPPData) was utilised as the model\u2019s input. Notably, this dataset exhibited a substantial imbalance in the target feature. To address this issue, the study evaluated three distinct data balancing techniques: oversampling, undersampling, and the SMOTE method. Next, a comprehensive feature selection process was conducted, leading to the testing of five different algorithms for forecasting budget compliance. Finally, a secondary test was conducted, refining the features to include only those elements that procurement technicians can modify while also considering the two most accurate predictors identified in the previous test.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>The findings indicate that employing the SMOTE method on the scraped data can achieve a balanced dataset. Furthermore, the results demonstrate that the Adam ANN algorithm outperformed others, boasting a precision rate of 68.1%.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Practical implications<\/jats:title><jats:p>The model can aid procurement technicians during the tendering phase by using historical data and analogous projects to predict performance.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Social implications<\/jats:title><jats:p>Although the study reveals that ML algorithms cannot accurately predict budget compliance using procurement data, they can still provide project owners with insights into the most suitable criteria, aiding decision-making. Further research should assess the model\u2019s impact and capacity within the procurement workflow.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>Previous research predominantly focused on forecasting budgets by leveraging data from the private construction execution phase. While some investigations incorporated procurement data, this study distinguishes itself by using an imbalanced dataset and anticipating compliance rather than predicting budgetary figures. The model predicts budget compliance by analysing qualitative and quantitative characteristics of public project contracts. The research paper explores various model architectures and data treatment techniques to develop a model to assist the Client in tender definition.<\/jats:p><\/jats:sec>","DOI":"10.1108\/ecam-09-2023-0973","type":"journal-article","created":{"date-parts":[[2024,5,6]],"date-time":"2024-05-06T09:28:00Z","timestamp":1714987680000},"page":"285-302","source":"Crossref","is-referenced-by-count":10,"title":["Predicting construction project compliance with machine learning model: case study using Portuguese procurement data"],"prefix":"10.1108","volume":"31","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0789-9368","authenticated-orcid":false,"given":"Lu\u00eds","family":"Jacques de Sousa","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9878-3792","authenticated-orcid":false,"given":"Jo\u00e3o","family":"Po\u00e7as Martins","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2578-6981","authenticated-orcid":false,"given":"Lu\u00eds","family":"Sanhudo","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2024,4,26]]},"reference":[{"issue":"4","key":"key2024121104410440400_ref001","doi-asserted-by":"publisher","first-page":"689","DOI":"10.1109\/tem.2018.2856376","article-title":"A big data analytics approach for construction firms failure prediction models","volume":"66","year":"2019","journal-title":"IEEE Transactions on Engineering Management"},{"key":"key2024121104410440400_ref002","doi-asserted-by":"publisher","first-page":"437","DOI":"10.1016\/j.aej.2023.09.069","article-title":"Evaluating construction contractors in the pre-tendering stage through an integrated based model","volume":"82","year":"2023","journal-title":"Alexandria Engineering Journal"},{"issue":"1","key":"key2024121104410440400_ref003","first-page":"621","article-title":"Building information modelling for project cost estimation","volume":"3","year":"2021","journal-title":"Recent Trends in Civil Engineering and Built Environment"},{"key":"key2024121104410440400_ref004","first-page":"356","article-title":"Data-led learning: using natural language processing (NLP) and machine learning to learn from construction site safety failures","year":"2020","journal-title":"Management"},{"issue":"1","key":"key2024121104410440400_ref005","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1186\/1471-2105-14-106","article-title":"SMOTE for high-dimensional class-imbalanced data","volume":"14","year":"2013","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"key2024121104410440400_ref006","doi-asserted-by":"publisher","first-page":"229","DOI":"10.30880\/ijie.2021.13.03.028","article-title":"The key criteria in deciding to tender for construction projects","volume":"13","year":"2021","journal-title":"International Journal of Integrated Engineering"},{"key":"key2024121104410440400_ref007","first-page":"144","article-title":"A training algorithm for optimal margin classifiers","year":"1992"},{"issue":"1","key":"key2024121104410440400_ref008","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/a:1010933404324","article-title":"Machine learning","volume":"45","year":"2001","journal-title":"Machine Learning"},{"key":"key2024121104410440400_ref009","unstructured":"Brownlee, J. (2020a), \u201cHow to calculate feature importance with Python\u201d, available at: https:\/\/machinelearningmastery.com\/calculate-feature-importance-with-python\/ (accessed 24 January 2023)."},{"key":"key2024121104410440400_ref010","unstructured":"Brownlee, J. (2020b), \u201cTour of data sampling methods for imbalanced classification\u201d, available at: https:\/\/machinelearningmastery.com\/data-sampling-methods-for-imbalanced-classification\/#:\u223c:text=The%20simplest%20oversampling%20method%20involves,for%20Synthetic%20Minority%20Oversampling%20Technique"},{"issue":"1","key":"key2024121104410440400_ref011","first-page":"91","article-title":"Estimation of building construction cost using artificial neural networks","volume":"3","year":"2019","journal-title":"Journal of Soft Computing in Civil Engineering"},{"key":"key2024121104410440400_ref012","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: synthetic minority over-sampling technique","volume":"16","year":"2002","journal-title":"Journal of Artificial Intelligence Research"},{"key":"key2024121104410440400_ref013","doi-asserted-by":"publisher","DOI":"10.1016\/j.seps.2018.02.007","article-title":"Decision making framework for tender evaluation and contractor selection in public organisations with risk considerations","volume":"68","year":"2019","journal-title":"Socio-Economic Planning Sciences"},{"key":"key2024121104410440400_ref014","first-page":"85","article-title":"Towards automated cost analysis, benchmarking and estimating in construction: a machine learning approach","year":"2019"},{"key":"key2024121104410440400_ref015","article-title":"XGBoost: a scalable tree boosting system","year":"2016"},{"issue":"3","key":"key2024121104410440400_ref016","doi-asserted-by":"publisher","first-page":"302","DOI":"10.1016\/j.autcon.2009.12.003","article-title":"Project success prediction using an evolutionary support vector machine inference model","volume":"19","year":"2010","journal-title":"Automation in Construction"},{"issue":"1","key":"key2024121104410440400_ref017","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1109\/tit.1967.1053964","article-title":"Nearest neighbor pattern classification","volume":"13","year":"1967","journal-title":"IEEE Transactions on Information Theory"},{"key":"key2024121104410440400_ref018","first-page":"928","volume-title":"Support Vector Machines","year":"2008"},{"issue":"1","key":"key2024121104410440400_ref019","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1080\/13467581.2020.1838288","article-title":"Artificial intelligence for developing accurate preliminary cost estimates for composite flooring systems of multi-storey buildings","volume":"21","year":"2021","journal-title":"Journal of Asian Architecture and Building Engineering"},{"key":"key2024121104410440400_ref020","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2020.105688","article-title":"Data on field canals improvement projects for cost prediction using artificial intelligence","volume":"31","year":"2020","journal-title":"Data in Brief"},{"issue":"3","key":"key2024121104410440400_ref021","doi-asserted-by":"publisher","DOI":"10.1037\/e471672008-001","article-title":"Discriminatory analysis. Nonparametric discrimination: consistency properties","volume":"57","year":"1951","journal-title":"USAF School of Aviation Medicine, Randolph Field, Texas. Archived (PDF)"},{"key":"key2024121104410440400_ref022","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2020\/8858258","article-title":"Bidders recommender for public procurement auctions using machine learning: data analysis, algorithm, and case study with tenders from Spain","volume":"2020","year":"2020","journal-title":"Complexity"},{"key":"key2024121104410440400_ref023","doi-asserted-by":"publisher","DOI":"10.1016\/j.autcon.2021.104047","article-title":"Collusion detection in public procurement auctions with machine learning algorithms","volume":"133","year":"2022","journal-title":"Automation in Construction"},{"issue":"2","key":"key2024121104410440400_ref024","doi-asserted-by":"publisher","first-page":"47","DOI":"10.3390\/buildings11020047","article-title":"Bidding strategy in construction public procurement: a contractor's perspective","volume":"11","year":"2021","journal-title":"Buildings"},{"key":"key2024121104410440400_ref025","first-page":"55","article-title":"Base de dados: Contrata\u00e7\u00e3o p\u00fablica em Portugal entre 2015 e 2022","year":"2022"},{"key":"key2024121104410440400_ref026","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2023.109063","article-title":"Portuguese public procurement data for construction (2015-2022)","volume":"48","year":"2023","journal-title":"Data in Brief"},{"key":"key2024121104410440400_ref027","doi-asserted-by":"crossref","unstructured":"Jacques de Sousa, L., Po\u00e7as Martins, J., Santos Baptista, J. and Sanhudo, L. (2023b), \u201cTowards the development of a budget categorisation machine learning tool: a review\u201d, in Gomes Correia, A., Azenha, M., Cruz, P.J.S., Novais, P. and Pereira, P. (Eds), Trends on Construction in the Digital Era, Guimar\u00e3es, pp.\u00a0101-110.","DOI":"10.1007\/978-3-031-20241-4_8"},{"issue":"13","key":"key2024121104410440400_ref028","doi-asserted-by":"publisher","DOI":"10.3390\/app11136188","article-title":"Automated extraction and time-cost prediction of contractual reporting requirements in construction using natural language processing and simulation","volume":"11","year":"2021","journal-title":"Applied Sciences"},{"issue":"3","key":"key2024121104410440400_ref029","doi-asserted-by":"publisher","first-page":"601","DOI":"10.1108\/jedt-08-2019-0195","article-title":"Estimation of construction project building cost by back-propagation neural network","volume":"18","year":"2020","journal-title":"Journal of Engineering, Design and Technology"},{"issue":"3","key":"key2024121104410440400_ref030","doi-asserted-by":"publisher","first-page":"48","DOI":"10.2478\/ceer-2020-0033","article-title":"Development of cost estimation models based on ANN ensembles and the SVM method","volume":"30","year":"2020","journal-title":"Civil And Environmental Engineering Reports"},{"issue":"7","key":"key2024121104410440400_ref031","doi-asserted-by":"publisher","first-page":"715","DOI":"10.3846\/jcem.2019.10534","article-title":"Forecasting of sports fields construction costs aided by ensembles of neural networks","volume":"25","year":"2019","journal-title":"Journal of Civil Engineering and Management"},{"key":"key2024121104410440400_ref032","unstructured":"Keras (2023a), \u201cModel training APIs\u201d, available at: https:\/\/keras.io\/api\/models\/model_training_apis\/"},{"key":"key2024121104410440400_ref201","unstructured":"Keras (2023b), \u201cKeras\u201d, available at: https:\/\/keras.io\/ (accessed August 2023)."},{"key":"key2024121104410440400_ref033","article-title":"Supply point: an integrated system supporting e-business in the construction sector","year":"2001"},{"key":"key2024121104410440400_ref034","first-page":"123","article-title":"Application of natural language processing (NLP) and text-mining of big-data to engineering-procurement-construction (EPC) bid and contract documents","year":"2020"},{"key":"key2024121104410440400_ref035","article-title":"Adam: a method for stochastic optimisation","year":"2014"},{"key":"key2024121104410440400_ref036","first-page":"25","article-title":"Handling imbalanced datasets: a review","volume":"30","year":"2005","journal-title":"GESTS International Transactions on Computer Science and Engineering"},{"key":"key2024121104410440400_ref037","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1016\/j.autcon.2015.03.016","article-title":"Development of recursive decision making model in bilateral construction procurement negotiation","volume":"53","year":"2015","journal-title":"Automation in Construction"},{"issue":"3","key":"key2024121104410440400_ref038","doi-asserted-by":"publisher","first-page":"808","DOI":"10.3390\/civileng4030045","article-title":"Statistical descriptive analysis of Portuguese public procurement data from 2015 to 2022","volume":"4","year":"2023","journal-title":"Civil Engineering"},{"issue":"6","key":"key2024121104410440400_ref040","doi-asserted-by":"publisher","first-page":"2328","DOI":"10.1108\/ecam-12-2020-1060","article-title":"Mixed qualitative\u2013quantitative approach for bidding decisions in construction","volume":"29","year":"2022","journal-title":"Engineering, Construction and Architectural Management"},{"key":"key2024121104410440400_ref041","doi-asserted-by":"publisher","DOI":"10.1016\/j.autcon.2022.104465","article-title":"Automated detection of contractual risk clauses from construction specifications using bidirectional encoder representations from transformers (BERT)","volume":"142","year":"2022","journal-title":"Automation in Construction"},{"issue":"9","key":"key2024121104410440400_ref042","doi-asserted-by":"publisher","first-page":"3677","DOI":"10.1007\/s12205-022-1377-6","article-title":"Assessment of construction project contractor selection success factors considering their interconnections","volume":"26","year":"2022","journal-title":"KSCE Journal of Civil Engineering"},{"key":"key2024121104410440400_ref051","unstructured":"Po\u00e7as Martins, J. and Abrantes Almeida, V. (2009), \u201cModela\u00e7\u00e3o do fluxo de informa\u00e7\u00e3o no processo de constru\u00e7\u00e3o - Aplica\u00e7\u00e3o ao licenciamento autom\u00e1tico de projectos\u201d, Doctoral Thesis, University of Porto."},{"key":"key2024121104410440400_ref043","unstructured":"Ronaghan, S. (2018), \u201cThe mathematics of decision trees, random forest and feature importance in Scikit-learn and spark\u201d, available at: https:\/\/towardsdatascience.com\/the-mathematics-of-decision-trees-random-forest-and-feature-importance-in-scikit-learn-and-spark-f2861df67e3 (accessed 24 January 2023)."},{"key":"key2024121104410440400_ref044","unstructured":"Sklearn (2023a), \u201cSkLearn label encoder\u201d, available at: https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.preprocessing.LabelEncoder.html"},{"key":"key2024121104410440400_ref045","unstructured":"SKLarn (2023b), \u201cSKLearn random forest classifier\u201d, available at: https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.ensemble.RandomForestClassifier.html"},{"key":"key2024121104410440400_ref202","unstructured":"Sklearn (2023c), \u201cScikit-learn\u201d, available at: https:\/\/scikit-learn.org\/ (accessed August 2023)."},{"key":"key2024121104410440400_ref046","article-title":"Assessing the quality of natural language text data","year":"2004"},{"key":"key2024121104410440400_ref047","first-page":"254","article-title":"A neural network approach to design reality oriented cost estimate model for infrastructure projects","volume":"16","year":"2021","journal-title":"Reliability: Theory and Applications"},{"issue":"7","key":"key2024121104410440400_ref048","first-page":"845","article-title":"The automated cost estimation in construction","volume":"8","year":"2019","journal-title":"International Journal of Innovative Technology and Exploring Engineering"}],"container-title":["Engineering, Construction and Architectural Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/ECAM-09-2023-0973\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/ECAM-09-2023-0973\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T00:45:46Z","timestamp":1753404346000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/ecam\/article\/31\/13\/285-302\/1232829"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,26]]},"references-count":50,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2024,4,26]]},"published-print":{"date-parts":[[2024,12,16]]}},"alternative-id":["10.1108\/ECAM-09-2023-0973"],"URL":"https:\/\/doi.org\/10.1108\/ecam-09-2023-0973","relation":{},"ISSN":["0969-9988"],"issn-type":[{"value":"0969-9988","type":"print"}],"subject":[],"published":{"date-parts":[[2024,4,26]]}}}