Performance Review and Improvement Metrics for Evaluating Project Delivery Methods with Large Language Models (LLMs)
Problem Statement
Alternative Delivery Methods (ADMs) such as Construction Manager/General Contractor (CM/GC) can improve project performance, risk management, and innovation compared to traditional Design-Bid-Build (DBB). However, the effectiveness of ADMs across cost, schedule, quality, and sustainability metrics is not fully assessed.

Figure 1 – Developed RAG framework for construction information retrieval.
Approach
This project integrates quantitative historical data and qualitative stakeholder insights to develop a comprehensive ADM performance evaluation framework. Open-source large language models (LLMs) such as LLaMA3 and Gemma 3, combined with vision-language models (VLMs), will analyze project reports, survey responses, and other documentation. A multi-modal Retrieval-Augmented Generation (RAG) system will be built to enable robust evaluation, prediction, and reporting of ADM performance. The final system will be packaged as a user-friendly, open-source tool for Caltrans, providing actionable insights and supporting decision-making on project delivery methods.
Ongoing Work
We are developing the multi-modal RAG framework, integrating LLMs and VLMs to process textual, visual, and tabular data. Future work will focus on building the user-friendly software interface for project stakeholders to evaluate and forecast ADM performance efficiently.
Students

Bozhou Zhuang
Yannik Hahn (VGR)
Nils Hütten (VGR)