Trustworthy generative AI for computing systems: A review of safety, evaluation, and governance mechanisms

Saif Safaa Shakir; Hasan Fadhil Qasim; Huda Najim Abdulwahed

doi:10.17977/um031v13i12026p061

Trustworthy generative AI for computing systems: A review of safety, evaluation, and governance mechanisms

Authors

Saif Safaa Shakir University of Al-Qadisiyah
Hasan Fadhil Qasim University of Misan https://orcid.org/0009-0001-8762-8276
Huda Najim Abdulwahed University of Al-Qadisiyah https://orcid.org/0000-0002-5518-4885

DOI:

https://doi.org/10.17977/um031v13i12026p061

Abstract

More and more generative AI systems which includes large language models, diffusion models, and multimodal foundations are being integrated into crucial computing infrastructure, including cloud orchestration, code synthesis pipeline, healthcare decision support, and financial risk assessment. Consequently, there is greater demand for frameworks that can evaluate, guarantee, and regulate the trustworthiness of these systems. This article reviewed the development of trustworthy AI research from 2015 to 2025, and the evidence generated across four primary areas: safety and alignment, robustness and reliability, evaluation, and governance. We delivered distinctive comparative assessments of safety benchmarks, alignment methodologies (RLHF, RLAIF, DPO, Constitutional AI), and formal governance frameworks worldwide, pinpointing the critical discrepancies between regulated objectives and actual technical capability. A key finding is the Evaluation Paradox: The benchmarks most commonly relied on to certify systems as “AI safe” are, in fact, the systems least robust to distributional shift and adversarial manipulation. There is an institutional misalignment between the speed of generative AI deployment and the maturity of the governance mechanisms proposed to regulate it. We documented seven priority research challenges for the field. Researchers, system engineers, policymakers, and practitioners pursuing an evidence-based understanding of the current state-of-trustworthiness will benefit from this review.

Author Biographies

Saif Safaa Shakir, University of Al-Qadisiyah

College of Computer Science and Information Technology

Hasan Fadhil Qasim, University of Misan

College of Agriculture

Huda Najim Abdulwahed, University of Al-Qadisiyah

College of Arts, University of Al-Qadisiyah, Iraq

Downloads

Published

2026-05-20

How to Cite

Shakir, S. S., Qasim, H. F., & Abdulwahed, H. N. (2026). Trustworthy generative AI for computing systems: A review of safety, evaluation, and governance mechanisms. Jurnal Inovasi Dan Teknologi Pembelajaran, 13(1), 61–73. https://doi.org/10.17977/um031v13i12026p061

Download Citation

Issue

Vol. 13 No. 1 (2026)

Section

Articles

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Jurnal Inovasi dan Teknologi Pembelajaran allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles and allow readers to use them for any other lawful purpose. The journal allows the author(s) to hold the copyright without restrictions. Finally, the journal allows the author(s) to retain publishing rights without restrictions.

Authors are allowed to archive their submitted articles in an open access repository.
Authors are allowed to archive the final published article in an open access repository with an acknowledgment of its initial publication in this journal.