Blockchain-Powered Data Provenance for AI Model Audits

Authors

  • Prof.(Dr.) Arpit Jain K L E F Deemed University Vaddeswaram, Andhra Pradesh 522302, India Author

DOI:

https://doi.org/10.63345/sjaibt.v1.i1.104

Keywords:

Blockchain, Data Provenance, AI Auditing, Explainability, Model Transparency, Compliance, Immutable Logs, Zero-Knowledge Proofs

Abstract

Artificial Intelligence (AI) models are increasingly integrated into high-stakes domains such as finance, healthcare, autonomous systems, and legal decision-making. As their influence expands, concerns about accountability, fairness, transparency, and regulatory compliance have become central to both researchers and practitioners. One of the key challenges is auditing AI models in a manner that is tamper-proof, verifiable, and compliant with evolving regulatory frameworks. Traditional auditing mechanisms rely heavily on centralized logs and organizational trust, which creates vulnerabilities in terms of manipulation, incomplete records, and opacity in data flows. Blockchain technology—owing to its immutable, decentralized, and transparent nature—offers a powerful paradigm for establishing data provenance in AI auditing. By ensuring traceability of datasets, model updates, training logs, and inference outcomes, blockchain can provide regulators, stakeholders, and organizations with reliable audit trails.

This paper presents a comprehensive exploration of blockchain-powered data provenance for AI model audits. It analyzes the limitations of current audit systems, evaluates how distributed ledger systems can strengthen accountability, and proposes an integrated framework that combines blockchain with cryptographic verification, zero-knowledge proofs, and federated logging to ensure verifiability without exposing sensitive data. The study synthesizes contributions from literature, presents a methodology for deploying blockchain-based provenance systems in AI pipelines, and evaluates potential results in terms of efficiency, compliance traceability, and security. Simulation experiments suggest that blockchain-enabled audits improve transparency, reduce fraudulent activities in AI operations, and enhance compliance readiness by more than 50% compared to traditional audit approaches.

Downloads

Download data is not yet available.

References

https://static-cse.canva.com/blob/1420680/long-form_data-flow-diagram_section-1_asset-1.png

https://www.researchgate.net/publication/369558035/figure/fig1/AS:11431281131071144@1680020150719/A-simple-example-provenance-graph-where-observations-are-mapped-to-encounters-to-be.ppm

• Abu-Salah, B., Elsawy, A., & Al-Qutayri, M. (2022). Blockchain for trustworthy machine learning: A survey. IEEE Access, 10(1), 10345–10367. https://doi.org/10.1109/ACCESS.2022.3142136

• Agarwal, R., Gans, J. S., & Goldfarb, A. (2023). Accountability in artificial intelligence: Auditing and explainability. Journal of Economic Perspectives, 37(3), 153–174. https://doi.org/10.1257/jep.37.3.153

• Al-Bassam, M. (2019). Blockchain-based decentralized cloud computing. Future Generation Computer Systems, 90(1), 549–561. https://doi.org/10.1016/j.future.2018.07.016

• Azaria, A., Ekblaw, A., Vieira, T., & Lippman, A. (2016). MedRec: Using blockchain for medical data access and permission management. Proceedings of the 2nd International Conference on Open and Big Data (OBD), 25–30. https://doi.org/10.1109/OBD.2016.11

• Bansal, G., Chowdhury, O., & Mukherjee, S. (2021). Data provenance for accountable AI systems. ACM Computing Surveys, 54(8), 1–35. https://doi.org/10.1145/3460319

• Bhattacharya, P., & Tanwar, S. (2022). Blockchain and AI for security and privacy in healthcare: Opportunities and challenges. Journal of Ambient Intelligence and Humanized Computing, 13(3), 1531–1549. https://doi.org/10.1007/s12652-021-03024-5

• Carminati, B., Ferrari, E., & Rondanini, S. (2020). Blockchain-based data governance in distributed systems. Future Generation Computer Systems, 111(1), 324–338. https://doi.org/10.1016/j.future.2020.05.025

• Chen, M., Hao, Y., Cai, Y., Wang, L., & Song, J. (2020). Blockchain for secure and reliable AI. Computer Communications, 153(1), 372–380. https://doi.org/10.1016/j.comcom.2020.02.041

• Dedeoglu, V., Kanhere, S., Jurdak, R., & Kanhere, A. (2021). Blockchain for AI: Review and open research challenges. IEEE Access, 9(1), 141120–141145. https://doi.org/10.1109/ACCESS.2021.3119093

• Doshi-Velez, F., & Kim, B. (2018). Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608. https://doi.org/10.48550/arXiv.1702.08608

• Gaurav, A., Singh, A., & Patel, S. (2021). Blockchain-enabled auditability of machine learning algorithms. International Journal of Information Management, 58(1), 102317. https://doi.org/10.1016/j.ijinfomgt.2020.102317

• Hashmi, M. A., & Hassan, M. M. (2022). Data integrity and provenance in blockchain-enabled AI systems. Journal of Network and Computer Applications, 200(1), 103332. https://doi.org/10.1016/j.jnca.2022.103332

• Kamble, S., Gunasekaran, A., & Gawankar, S. (2019). Achieving sustainable performance in a data-driven agriculture supply chain. International Journal of Production Economics, 219(1), 409–421. https://doi.org/10.1016/j.ijpe.2019.06.010

• Kouhizadeh, M., Sarkis, J., & Zhu, Q. (2020). Blockchain technology and the sustainable supply chain: Theoretically exploring adoption barriers. International Journal of Production Economics, 231(1), 107831. https://doi.org/10.1016/j.ijpe.2020.107831

• Kshetri, N. (2021). Blockchain-enabled artificial intelligence: Opportunities and challenges. IT Professional, 23(2), 75–80. https://doi.org/10.1109/MITP.2021.3059297

• Liu, Y., Wu, Y., & Xu, H. (2022). Provenance tracking for machine learning pipelines using blockchain. Future Generation Computer Systems, 134(1), 123–136. https://doi.org/10.1016/j.future.2022.04.017

• Mökander, J., Axente, M., Casolari, F., & Floridi, L. (2023). Auditing large language models: A governance perspective. AI & Society, 38(2), 543–556. https://doi.org/10.1007/s00146-022-01399-0

• Nakamoto, S. (2008). Bitcoin: A peer-to-peer electronic cash system. White Paper. https://bitcoin.org/bitcoin.pdf

• Wang, S., Ouyang, L., Yuan, Y., Ni, X., Han, X., & Wang, F. (2019). Blockchain-enabled smart contracts: Applications, challenges, and future trends. Computer Networks, 151(1), 147–166. https://doi.org/10.1016/j.comnet.2019.01.002

• Zyskind, G., Nathan, O., & Pentland, A. (2015). Decentralizing privacy: Using blockchain to protect personal data. 2015 IEEE Security and Privacy Workshops, 180–184. https://doi.org/10.1109/SPW.2015.27

Downloads

Published

06-01-2024

Issue

Section

Original Research Articles

How to Cite

Blockchain-Powered Data Provenance for AI Model Audits. (2024). Scientific Journal of Artificial Intelligence and Blockchain Technologies, 1(1), Jan (28-36). https://doi.org/10.63345/sjaibt.v1.i1.104

Similar Articles

41-50 of 105

You may also start an advanced similarity search for this article.