SlideChain: Semantic Provenance for Lecture Understanding via Blockchain Registration
By: Md Motaleb Hossen Manik, Md Zabirul Islam, Ge Wang
Potential Business Impact:
Makes AI-made lessons trustworthy and checkable.
Modern vision--language models (VLMs) are increasingly used to interpret and generate educational content, yet their semantic outputs remain challenging to verify, reproduce, and audit over time. Inconsistencies across model families, inference settings, and computing environments undermine the reliability of AI-generated instructional material, particularly in high-stakes and quantitative STEM domains. This work introduces SlideChain, a blockchain-backed provenance framework designed to provide verifiable integrity for multimodal semantic extraction at scale. Using the SlideChain Slides Dataset-a curated corpus of 1,117 medical imaging lecture slides from a university course-we extract concepts and relational triples from four state-of-the-art VLMs and construct structured provenance records for every slide. SlideChain anchors cryptographic hashes of these records on a local EVM (Ethereum Virtual Machine)-compatible blockchain, providing tamper-evident auditability and persistent semantic baselines. Through the first systematic analysis of semantic disagreement, cross-model similarity, and lecture-level variability in multimodal educational content, we reveal pronounced cross-model discrepancies, including low concept overlap and near-zero agreement in relational triples on many slides. We further evaluate gas usage, throughput, and scalability under simulated deployment conditions, and demonstrate perfect tamper detection along with deterministic reproducibility across independent extraction runs. Together, these results show that SlideChain provides a practical and scalable step toward trustworthy, verifiable multimodal educational pipelines, supporting long-term auditability, reproducibility, and integrity for AI-assisted instructional systems.
Similar Papers
S-Chain: Structured Visual Chain-of-Thought For Medicine
Machine Learning (CS)
Helps AI doctors explain medical images better.
ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models
CV and Pattern Recognition
Helps computers understand stories told with pictures.
VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
CV and Pattern Recognition
Makes videos show cause and effect better.