Information fusion strategy integrating pre-trained language model and contrastive learning for materials knowledge mining
By: Yongqian Peng, Zhouran Zhang, Longhui Zhang and more
Potential Business Impact:
Helps invent new metals by reading science books.
Machine learning has revolutionized materials design, yet predicting complex properties like alloy ductility remains challenging due to the influence of processing conditions and microstructural features that resist quantification through traditional reductionist approaches. Here, we present an information fusion architecture that integrates domain-specific texts from materials science literature with quantitative physical descriptors to overcome these limitations. Our framework employs MatSciBERT for textual comprehension and incorporates contrastive learning to automatically extract implicit knowledge regarding processing parameters and microstructural characteristics. In ablation studies and comparative experiments, the model demonstrates superior performance, achieving coefficient of determination (R²) values of 0.849 and 0.680 on the titanium alloy validation set and the refractory multi-principal-element alloy test set, respectively. This approach provides a framework for property prediction in complex material systems where quantitative descriptors are incomplete and establishes a foundation for knowledge-guided materials design and informatics-driven materials discovery.
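To make the two technical ingredients of the abstract concrete, the following is a minimal sketch of a contrastive objective that pulls paired text and descriptor embeddings together, plus the R² metric used to report results. The abstract does not specify the loss or the fusion design, so the symmetric InfoNCE form, the temperature value, and the function names (`info_nce`, `r2_score`) are assumptions for illustration. The toy vectors stand in for MatSciBERT text embeddings and physical-descriptor embeddings; no real model is loaded.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def info_nce(text_emb, desc_emb, temperature=0.1):
    """Symmetric InfoNCE loss over a batch of paired embeddings.

    Assumption: row i of text_emb and row i of desc_emb describe the same
    alloy (a positive pair); every other combination is a negative.
    """
    t = [l2_normalize(v) for v in text_emb]
    d = [l2_normalize(v) for v in desc_emb]
    n = len(t)
    # Temperature-scaled cosine similarity between every text/descriptor pair.
    logits = [
        [sum(a * b for a, b in zip(t[i], d[j])) / temperature for j in range(n)]
        for i in range(n)
    ]

    def mean_cross_entropy(rows):
        # Cross-entropy with the diagonal entry as the correct class,
        # using the log-sum-exp trick for numerical stability.
        total = 0.0
        for i, row in enumerate(rows):
            m = max(row)
            log_z = m + math.log(sum(math.exp(x - m) for x in row))
            total += log_z - row[i]
        return total / len(rows)

    columns = [list(c) for c in zip(*logits)]
    # Average the text-to-descriptor and descriptor-to-text directions.
    return 0.5 * (mean_cross_entropy(logits) + mean_cross_entropy(columns))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((a - b) ** 2 for a, b in zip(y_true, y_pred))
    ss_tot = sum((a - mean) ** 2 for a in y_true)
    return 1.0 - ss_res / ss_tot


# Toy batch: two alloys, 2-dimensional embeddings (hypothetical values).
text = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[1.0, 0.0], [0.0, 1.0]]
shuffled = [[0.0, 1.0], [1.0, 0.0]]

loss_aligned = info_nce(text, aligned)
loss_shuffled = info_nce(text, shuffled)
```

In this toy batch the loss is near zero when each text embedding matches its own descriptor embedding and large when the pairs are swapped, which is the signal a contrastive stage would use to align the two modalities before a regression head predicts the property.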
Similar Papers
A Materials Map Integrating Experimental and Computational Data via Graph-Based Machine Learning for Enhanced Materials Discovery
Materials Science
Helps scientists find new materials faster.
LLM-Fusion: A Novel Multimodal Fusion Model for Accelerated Material Discovery
Materials Science
Finds new materials faster and better.
MatMMFuse: Multi-Modal Fusion model for Material Property Prediction
Machine Learning (CS)
Finds new materials faster by combining two computer brains.