Score: 0

Information fusion strategy integrating pre-trained language model and contrastive learning for materials knowledge mining

Published: June 14, 2025 | arXiv ID: 2506.12516v1

By: Yongqian Peng , Zhouran Zhang , Longhui Zhang and more

Potential Business Impact:

Helps invent new metals by reading science books.

Business Areas:
Advanced Materials Manufacturing, Science and Engineering

Machine learning has revolutionized materials design, yet predicting complex properties like alloy ductility remains challenging due to the influence of processing conditions and microstructural features that resist quantification through traditional reductionist approaches. Here, we present an innovative information fusion architecture that integrates domain-specific texts from materials science literature with quantitative physical descriptors to overcome these limitations. Our framework employs MatSciBERT for advanced textual comprehension and incorporates contrastive learning to automatically extract implicit knowledge regarding processing parameters and microstructural characteristics. Through rigorous ablation studies and comparative experiments, the model demonstrates superior performance, achieving coefficient of determination (R2) values of 0.849 and 0.680 on titanium alloy validation set and refractory multi-principal-element alloy test set. This systematic approach provides a holistic framework for property prediction in complex material systems where quantitative descriptors are incomplete and establishes a foundation for knowledge-guided materials design and informatics-driven materials discovery.

Page Count
40 pages

Category
Condensed Matter:
Materials Science