Score: 0

Detection and Analysis of Sensitive and Illegal Content on the Ethereum Blockchain Using Machine Learning Techniques

Published: December 19, 2025 | arXiv ID: 2512.17411v1

By: Xingyu Feng

Potential Business Impact:

Finds bad stuff hidden on the internet.

Business Areas:

Ethereum Blockchain and Cryptocurrency

Blockchain technology, lauded for its transparent and immutable nature, introduces a novel trust model. However, its decentralized structure raises concerns about potential inclusion of malicious or illegal content. This study focuses on Ethereum, presenting a data identification and restoration algorithm. Successfully recovering 175 common files, 296 images, and 91,206 texts, we employed the FastText algorithm for sentiment analysis, achieving a 0.9 accuracy after parameter tuning. Classification revealed 70,189 neutral, 5,208 positive, and 15,810 negative texts, aiding in identifying sensitive or illicit information. Leveraging the NSFWJS library, we detected seven indecent images with 100% accuracy. Our findings expose the coexistence of benign and harmful content on the Ethereum blockchain, including personal data, explicit images, divisive language, and racial discrimination. Notably, sensitive information targeted Chinese government officials. Proposing preventative measures, our study offers valuable insights for public comprehension of blockchain technology and regulatory agency guidance. The algorithms employed present innovative solutions to address blockchain data privacy and security concerns.

Bitcoin's Edge: Embedded Sentiment in Blockchain Transactional Data

Machine Learning (CS)

Reads secret messages on blockchains to guess prices.

18 Apr 2025 1

86%

Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection

Cryptography and Security

Finds hidden bugs in computer money code.

4 Jan 2025 0

86%

Identification of Malicious Posts on the Dark Web Using Supervised Machine Learning

Cryptography and Security

Finds bad guys talking on the dark web.

28 Nov 2025 1

View PDF Login to Bookmark

Page Count

15 pages

Detection and Analysis of Sensitive and Illegal Content on the Ethereum Blockchain Using Machine Learning Techniques

Finds bad stuff hidden on the internet.

Technical Abstract

Bitcoin's Edge: Embedded Sentiment in Blockchain Transactional Data

Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection

Identification of Malicious Posts on the Dark Web Using Supervised Machine Learning