Score: 2

From Prompt to Pipeline: Large Language Models for Scientific Workflow Development in Bioinformatics

Published: July 27, 2025 | arXiv ID: 2507.20122v1

By: Khairul Alam, Banani Roy

Potential Business Impact:

Helps scientists build complex data tools easily.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

The increasing complexity of bioinformatics data analysis has made Scientific Workflow Systems (SWSs) like Galaxy and Nextflow essential for enabling scalable, reproducible, and automated workflows. However, creating and understanding these workflows remains challenging, particularly for domain experts without programming expertise. This study investigates whether modern Large Language Models (LLMs), GPT-4o, Gemini 2.5 Flash, and DeepSeek-V3, can support the generation of accurate, complete, and usable bioinformatics workflows, and examines which prompting strategies most effectively guide this process. We evaluate these models using diverse tasks such as SNP analysis, RNA-seq, DNA methylation, and data retrieval, spanning both graphical (Galaxy) and script-based (Nextflow) platforms. Expert reviewers assess the generated workflows against community-curated baselines from the Galaxy Training Network and nf-core repositories. The results show that Gemini 2.5 Flash excels in generating Galaxy workflows, while DeepSeek-V3 performs strongly in Nextflow. Prompting strategies significantly impact quality, with role-based and chain-of-thought prompts improving completeness and correctness. While GPT-4o benefits from structured inputs, DeepSeek-V3 offers rich technical detail, albeit with some verbosity. Overall, the findings highlight the potential of LLMs to lower the barrier for workflow development, improve reproducibility, and democratize access to computational tools in bioinformatics, especially when combined with thoughtful prompt engineering.

From Prompt to Pipeline: Large Language Models for Scientific Workflow Development in Bioinformatics

Software Engineering

AI helps scientists build DNA analysis tools faster.

27 Jul 2025 2

90%

Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting

Computation and Language

Tests AI's ability to solve biology problems.

6 Mar 2025 1

89%

Towards LLM-Powered Task-Aware Retrieval of Scientific Workflows for Galaxy

Software Engineering

Finds the right science tools for any job.

3 Nov 2025 2

View PDF Login to Bookmark

Country of Origin

🇨🇦 Canada

Repos / Data Links

github.com

Page Count

41 pages

From Prompt to Pipeline: Large Language Models for Scientific Workflow Development in Bioinformatics

Helps scientists build complex data tools easily.

Technical Abstract

From Prompt to Pipeline: Large Language Models for Scientific Workflow Development in Bioinformatics

Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting

Towards LLM-Powered Task-Aware Retrieval of Scientific Workflows for Galaxy