This is an overview page with metadata for this scholarly work. The full article is available from the publisher.
AI in Biocuration: Challenges, Opportunities, and a Roadmap for Sustainable Integration
Citations: 0
Authors: 8
Year: 2026
Abstract
Biocuration is the integration of biological information into a database for the enhancement of research. Curation of these databases, or biodata resources, is challenged by the exponential growth of the scientific literature. Integration of machine learning and artificial intelligence methods into biocuration workflows may help address this challenge. We report on the discussions, ideas, and recommendations gathered from a workshop “AI and biodata resources: implications for sustainability and best practices in biocuration” at the 18th Annual International Biocuration Conference 2025. Participants agreed that while AI offers transformative potential for efficiency and expanded curatorial capacity, its integration faces substantial hurdles. Key challenges revolve around data and model quality, including the risk of hallucinations and the need for human validation across all AI outputs. Reproducibility issues due to the stochastic nature of modern models, and a lack of open, domain-specific training datasets further compound these problems. Broader concerns involve inconsistent data standards, underdeveloped ontologies, and infrastructural barriers such as handling unstructured data and integrating with legacy systems, which are especially burdensome for smaller, underfunded teams. Despite these issues, several successful AI applications were highlighted, including tools for literature summarization and workflow assistance. However, participants emphasized the need for a refined model of human-AI collaboration. This requires clear data provenance and transparency, new skills, and a critical approach to avoid over-reliance on AI-generated data. The workshop ultimately calls for concerted efforts in infrastructure development, standardization, training, and quality assurance to guide the community toward effective human-AI collaboration that maintains scientific rigor.