Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A Blueprint for Open Science: How Transatlantic Teams Built and Deployed Knowledge Graphs to Enable Biological (AI) Models
0
Zitationen
53
Autoren
2025
Jahr
Abstract
Knowledge graphs (KGs) and large language models (LLMs) are increasingly applied in biomedical research; however, LLMs' tendency to hallucinate and lack of evidence traceability poses significant challenges for rigorous scientific applications. To address these limitations, the NVIDIA - AWS Open Data Knowledge Graph Hackathon, which brought together transatlantic teams, catalyzed the development of novel frameworks that built or integrated KGs with graph-based retrieval-augmented generation (GraphRAG) to enhance evidence-grounded generative AI. The hackathon took place on October 1-3, 2025, at two locations - the AWS Skills Center in Arlington, VA, USA and the European Bioinformatics Institute (EBI) Training Center in Cambridge, UK. Across seven prototype projects, participating teams developed systems that construct, validate, and deploy biomedical KGs using open data and cloud-native infrastructure. These included GeNETwork, which integrates pediatric oncology datasets to identify therapeutic targets; ECoGraph, a multi-omics graph framework for characterizing colorectal cancer drivers; ClassiGraph, a graph neural network classifier for cancer subtypes; EasyGiraffe, a validator for multisite polygenicity extraction; MIDAS (Model Integration and Data Assembly System), a pipeline for harmonizing heterogeneous biomedical datasets; KG Model Garbage Collection, a framework for detecting and pruning erroneous AI-generated edges; and BioGraphRAG, which combines precision medicine and literature-derived KGs for evidence-based question answering. Together, these prototypes demonstrate practical strategies for constructing and deploying biomedical KGs and highlight the potential of GraphRAG to produce interpretable, verifiable AI-driven insights. By emphasizing open data, reproducible pipelines, and evidence-grounded reasoning, this work advances methodologies for trustworthy generative AI in biomedical discovery.
Ähnliche Arbeiten
Detecting Functionality-Specific Vulnerabilities via Retrieving Individual Functionality-Equivalent APIs in Open-Source Repositories
2025 · 16.046 Zit.
A tutorial on spectral clustering
2007 · 10.073 Zit.
Authoritative sources in a hyperlinked environment
1999 · 8.986 Zit.
The Graph Neural Network Model
2008 · 8.911 Zit.
A Comprehensive Survey on Graph Neural Networks
2020 · 8.625 Zit.
Autoren
- Mina P. Peyton
- Chao-Kong Chung
- Samarpan Mohanty
- Van Q. Truong
- Andrew Scouten
- Sangeeta Shukla
- Chantera Lazard
- Daniall Masood
- John D. Murphy
- Anne Ketter
- Taha Mohseni Ahooyi
- Viren Amin
- Arshi Arora
- J. Allen Baron
- Fateema Bazzi
- Arun Bondali
- Nishat Anjum Bristy
- Kathleen Carter
- Yibei Chen
- Jean-Paul Courneya
- C. W. Daniels
- Marcin J. Domagalski
- Victor Felix
- Vivien W. Ho
- Michelle Holko
- Toshiaki Katayama
- Yaphet Kebede
- Miriam Kim
- R. M. Kini
- Carmen L Diaz Soria
- Anurag Limdi
- Shakuntala Mitra
- Evan Molinelli
- Erick J. Morris
- Aymen Maqsood Mulbagal
- Bino Paul
- Seeta Ramaraju Pericherla
- Kara Quaid
- Radu Robotin
- Polina Rusina
- Irene López Santiago
- Benjamin Stear
- Karthick Nanmaran
- Shilpa Sundar
- Likhitha Surapaneni
- Deanne Taylor
- Simone Weyand
- Benjamin Wingfield
- Christine Withers
- David Yu Yuan
- Emily Richardson
- Beryl Rabindran
- Ben Busby