J Med Life Sci > Volume 23(1); 2026 > Article
Kim, Kang, Lee, Seo, Roh, Choi, Kim, and Kim: Experience in establishing the female breast and genital diseases with microbiome biobank network
Biobanks are essential in modern biomedical research, enabling the systematic collection, storage, and distribution of biospecimens and associated clinical data to advance precision medicine [1]. Collaborative biobank networks are valuable in large-scale or rare-disease research [2], because they enhance data interconnectivity, standardization, and quality assurance [3].
Female breast and genital diseases, including breast, ovarian, and cervical cancers, remain a major global health burden [4], of which breast cancer is the most common malignancy, with increasing incidence in younger populations and marked molecular heterogeneity [5]. Ovarian cancer is often diagnosed at advanced stages and has the poorest prognosis [6]. Although cervical cancer is largely prevented through human papillomavirus vaccination and screening, it still causes significant morbidity and mortality worldwide [7]. However, the molecular mechanisms, tumor microenvironments, and microbiome interactions involved in these diseases remain unclear.
Therefore, the Female Breast and Genital Diseases with Microbiome Biobank Network (FDMNet) was established in 2021 as a multi-institutional collaborative biobank network to collect and share high-quality biospecimens and harmonized clinical datasets, including microbiome resources for female-specific diseases. Additionally, a recent repeated cross-sectional survey of FDMNet highlighted usage patterns, barriers, and opportunities, underscoring the importance of streamlined governance and researcher support [8].
FDMNet comprises five hospital-based biobanks, Inje University Busan Paik Hospital (hub), Dong-A University Hospital, Inha University Hospital, Kyungpook National University Hospital, and Pusan National University Hospital, and is supported by the Korea Biobank Network (KBN). FDMNet aims to systematically collect, process, manage, and distribute high-quality biospecimens, including microbiome- related specimens, such as stool, vaginal discharge, and cervical discharge, along with linked clinical data for research on female breast and genital diseases. The network is governed by operations, standardization, quality management, and technology support committees to ensure operational efficiency and standardization and three researcher- centered working groups that promote studies on breast diseases, obstetric and gynecological diseases, and microbiome research, actively driving innovative research and maximizing the scientific value of the collected resources.
The overall operational framework of FDMNet is illustrated in Fig. 1. Eligible participants were women aged ≥19 years diagnosed with or treated for breast or female genital diseases who provided written consent for biospecimen donation. Patients with incomplete data, unrelated systemic diseases, or declining donation were excluded.
Each participating hospital collects and manages high-quality biospecimens, along with related clinical and epidemiological information. These resources were subsequently consolidated at Inje University Busan Paik Hospital, enabling standardized data integration and facilitating data sharing for research in academia and industry.
Participating hospitals collected biospecimens and related clinical data, which were transferred to Inje University Busan Paik Hospital. The hub operates an electronic medical record (EMR)-based clinical data warehouse (CDW) with an automated extract, transform, and load (ETL) process to convert deidentified data into the KBN common data model (CDM) format for standardized sharing.
Based on the KBN CDM [9], FDMNet developed and collected >100 supplementary clinical and epidemiological variables, besides 88 core variables, focusing on breast and genital diseases. These included data on cancers, pregnancy, delivery, and postpartum conditions, such as pathology, obstetric information, chemotherapy history, and ultrasonography findings.
These extended variables were integrated into the KBN portal to improve data quality and support disease-specific precision medicine research.
The participating hospitals developed CDM and ETL systems based on their institutional infrastructure. To promote research use, Inje University Busan Paik Hospital provides analysis cost support to investigators. Next-generation sequencing (NGS) and other analytical data generated through this support were deposited in the biobank. For the breast disease cohort, microbiome NGS data generated from stool samples stored at different temperatures were provided to researchers in the FASTQ format.
All datasets underwent standardized quality-control procedures, and data distribution was permitted only to researchers approved by the Institutional Review Board (IRB) approval and completed a data-use agreement.
This reciprocal process eliminates the need for researchers to reproduce the same datasets, reduces unnecessary costs, and enriches resource diversity, thereby strengthening its value as a collaborative research network.
All sites were operated under IRB-approved protocols with informed consent. Biospecimen and data management complied with the human biospecimen management guidelines of the National Biobank of Korea (NBK) and relevant international standards. Nine standard operating procedures covered the entire biospecimen lifecycle. Key quality metrics, DNA purity (A260/280) ≥1.8 and RNA integrity number (RIN) ≥7.0, were consistently met across sites. Participation in NBK external proficiency testing ensures benchmarking and continuous improvement in resource quality.
Between 2021 and 2023, the FDMNet enrolled 7,421 participants (Table 1), including 3,297 with breast cancer, 2,832 with female genital cancer, and 1,292 with placenta-related diseases. The collected biospecimens included 1,041 tumor tissues and 767 normal tissues (often as matched pairs), 4,192 whole blood samples, 6,472 plasma samples, 5,920 serum samples, 5,893 buffy coat samples, and 2,492 urine samples. Microbiome-related specimens totaled 1,711 (vaginal discharge, cervical discharge, and stool). Additional resources included 4,297 imaging datasets (digital pathology, magnetic resonance imaging, computed tomography, mammography, ultrasonography, and positron emission therapy), 405 omics datasets (including NGS), and 307 tissue microarrays (TMAs) for gynecologic and breast cancers.
Between 2021 and 2023, FDMNet distributed 40 disease-specific biospecimens and datasets, including 32 academic and eight industry projects. These resources support studies on biomarker discovery, microbiome-based diagnostics, and precision medicine.
A joint academic-industry study published in The Breast Journal demonstrated that multi-microRNA analysis improved mammography-based breast cancer risk assessment [10]. These outcomes highlight the scientific and translational value of FDMNet resources.
Overall, the FDMNet integrates standardized biospecimens with clinical, epidemiological, imaging, and microbiome data to build a sustainable infrastructure for women’s precision medicine. Its disease-specific focus on female breast and genital diseases enables the efficient integration of tailored data and biospecimens, serving as a scalable model for disease-focused biobank networks.

Notes

ACKNOWLEDGEMENTS

We thank the members of the Inje Biobank, Hye In Lee, Jimin Yu, Ji Eun Lee, and Jin Hui Kim for their valuable contributions and support.

CONFLICT OF INTEREST

No potential conflict of interest relevant to this article was reported.

FUNDING

This study was supported by the Korea National Institute of Health (KNIH) Research Project (Project No. 2024ER050900).

Figure 1.
Operational framework of FDMNet. Participating hospitals collect and manage human biospecimens with related clinical and epidemiological information and support resource distribution. All collected data are transferred to Inje University Busan Paik Hospital, for integration and uploaded to the KBN portal, enabling standardized data sharing and facilitating research use. To promote research, Inje University Busan Paik Hospital provides analysis cost support, and NGS and other analysis data generated through this support are deposited back to the biobank, thereby enhancing the diversity and use of its resources. KBN: Korea Biobank Network, CDM: common data model, ETL: extract, transform, and load, FDMNet: Female Breast and Genital Diseases with Microbiome Biobank Network, NGS: next-generation sequencing.
jmls-2025-12-01f1.jpg
Table 1.
Participants and biospecimens collected by FDMNet from 2021 to 2023, categorized by disease type
Disease N Biospecimen type
Tissue (tumor/normal) Tissue block Whole blood Plasma Serum Buffy coat Urine Vaginal discharge Cervical discharge Stool
Breast cancer 3,297 476/234 1,060 1,201 2,321 1,693 2,154 2,242 2 2 628
Female genital cancer 2,832 385/93 226 2,365 3,402 3,511 2,994 234 501 448 9
Placenta-related disease 1,292 180/440 172 626 749 716 745 16 105 16 0
Total 7,421 1,041/767 1,458 4,192 6,472 5,920 5,893 2,492 608 466 637

Specimen types include tumor and normal tissues, tissue blocks, whole blood, plasma, serum, buffy coat, urine, vaginal discharge, cervical discharge, and stool. When feasible, tumors and matched normal tissues were collected from the same participant to enable comparative analyses of molecular features and the tumor microenvironment. Additional resources include imaging datasets, omics data, and tissue microarrays (TMAs) to support multidimensional research in women’s health. Differences in specimen counts across disease categories reflect recruitment priorities and clinical case availability, rather than deviations from standardized collection protocol.

FDMNet: Female Breast and Genital Diseases with Microbiome Biobank Network, N: number of participants.

REFERENCES

1. Kim EY. Biobanks as a treasury for precision medicine. Healthc Inform Res 2021;27:93-4.
crossref pmid pmc pdf
2. López-Guerrero JA, Mendiola M, Pérez-Fidalgo JA, Romero I, Torres A, Recalde D, et al. Prospective real-world gynaecological cancer clinical registry with associated biospecimens: a collaborative model to promote translational research between GEICO and the Spanish Biobank Network. Cancers (Basel) 2022;14:1965.
crossref pmid pmc
3. Devereux L, Watson PH, Mes-Masson AM, Luna-Crespo F, Thomas G, Pitman H, et al. A review of international biobanks and networks: success factors and key benchmarks - a 10-year retrospective review. Biopreserv Biobank 2019;17:512-9.
crossref pmid pdf
4. Gulyaeva L. Special issue: “Molecular signatures of gynecological cancers-breast cancer, ovarian cancer, cervical cancer, and endometrial cancer 2.0”. Int J Mol Sci 2024;25:11275.
crossref pmid pmc
5. Nik-Zainal S, Davies H, Staaf J, Ramakrishna M, Glodzik D, Zou X, et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature 2016;534:47-54.
pmid pmc
6. Torre LA, Trabert B, DeSantis CE, Miller KD, Samimi G, Runowicz CD, et al. Ovarian cancer statistics, 2018. CA Cancer J Clin 2018;68:284-96.
crossref pmid pmc pdf
7. Arbyn M, Weiderpass E, Bruni L, de Sanjosé S, Saraiya M, Ferlay J, et al. Estimates of incidence and mortality of cervical cancer in 2018: a worldwide analysis. Lancet Glob Health 2020;8:e191-203.
crossref pmid
8. Ahn S, Cho HJ, Kang MS, Seo AN, Kim L, Choi KU, et al. Barriers and opportunities in biobank utilization: insights from a 3-year repeated cross-sectional survey of the Female Breast and Genital Disease with Microbiome Biobank Network in South Korea. J Yeungnam Med Sci 2025;42:46.
crossref pmid pmc pdf
9. Korea Biobank Network. Guideline of Korea Biobank Network common data model (KBN CDM) v2.1 [Internet]. Seoul: KBN Portal; c2022 [cited 2025 Oct 23]. Available from: https://www.kbn.re.kr/board/view.do;jsessionid=A8F0D38A9F4A27CAD64D-6A143A398BB9.
10. Song JE, Jang JY, Kang KN, Jung JS, Kim CW, Kim AS. MultimicroRNA analysis can improve the diagnostic performance of mammography in determining breast cancer risk. Breast J 2023;2023:9117047.
crossref pmid pmc pdf
TOOLS
METRICS Graph View
  • 0 Crossref
  •  0 Scopus
  • 725 View
  • 19 Download
ORCID iDs

Woo-Young Kim
https://orcid.org/0000-0003-1234-4823

Mi-Seon Kang
https://orcid.org/0000-0001-9332-8096

Ja Young Lee
https://orcid.org/0000-0001-5534-8248

An Na Seo
https://orcid.org/0000-0001-6412-3067

Mee Sook Roh
https://orcid.org/0000-0002-5676-5569

Kyung Un Choi
https://orcid.org/0000-0002-3848-1781

Lucia Kim
https://orcid.org/0000-0002-4100-6607

Eun-Young Kim
https://orcid.org/0000-0003-0125-2173

Related articles


ABOUT
BROWSE ARTICLES
EDITORIAL POLICY
FOR CONTRIBUTORS
Editorial Office
102, Jejudaehak-ro, Jeju-si, Jeju-do 63243, Republic of Korea
Tel: +82-64-754-8023    E-mail: jmls.editor@e-jmls.org                

Copyright © 2026 by Jeju National University Institute for Medical Science.

Developed in M2PI

Close layer
prev next