大学院講義「先端医科学研究概論」
Introduction to Publicly available Web Tools for Clinical Big Data Analysis
臨床ビッグデータ解析のための一般公開されているウェブツールの紹介
2023-10-24
PROFESSIONAL EXPERIENCE
Bioinformatics Assistant Professor 2023 April-Present
Advanced Medical Research Center, Yokohama City University
Bioinformatics researcher 2020-2023 March
Analytics team, Craif Inc., Japan
EDUCATION
University of Tsukuba 2015-2020
Ph.D. in Human Biology
Graduate School of Integrative and Global Majors
SKILLS
Research and Study Design・Statistical Analyses Planning and Implementation・Clinical Data Management・ R/Shiny/Bioconductor・Python/Streamlit/Django・JavaScript (Google apps script)・Version control using Git・Machine Learning
Sakura Eri
Maezono, Ph.D.
PROFESSIONAL EXPERIENCE
Bioinformatics Assistant Professor 2023 April-Present
Advanced Medical Research Center, Yokohama City University
EDUCATION
University of Tsukuba 2015-2020
Ph.D. in Human Biology
Graduate School of Integrative and Global Majors
SKILLS
Research and Study Design・Statistical Analyses Planning and Implementation・Clinical Data Management・ R/Shiny/Bioconductor・Python/Streamlit/Django・JavaScript (Google apps script)・Version control using Git・Machine Learning
Sakura Eri
Maezono, Ph.D.
PROFESSIONAL EXPERIENCE
Bioinformatics researcher 2020-2023 March
Analytics team, Craif Inc., Japan
EDUCATION
University of Tsukuba 2015-2020
Ph.D. in Human Biology
Graduate School of Integrative and Global Majors
SKILLS
Research and Study Design・Statistical Analyses Planning and Implementation・Clinical Data Management・ R/Shiny/Bioconductor・Python/Streamlit/Django・JavaScript (Google apps script)・Version control using Git・Machine Learning
Sakura Eri
Maezono, Ph.D.
the process of extracting valuable insights from vast and diverse datasets related to healthcare and medicine
How to access?
Databases provide instructions but it is usually via the following:
⭐️Direct download from the website ⭐️FTP server ⭐️API (Shell/Python/R)
Main purpose
Public Web tools turn data into interpretable results democratizing advanced data analysis without complex installations, programming skills, or high expenses
TCGA enabled the researchers to analyze cancer stemness in ~12,000 samples of 33 tumor types
Publication: Fujimoto K., Ito K., Saito Y., et al. Machine Learning Identifies Stemness Features Associated with Oncogenic Dedifferentiation. Cell Reports, 23(11), 3306-3320.e10, 2018. https://doi.org/10.1016/j.cell.2018.03.034
The researchers investigated AKT1, AKT2, AKT3, CHUK, GSK3β, EGFR, PTEN, and PIK3AP1 as participants of EGFR-PI3K-AKT-mTOR signaling using data from cBioPortal
Publication:Brlek, P.; Kafka, A.; Bukovac, A.; Pećina-Šlaus, N. Integrative cBioPortal Analysis Revealed Molecular Mechanisms That Regulate EGFR-PI3K-AKT-mTOR Pathway in Diffuse Gliomas of the Brain. Cancers 2021, 13, 3247. https://doi.org/10.3390/cancers13133247
DOs
Know Your Data – Understand the format and quality of your data
Take your time with Data prep – Clean and preprocess data as needed
Select Appropriate tools – Choose the right tool for your analysis
Read Documentation – Study tool guides and understand their limitations
Pay attention to Parameters – Set tool parameters carefully
Record Parameters – Keep records for reproducibility
Validate results – Verify results with independent data or experiments
Secure Data – Comply with data privacy regulations
Seek Help – Collaborate or ask for assistance if needed
DON’Ts
Misinterpret your Data – Be cautious in result interpretation
Take Data Quality for granted – Assess and preprocess data to ensure quality
Use all Data when unnecessary – Analyze relevant subsets for efficiency
Depend on one Tool – Use multiple tools for comprehensive analysis
Ignore Updates – Use the latest tool versions
Forgo Resource Check – Check hardware for computational capacity
Forget Publication Quality – Follow best practices for reporting
Neglect Ethical Considerations – Respect ethical guidelines and permissions
Teams of healthcare providers, data scientists, and researchers working together to drive innovation
Seamless incorporation of data analysis into everyday healthcare practices for data-driven decision-making and personalized care
Enhanced collaboration and sharing of data among healthcare institutions and researchers to deepen disease understanding and improve treatments
迅速な臨床ビッグデータクリーニングと解析のための統合的ノーコードウェブアプリ
The quality of the input data is critical to the final results and their interpretation
HOWEVER, in Healthcare and Medicine, there are many examples of rich but unorganized, incomplete, and inconsistent data
Integrated no-code web app development for rapid clinical big data cleaning and analysis (Collaborators wanted!)
a free web application that allows medical practitioners to quickly and easily construct initial hypotheses from data with so-called no-code
consists of three major tools:
Contact me: sakura.maezono[at]yokohama-cu.ac.jp
Contact us: bioinfo[at]yokohama-cu.ac.jp
1. Clinical Big Data Analysis has facilitated the extraction of valuable insights from the continuously expanding healthcare and medical data
2. Bioinformatics web tools democratize advanced data analysis through accessibility, cost-effectiveness, and presence of community support; Effective use of these tools involves knowing the correct data input, documentation reading, and data validation
3. Working together, using web tools, and sharing data worldwide can enhance our understanding of diseases and address challenges such as data quality, scalability, and governance in the field
Take-home message
YOU can take advantage of Bioinformatics web tools to dive into the ever-growing Clinical Big Data!
Through collaboration and sharing data, YOU can actively contribute to our collective knowledge about diseases, fostering a brighter future for healthcare
© Copyright 2023, YCU Bioinformatics Laboratory