Research on the Optimization of Pregnant Women's Characteristics and NIPT Detection Strategy Based on Data Driven Modeling
DOI:
https://doi.org/10.54097/fsvtk980Keywords:
Non Invasive Prenatal Testing, Risk Assessment, Generalized Additive Model, K-means Clustering.Abstract
This article focuses on key issues in non-invasive prenatal testing and establishes multiple regression/GAM, BMI clustering risk assessment, machine learning regression prediction, and classification models. By conducting relevant analysis to evaluate the relationship between gestational age, BMI, and Y chromosome concentration in pregnant women, multiple linear regression, second-order polynomial regression, and generalized additive model (GAM) were constructed. The results showed that the GAM model outperformed other models in R², better capturing the nonlinear effects of gestational age and BMI on Y concentration; The significance test showed that these factors had a statistically significant impact on Y concentration (p<0.05). Using K-means clustering to group pregnant women according to BMI, calculate the gestational weeks when the first Y concentration reaches 4% in each group, and construct a comprehensive risk function to consider the impact of the probability of reaching the standard and the detection time point on the risk, and solve for the optimal detection time point for each group. The results showed that there were significant differences in the first gestational age of different BMI groups, and the risk model recommended optimized detection time points for each group.This study provides a systematic, data-driven approach to optimize NIPT detection strategies by integrating regression analysis, clustering, and machine learning models. By revealing the nonlinear impact of gestational age and BMI on Y chromosome concentration and proposing differentiated detection time points for pregnant women with different BMI levels, the research not only improves the scientific basis for clinical decision-making but also reduces the risk of false negatives or delayed diagnoses. The findings contribute to more precise, personalized prenatal testing strategies, thereby enhancing the reliability and effectiveness of NIPT in practical medical applications.
Downloads
References
[1] Wang Yuanyuan, Guo Zheng, Li Guicai, etc Estimation of precipitation and its characteristics in the Three Gorges Reservoir area from 1979 to 2014 based on generalized additive model [J]. Acta Geographica Sinica, 2017, 72 (07): 1207-1220.
[2] He Sixuan, Yang Jiehao, Zhang Guoyou, etc Analysis of Vegetation Changes and Influencing Factors in Mining Damaged Areas in Central Yunnan Based on XGBoost HAP Model [J]. Surveying and Mapping Bulletin, 2025, (07):58-65.
[3] Li Xuan Research on Hospital triage model based on PCA and Random Forest [J]. China Medical Equipment, 2025, 40 (08): 38-42+76.
[4] Xing Li, Wu Shuang, Zhu Yinglin Clothing data analysis based on K-means clustering [J]. Digital Technology and Applications, 2025, 43 (04): 198-200.
[5] Zheng Xiaoliang, Dong Mengyuan, Xia Yingjie, etc Coal and gas outburst prediction based on improved Kmeans SMOTE-RF [J/OL]. Journal of Chongqing Technology and Business University (Natural Science Edition), 1-10 [September 7, 2025].
[6] Sun Y, Chen Y, Xu F, et al. Non-invasive prenatal testing for fetal chromosomal abnormalities by low-coverage whole-genome sequencing: recent progress and perspectives [J]. Frontiers in Genetics, 2021, 12: 689030.
[7] Wang L, Wang D, Xu Y, et al. Association of maternal BMI with cell-free fetal DNA fraction in non-invasive prenatal screening: a retrospective study [J]. BMC Pregnancy and Childbirth, 2022, 22 (1): 149.
[8] Zhao Q, Liu J, Chen Y, et al. Prediction of gestational diabetes mellitus using generalized additive models: a population-based study [J]. Chinese Journal of Preventive Medicine, 2023, 57 (8): 1132-1138.
[9] Li M, Zhou J, Fang H, et al. Application of machine learning models in prediction of non-invasive prenatal testing performance [J]. Scientific Reports, 2022, 12: 14587.
[10] Zhang L, Wang S, Liu J, et al. Maternal BMI classification based on K-means clustering and its association with pregnancy outcomes [J]. Maternal and Child Health Care of China, 2021, 36 (24): 5695-5699.
[11] Huang T, Ma D, Xu X, et al. Improving accuracy of NIPT by integrating maternal characteristics and sequencing features using ensemble learning [J]. Bioinformatics, 2023, 39 (3): btad079.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







