华南预防医学 ›› 2025, Vol. 51 ›› Issue (8): 875-880.doi: 10.12183/j.scjpm.2025.0875

• 论著 • 上一篇    下一篇

卵巢癌高风险人群精准筛查路径构建研究

丁慧, 季晓云, 方原   

  1. 上海市第一妇婴保健院,同济大学附属妇产科医院,上海 201204
  • 收稿日期:2024-12-18 出版日期:2025-08-20 发布日期:2025-09-16
  • 通讯作者: 方原,E-mail:13816279188@163.com
  • 作者简介:丁慧(1982—),女,硕士研究生,主治医师,研究方向为妇科疾病临床及基础研究
  • 基金资助:
    上海市青年科技英才扬帆计划(17YF1411800)

Development of a precision screening pathway for high-risk ovarian cancer populations

DING Hui, JI Xiaoyun, FANG Yuan   

  1. The First Maternity and Infant Hospital, Tongji University Affiliated Obstetrics and Gynecology Hospital, Shanghai 201204, China
  • Received:2024-12-18 Online:2025-08-20 Published:2025-09-16

摘要: 目的 通过分析卵巢癌高风险人群发生卵巢癌的影响因素,构建列线图模型,以优化现有的卵巢癌高风险人群筛查路径。方法 采用2020年1月至2021年10月上海市第一妇婴保健院就诊的卵巢癌高风险人群资料,根据7∶3比例分为训练集(n=385)和验证集(n=165),训练集患者再根据随访36个月后卵巢癌的确诊情况分为发生组和未发生组。通过单因素分析和多因素logistic回归模型分析训练集中卵巢癌高风险人群发生卵巢癌的影响因素,采用R软件构建列线图模型,并通过受试者工作特征曲线(ROC)、校正曲线、Hosmer-Lemeshow拟合优度检验、决策曲线验证模型内外部验证。结果 训练集与验证集的年龄、BMI、卵巢癌家族史等基线特征分布差异无统计学意义(P>0.05)。单因素分析和多因素logistic回归分析结果显示,糖类抗原125(CA125)(>35 U/mL OR=4.705)、糖类抗原199(CA199)(>37 U/mL OR=3.322)、人附睾蛋白4(HE4)(pmol/L OR=12.272)、卵巢恶性肿瘤风险算法(ROMA)指数(高风险OR=7.615)、搏动指数(PI)(低水平OR=3.646)、阻力指数(RI)(低水平OR=2.475)是高风险人群发生卵巢癌的危险因素(均P<0.05)。构建的预测高风险人群发生卵巢癌的列线图模型评价与验证结果显示,ROC曲线提示模型区分度良好[训练集的曲线下面积(AUC)为0.930(95% CI:0.887~0.973),验证集的AUC为0.921(95% CI:0.861~0.981)],校正曲线、Hosmer-Lemeshow拟合优度检验提示模型的拟合度较高(训练集:R2=0.576,χ2=11.347,P=0.124;验证集:R2=0.539,χ2=6.603,P=0.471)。决策曲线显示,训练集模型在0.00~0.91阈值概率范围内具有较高的净收益率,验证集模型在0.00~0.74阈值概率范围内具有较高的净收益率。结论 CA125、CA199、HE4、ROMA指数、PI、RI是卵巢癌高风险人群发生卵巢癌的影响因素,基于上述因素构建的列线图模型可以较为精准地预测高风险人群发生卵巢癌的风险,据此形成风险分层或可辅助临床精准筛查卵巢癌患者。

关键词: 卵巢癌高风险人群, 糖类抗原125, 人附睾蛋白4, 列线图模型, 受试者工作特征曲线, 决策曲线分析

Abstract: Objective To analyze the risk factors associated with the incidence of ovarian cancer in a high-risk population and to construct a nomogram model to optimize existing screening pathways. Methods This study retrospectively analyzed data from 550 individuals at high risk for ovarian cancer who were seen at the Shanghai First Maternity and Infant Hospital between January 2020 and October 2021. The cohort was randomly divided into a training set (n=385) and a validation set (n=165) at a 7∶3 ratio. Patients in the training set were subsequently categorized into an occurrence group and a non-occurrence group based on confirmed ovarian cancer diagnosis following a 36-month follow-up period. Univariate and multivariate logistic regression analyses were conducted to identify independent predictors for the development of ovarian cancer in the training set. A nomogram was constructed using R software. The model's performance was evaluated through internal and external validation using Receiver Operating Characteristic (ROC) curves, calibration curves, the Hosmer-Lemeshow goodness-of-fit test, and Decision Curve Analysis (DCA). Results There were no statistically significant differences in the distribution of baseline characteristics, such as age, BMI, and family history of ovarian cancer, between the training and validation sets (P>0.05). Both univariate and multivariate logistic regression analyses identified elevated levels of carbohydrate antigen 125 (CA125) (>35 U/mL; OR=4.705), carbohydrate antigen 199 (CA199) (>37 U/mL; OR=3.322), and human epididymis protein 4 (HE4) (pmol/L; OR=12.272), a high-risk classification on the risk of ovarian malignancy algorithm (ROMA) index (OR=7.615), and low levels of the Pulsatility Index (PI) (OR=3.646) and resistance index (RI) (OR=2.475) as independent risk factors for ovarian cancer incidence in the high-risk population (all P<0.05). The evaluation and validation of the predictive nomogram demonstrated excellent discriminatory power, with an area under the curve (AUC) of 0.930 (95% CI: 0.887-0.973) in the training set and 0.921 (95% CI: 0.861-0.981) in the validation set. Furthermore, the calibration curves and the Hosmer-Lemeshow test signified a high degree of model calibration (Training set: =0.576, χ2=11.347, P=0.124; Validation set: =0.539, χ2=6.603, P=0.471). Decision Curve Analysis indicated that the model offered a substantial net benefit across a wide range of threshold probabilities (0.00-0.91 in the training set and 0.00-0.74 in the validation set). Conclusions CA125, CA199, HE4, the ROMA index, PI, and RI are significant predictors for the development of ovarian cancer in high-risk individuals. The nomogram constructed based on these factors can accurately predict the risk of ovarian cancer. Therefore, its application for risk stratification may serve as a valuable tool to facilitate precision clinical screening for ovarian cancer.

Key words: High-risk ovarian cancer population, Carbohydrate antigen 125, Human epididymis protein 4, Nomogram model, Receiver operating characteristic curve, Decision curve analysis

中图分类号: 

  • R731.31