If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Medical UltraSound Computing (MUSIC) Lab, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
Medical UltraSound Computing (MUSIC) Lab, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
Department of Ultrasound Medicine, the Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, ChinaThe Seventh Affiliated Hospital University of South China, Hunan Veterans Administration Hospital, Changsha, ChinaThe First Affiliated Hospital, Medical Imaging Centre, Hengyang Medical School, University of South China, Hengyang, ChinaInstitution of Medical Imaging, University of South China, Hengyang, China
Can a novel deep learning-based follicle volume biomarker using three-dimensional ultrasound (3D-US) be established to aid in the assessment of oocyte maturity, timing of HCG administration and the individual prediction of ovarian hyper-response?
Design
A total of 515 IVF cases were enrolled, and 3D-US scanning was carried out on HCG administration day. A follicle volume biomarker established by means of a deep learning-based segmentation algorithm was used to calculate optimal leading follicle volume for predicting number of mature oocytes retrieved and optimizing HCG trigger timing. Performance of the novel biomarker cut-off value was compared with conventional two-dimensional ultrasound (2D-US) follicular diameter measurements in assessing oocyte retrieval outcome. Moreover, demographics, infertility work-up and ultrasound biomarkers were used to build models for predicting ovarian hyper-response.
Results
On the basis of the deep learning method, the optimal cut-off value of the follicle volume biomarker was determined to be 0.5 cm3 for predicting number of mature oocytes retrieved; its performance was significantly better than the conventional method (two-dimensional diameter measurement ≥10 mm). The cut-off value for leading follicle volume to optimize HCG trigger timing was determined to be 3.0 cm3 and was significantly associated with a higher number of mature oocytes retrieved (P = 0.01). Accuracy of the multi-layer perceptron model was better than two-dimensional diameter measurement (0.890 versus 0.785) and other multivariate classifiers in predicting ovarian hyper-response (P < 0.001).
Conclusions
Deep learning segmentation methods and multivariate classifiers based on 3D-US were found to be potentially effective approaches for assessing mature oocyte retrieval outcome and individual prediction of ovarian hyper-response.
The aim of assisted reproductive technology (ART) treatment is to achieve and sustain a pregnancy. Exogenous hormones have been used in various forms for hormonal ovarian stimulation in ART with the aim of increasing follicle count (
). Although mature (metaphase II) oocytes may be obtained from follicles of any size, the mature oocyte retrieval rate was lower in groups with follicles that were too small or too large (
). If mature oocyte yield is to be maximized and complications (such as ovarian hyper-stimulation syndrome) are to be avoided, the appropriate assessment of follicular maturation and timing of mature oocyte retrieval is critical (
Correlation of Anti-Mullerian Hormone Level and Antral Follicle Count with Oocyte Number in A Fixed-Dose Controlled Ovarian Hyperstimulation of Patients of In Vitro Fertilization Program.
). The mean diameter of the leading follicle detected by transvaginal ultrasound has been commonly used to define HCG ovulation trigger injection for many years. Previous studies have indicated that HCG ovulation trigger injection should be timed when the leading follicle reaches 16–22 mm in mean diameter observed by ultrasound (
Tthe number of follicles (mean diameter ≥10 mm) measured manually is usually regarded as the basis for estimating the number of mature oocytes retrieved (
Fertility Treatment Outcomes After Follicle Tracking With Standard 2-Dimensional Sonography Versus 3-Dimensional Sonography-Based Automated Volume Count: Prospective Study.
). Nevertheless, several recent studies have questioned the accuracy of the two-dimensional measuring approach mentioned above, especially in multi-follicle development cycles because the shape of follicles is irregular, and mean diameter measured manually by two-dimensional ultrasound (2D-US) neglect the three-dimensional information of follicle shape, which may not reflect the real size of the follicle (
). As such, differences were observed between the number of mature follicles observed by ultrasound and the number of mature oocytes retrieved, which means oocyte maturity may not be optimal at the time of retrieval using the mean diameter for follicles (
CR-Unet-Based Ultrasonic Follicle Monitoring to Reduce Diameter Variability and Generate Area Automatically as a Novel Biomarker for Follicular Maturity.
Luteal phase support in in-vitro fertilization using gonadotrophin releasing hormone analogue before ovarian stimulation: a prospective randomized study of human chorionic gonadotrophin versus intramuscular progesterone.
Power doppler ultrasound assessment of the relationship between age and ovarian perifollicular blood flow in women undergoing in vitro fertilization treatment.
Retrospective cohort study: AMH is the best ovarian reserve markers in predicting ovarian response but has unfavorable value in predicting clinical pregnancy in GnRH antagonist protocol.
). The advances in three-dimensional ultrasound (3D-US) commercial techniques, i.e. SonoAVC, have made it possible to accurately monitor follicular volumes, but this semi-automatic application is limited by precision of recognition, special instrument requirements and the lack of diagnostic standards (
Fertility Treatment Outcomes After Follicle Tracking With Standard 2-Dimensional Sonography Versus 3-Dimensional Sonography-Based Automated Volume Count: Prospective Study.
). To date, methods for optimally selecting suitable patients are lacking, and are, therefore, unable to significantly improve the accuracy of prediction for mature oocytes retrieved by ultrasound (
Recently, artificial intelligence techniques have emerged as objective, standardized and efficient methods for assessing human reproduction, especially in personalized ovarian stimulation, extended embryo culture, semen analyses, pre-implantation genetic testing and embryo selection (
). For the prediction of empty follicles or with oocytes using artificial intelligence techniques, several studies have focused on using the previous ultrasound image segmentation of ovaries and follicles to promote follicular counting accuracy (
). Few studies, however, have associated the segmentation results with oocyte retrieval outcome. In our previous study, a deep learning framework was proposed for the previous segmentation of 3D-US follicle and ovary images. The dice similarity coefficient (DSC) for follicular segmentation reached 89.11%, which outperformed state-of-the-art methods in various evaluation metrics (
The aim of the present study was to expound an artificial intelligence technique that has the potential to change the management of IVF treatment. The feasibility of two approaches have been discussed to promote the efficiency of oocyte maturity and the timing of oocyte retrieval assessment based on the deep learning segmentation model and machine learning. The relationship between follicle volume biomarker automatically calculated by deep learning models on the day of HCG administration and the number of mature oocytes retrieved was assessed, follicular volume cut-off values that aid in the assessment of oocyte maturity were assessed, and its performance was compared with the two-dimensional diameter measurement. In the meantime, the optimal timing of HCG administration for IVF cycles was determined. The significant features in predicting oocyte maturation, and proposed machine learning methods as novel approaches for predicting ovarian response, were also analysed.
Materials and methods
Participants and population
A total of 515 cases of infertile women undergoing their first ovarian stimulation treatment cycles at the reproductive centre of our hospital between August 2019 and December 2020 were recruited in this prospective study. All patients met the inclusion criteria, as follows: women undergoing ovarian stimulation before IVF using a long protocol, antagonist protocol or mini-stimulation protocol; both ovaries present; and no serious systemic illness. Exclusion criteria were incomplete information or images and abnormal ovarian mass of 3 cm in diameter. Ovarian response was monitored by transvaginal ultrasound and serum oestradiol measures beginning on the fifth day of stimulation. Ovulation was triggered using recombinant HCG (1000 IU or 2000 IU) (Ovitrelle, Merck, Lyon, France) when at least three 17 mm follicles or two 18 mm follicles were observed by ultrasound, and oocyte retrieval was carried out 36 h later. The gonadotrophin releasing hormone antagonist (Cetrorelix) (Merck Serono, Darmstadt, Germany) at 250 mg/day was started on the fifth or sixth day of ovarian stimulation and was continued until the day of HCG administration. Clinical ovarian hyper-response was defined as the retrieval of more than 15 mature oocytes with oestradiol (HCG day) over 5000 ng/l. This study was approved by the local Ethics Committee of the Third Affiliated Hospital of Guangzhou Medical University (number 2018-018), and all patients who agreed to voluntarily participate signed a consent statement.
A database was created based on the demographics of previous treatment and an infertility work-up, including the cause of infertility, duration of infertility, BMI and hormonal analysis. IVF cycles were also recorded according to stimulation type and dosage, type of trigger and number of mature oocytes retrieved. Patients with any missing data were excluded.
Ultrasound examination
Voluson E8 (GE Medical Systems, Zipf, Austria) equipped with a 5- to 9-MHz transvaginal volume probe was used for all ultrasound follicular monitoring. An expert with 8 years’ experience in gynaecologic ultrasound scanned all patients recruited in the study. Each follicle measuring 10 mm or wider in mean diameter was measured by 2D-US, and the number of follicles in both ovaries was recorded, which were based on the sum of follicles in the right and left ovaries. The plane of maximal diameter of the ovary was then revealed, the image stabilized for the three-dimensional volume scanning and was stored as DICOM data. Finally, haemodynamic parameters of the stromal artery were examined, and optimal views were found to obtain a clear and stable doppler flow spectrum. The automatic imputation feature was used to determine peak systolic velocity and resistive index.
Statistical analyses
The SPSS version 22.0 package (SPSS, Chicago, IL, USA) was used for statistical analyses. Continuous variables are summarized by mean ± SD. To establish the cut-off value of the follicle volume biomarker that correlates with the outcome of mature oocyte retrieval, the ratio between selected distinct cut-off values and the number of mature oocytes retrieved (ratio = number of mature oocytes retrieved/number of follicles for selected distinct cut-off value) was calculated. That is, the maximum number of follicles at or above the cut-off would be closest to the number of mature oocytes retrieved, and other volume cut-offs either underestimate or overestimate the number of mature oocytes. The Shapiro–Wilk test was used to verify the normality of distribution. Differences in number of mature oocytes among different follicular volume groups were then analysed with either a paired t-test or a Mann–Whitney U test. Homoscedasticity of variance was verified by Levene's test. The comparison between conventional measurement and new follicle volume biomarker was analysed with paired t-tests. P < 0.05 (two-sided test) was considered statistically significant. For a better visualization of results, the average difference of follicle number counting was compared between manual method using ≥ 10 mm two-dimensional diameter measurement and three-dimensional follicle volume biomarker and plotted in a graph (difference = number of mature oocytes retrieved − number of follicle calculated by ultrasound).
Segmentation model training
An in-house developed artificial intelligence method based on deep learning was used to obtain the follicle volume biomarker. A detailed technological description of the pipeline has been published separately (
). C-Rend is a deep learning model for the accurate simultaneous segmentation of ovaries and follicles (Figure 1). This novel model overcomes the limitations of SonoAVC, i.e. unavailability of poor-quality images and time consuming image after processing. Three-dimensional U-Net is the backbone structure of C-Rend. It contains the encoder–decoder structure with skip connection, which enables end-to-end training and can directly predict 3D-US segmentation biomarkers of ovaries and follicles. On the basis of the three-dimensional U-Net backbone structure of C-Rend, several changes were made to the training. First, the dataset was enlarged to further improve performance; another independent dataset of 203 total 3D-US volumes from Mindray (Shenzhen, China) and GE (Medical Systems, Zipf, Austria) ultrasound equipment were regarded as our segmentation model dataset, of which 171 volumes were used as a training set and the remaining 32 were used for testing. Second, a larger patch size of 256 × 256 × 256 was used to reduce the ratio of fused follicles. Finally, online data augmentation methods, such as gamma transformation, scaling and random noise adding, were used to improve the robustness of the model.
Figure 1The deep learning segmentation framework. Original data of three-dimensional follicle volume biomarkers were calculated by the artificial intelligence method based on deep learning segmentation.
Pearson's correlation coefficient was used to establish the correlation between features and the oocyte retrieval outcome. In the prediction of ovarian hyper-response, univariate analysis was carried out using a threshold classifier to determine the features that had a significant correlation with the outcomes for constructing the multivariate classifier. State-of-the-art machine learning multivariate classifiers were compared in a parallel test, decision tree, support vector machine, k-nearest neighbours, random forest and multi-layer perceptron (MLP). The decision tree used decision rules recursively to partition the feature space. Support vector machine was classified through the decision boundary of the maximum-margin hyperplane. The K-nearest neighbours method predicted discrete class labels based on the nearest patterns. The random forest made the final prediction through multiple decision trees that were randomly chosen. The MLP classifier used the hidden layers containing numbers of nodes, as well as activation function and normalization operation to model the complex relationships among nodes. For each classifier, a grid search for different hyper-parameter values was carried out and the best results were reported. The diagnostic values of single variable and multivariate classifiers were assessed using accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and area under the curve (AUC). Accuracy indicated the correct classified percentage of all samples. The F1 score considered both positive predictive value and sensitivity.
Results
Patient selection
Complete data were available for 181 patients, and 334 patients were excluded. Patient selection is presented in Figure 2. Baseline characteristics of patients are presented in Table 1.
Figure 2Patient selection and study process. 3D-US, three-dimensional ultrasound; IVF-ET, IVF and embryo transfer.
In this study, data of three-dimensional follicle volume biomarkers were calculated by the artificial intelligence method based on deep learning segmentation. For predicting oocyte maturity, according to the ratio between selected distinct cut-off values and the number of mature oocytes retrieved, the results showed that a follicle volume biomarker 0.5 cm3 or above on the day of HCG administration was more likely to yield mature oocytes, which meant that the number of follicles above this size might correlate best with the number of eggs. The relationship between the number of mature oocytes and different follicular volumes is presented in Table 2. Furthermore, in the comparison of two- and three-dimensional threshold values, when a cut-off value of 0.5 cm3 was chosen, the number of follicles at or above this cut-off was closest to the number of mature oocytes retrieved. Regarding the number of mature oocytes retrieved as the gold standard, the cut-off value of 0.5 cm3 using the three-dimensional follicle volume biomarker performed significantly better than mean diameter measuring 10 mm or wider using the two-dimensional diameter measurement according to the results of a paired t-test (P < 0.001) (Table 2 and Figure 3). Average difference between follicles selected and matures oocytes and its SD using the manual method versus the automatic method were −2.04 ± 0.35 and −0.53 ± 0.31 (P = 0.001), respectively.
TABLE 2RELATIONSHIP BETWEEN THE NUMBER OF OOCYTES RETRIEVED AND DIFFERENT FOLLICULAR VOLUMES
Follicle volume/mean diameter value cut-off
Number of mature oocytes retrieved/number of follicles for selected cut-off value
When a cut-off of 0.5 cm3 is chosen, the maximum number of follicles at or above the cut-off will be closest to the number of mature oocytes retrieved. Other volume cut-offs and mean diameter cut-off (10 mm) either underestimate or overestimate the number of mature oocytes.
Figure 3Difference in follicle number between manual method and automatic method (difference = number of mature oocytes retrieved – number of follicles calculated by ultrasound). In this circumstance, the cut-off of 0.5 cm3 using three-dimensional follicle volume biomarker showed less experimental errors than the cut-off of 10 mm using two-dimensional diameter measurement.
To determine optimal leading follicle volume to optimize HCG trigger timing, experiments were conducted to explore the optimal threshold and then to validate the findings. All cases were classed by the volume of the leading follicle. In Table 3, Levene's test indicated that all groups met the homogeneity of variance. Shapiro–Wilk tests did not show evidence of normality in all groups. The result showed that a leading follicle over 3.0 cm3 using the follicle volume biomarker as a trigger criterion on HCG administration day was significantly associated with a higher number of mature oocytes retrieved. Differences between class (2, 3) cm3 and class (3, 4) cm3 were significant according to the Mann–Whitney U test (P = 0.02) (Table 3). Therefore, it was concluded that the 3.0 cm3 leading follicle volume cut-off value was significantly associated with a higher number of mature oocytes retrieved. On the basis of the results above, it was further investigated whether a 3.0 cm3 leading follicle volume cut-off value was significantly associated with a higher number of mature oocytes retrieved. Two groups, (0, 3) cm3 and (3, ∞) cm3, were set and significant differences were observed between the two (P = 0.01).
TABLE 3CORRELATION BETWEEN THE VOLUME OF LEADING FOLLICLE MEASURED BY DEEP LEARNING MODEL AND THE OUTCOME OF MATURE OOCYTES RETRIEVAL
In the validation set, the DSC of the improved segmentation deep learning model reached 88.07 for three-dimensional follicular segmentation. In univariate analysis predicting ovarian hyper-response, a total of 26 oocytes retrieval-related characteristics were evaluated (Table 4). These included background information, related hormone parameters, two-dimensional ultrasound parameters (ovarian size, haemodynamic parameters of ovarian stromal artery), total follicular count (calculated manually), number of follicles measuring 10 mm or wider (using the two-dimensional diameter measurement), and number of follicles 0.5 cm3 or above (using the three-dimensional follicle volume biomarker). FSH was excluded as these data were not collected for most patients. Oestradiol mean diameter of ovary, ovarian volume, total follicular count (manually), number of follicles measuring 10 mm or wider (using the two-dimensional diameter measurement) and number of follicles measuring 0.5 cm3 or above (using the three-dimensional follicle volume biomarker) were correlated with the outcome of mature oocyte retrieval. The performance of various characteristics in predicting ovarian hyper-response is presented in Table 4. In contrast to number of follicles (two-dimensional diameter measurement ≥10 mm), number of follicles (three-dimensional follicle volume biomarker ≥0.5 cm3) was a stronger predictor of mature oocytes retrieved (P < 0.001) (Figure 4).
TABLE 4UNIVARIATE ANALYSIS OF CHARACTERISTICS IN PREDICTING OVARIAN HYPERSTIMULATION
Pearson's correlation was used for statistical analysis. The number of oocytes retrieved was regarded as the gold standard in this study. The number of follicles (3DFV ≥0.5cm3) was significantly associated with the outcome of ovarian hyperstimulation.
2DDM, two-dimensional diameter measurement; 3DFV, three-dimensional follicle volume biomarker; ACC, accuracy; AUC, area under the curve; F1, F1 score; L, left side; NPV, negative predictive value; PPV, positive predictive value; PSV, peak systolic velocity; R, right side; RI, resistance index; SEN, sensitivity; SPE, specificity.
Figure 4Receiver operator characteristic curves of the two-dimensional diameter measurement and the three-dimensional follicle volume biomarker in predicting the number of mature oocytes retrieved. 2D, two-dimensional; 3D, three-dimensional; FP, false positive; TP, true positive.
The performance of different multivariate classifiers in predicting ovarian hyper-response is presented in Table 5 and Figure 5. The accuracy of the MLP multivariate classifier was 0.890, which was better than the two-dimensional diameter measurement (0.785) and the single use of the three-dimensional follicle volume biomarker (0.846). According to the result of the t-test, significant differences in the performance of the different models were observed, and MLP with an AUC of 0.880 (95% CI, 0.828 to 0.927) achieved significant improvement over other state-of-the-art methods.
TABLE 5PERFORMANCE OF MULTIVARIATE CLASSIFIERS IN PREDICTING OVARIAN HYPERSTIMULATION
P < 0.05 according to the result of t-test. ACC, accuracy; AUC, area under the curve; DT, decision tree; F1, F1 score; KNN, k-nearest neighbour; SVM, support vector machine; MLP, multilayer perceptron; NPV, negative predictive value; PPV, positive predictive value; RF, random forest; SEN, sensitivity; SPE, specificity.
P < 0.05 according to the result of t-test. ACC, accuracy; AUC, area under the curve; DT, decision tree; F1, F1 score; KNN, k-nearest neighbour; SVM, support vector machine; MLP, multilayer perceptron; NPV, negative predictive value; PPV, positive predictive value; RF, random forest; SEN, sensitivity; SPE, specificity.
P < 0.05 according to the result of t-test. ACC, accuracy; AUC, area under the curve; DT, decision tree; F1, F1 score; KNN, k-nearest neighbour; SVM, support vector machine; MLP, multilayer perceptron; NPV, negative predictive value; PPV, positive predictive value; RF, random forest; SEN, sensitivity; SPE, specificity.
P < 0.05 according to the result of t-test. ACC, accuracy; AUC, area under the curve; DT, decision tree; F1, F1 score; KNN, k-nearest neighbour; SVM, support vector machine; MLP, multilayer perceptron; NPV, negative predictive value; PPV, positive predictive value; RF, random forest; SEN, sensitivity; SPE, specificity.
a P < 0.05 according to the result of t-test.ACC, accuracy; AUC, area under the curve; DT, decision tree; F1, F1 score; KNN, k-nearest neighbour; SVM, support vector machine; MLP, multilayer perceptron; NPV, negative predictive value; PPV, positive predictive value; RF, random forest; SEN, sensitivity; SPE, specificity.
This pilot study shows that a prediction of oocyte maturity is achievable by means of a deep learning-based segmentation algorithm and novel follicular volume cut-off values. We further found that multivariate machine learning classifiers represented by MLP achieve great accuracy in the individualized prediction of ovarian hyper-response. The present study has established the premises for additional studies that should determine whether follicle volume biomarker based on artificial intelligence could help improve oocyte retrieval outcome by establishing new criteria for predicting the number of mature oocytes retrieved and determining optimal timing for HCG administration.
Ultrasonic follicular monitoring is a key step in IVF and ovarian stimulation treatment. In general, the mean diameter and number of follicles measured by 2D-US are used to monitor the development of follicles, predict oocyte maturation and determine the timing of HCG ovulation trigger injection (
Three-Dimensional Automated Volume Calculation (Sonography-Based Automated Volume Count) versus Two-Dimensional Manual Ultrasonography for Follicular Tracking and Oocyte Retrieval in Women Undergoing in vitro Fertilization-Embryo Transfer: A Randomized Controlled Trial.
). The 2D-US imaging assumes that follicles have a regular shape, but this is likely inaccurate for follicles during ovarian stimulation because these follicles have uneven and irregular shapes (
Tur-Kaspa I. A critical review of bi-dimensional and three-dimensional ultrasound techniques to monitor follicle growth: do they help improving IVF outcome?.
). Recently, 3D-US was proposed as a better solution to improve the evaluation accuracy for ultrasound follicular monitoring, and SonoAVC was the most prominent representative. It was reported that SonoAVC had a good correspondence with two-dimensional follicular measurements and could significantly reduce the time needed to perform ultrasound examination (
). In previous studies, however, the limitations of this approach were obvious. A study reported that the process of SonoAVC in 3D-US follicular monitoring was long and semi-automated,and was seriously affected by image quality. Furthermore, results from another study agreed that the use of three-dimensional SonoAVC did not lead to an improvement in cycle parameters (
CR-Unet-Based Ultrasonic Follicle Monitoring to Reduce Diameter Variability and Generate Area Automatically as a Novel Biomarker for Follicular Maturity.
). In this study, a deep learning method was used to improve the accuracy of follicle size measurements, to establish a new criterion for identifying follicles that contain the mature oocyte, and to determine the optimal timing for an ovulation trigger. The results showed that the experimental errors in 88.4% of the cases were acceptable (within six follicles) by the deep learning model. In contrast, only 80.7% of cases were acceptable by two-dimensional diameter measurement (Figure 3). This is a step forward to finding significant differences between two-dimensional and three-dimensional measurements in predicting the outcome of oocyte retrieval.
In the present study, we proved that a cut-off value for follicle volume biomarker on HCG day is helpful for identifying follicles containing the best oocyte and for determining the optimal time to administer an HCG trigger for final oocyte maturation before ovum retrieval. To the best of our knowledge, this is the first study focu Rodríguez sed on a follicular volume cut-off value to aid in the optimal timing of HCG administration using 3D-US. In predicting maturity of mature oocytes retrieved, it was reported that follicles with a volume of 0.6 cm3 or above on the day of HCG administration (measured by SonoAVC) were more likely to contain mature oocytes (
). Some previous studies have compared the true volume of follicular aspirate and follicular volume measured by SonoAVC and concluded that the SonoAVC system slightly underestimated the volume (
). Interestingly, the results confirmed that follicles with a volume of 0.5 cm3 using the follicle volume biomarker were highly correlated to the number of mature oocytes retrieved. This can be interpreted as the new 3D-US biomarker for oocyte yield prediction using a deep learning segmentation method being more closely representative of actual circumstances.
Since the outcome of assisted reproductive technology is affected by various related factors, and with factors having individual inaccuracy, the results obtained are often different from the actual results and cannot accurately predict the outcome of oocyte retrieval or pregnancy. In recent years, artificial intelligence technology has been used to combine a variety of fertility-related information and to apply the optimal treatment to improve outcomes, which has helped achieve individual assessment for assisted reproduction (
). For example, in the latest retrospective study, a nomogram model was established to predict the probability of ovarian hyperstimulation syndrome in polycystic ovarian syndrome patients, and the AUC was 0.757 (
). For the prediction of ovarian hyperstimulation, more than 18 follicles, oestradiol of 5000 ng/l, or both, were regarded as criteria based on the previous guidelines (
). Ovarian response is closely correlated to ovarian hyperstimulation. To improve prediction of ovarian hyper-response in each individual patient, we performed univariate analysis and established machine learning classifiers with the new indicator (number of follicles ≥ 0.5 cm3). Our results showed that oestradiol, mean diameter of ovary, ovarian volume, total follicular count (manually), number of follicles measuring 10 mm or wider, and number of follicles measuring 0.5 cm3 or above were correlated with the outcome of oocyte retrieval, which is largely consistent with previous studies (
Melatonin concentration in follicular fluid is correlated with antral follicle count (AFC) and in vitro fertilization (IVF) outcomes in women undergoing assisted reproductive technology (ART) procedures.
). In addition, we showed that MLP achieved a higher accuracy than both the single use of the follicle volume biomarker and other multivariate classifiers. It can not only accurately predict ovarian hyper-response but can also achieve individual quantitative assessment (AUC = 0.880).
On the basis of the deep learning segmentation algorithm, the strengths of our study were the establishment of new criteria for predicting oocyte retrieval outcome and the individual prediction of ovarian hyperstimulation. Limitations of our study included the relatively small sample size and the lack of some patient information, i.e. FSH. In addition to the two-dimensional diameter measurement of 10 mm or wider, the results of other two-dimensional thresholds of follicles had not been considered as controls. Furthermore, this was a cohort study, and we did not change the timing of HCG ovulation trigger injection. Novel criteria may be theorized from the results of this study; however, the introduction of new criteria and its clinical application requires a randomized trial demonstrating that improved timing of HCG administration may lead to better fertilization and pregnancy rates. We did not focus on examination time in the study, because the automated approach was not loaded in ultrasound equipment. Also, deep learning typically involves an algorithm task that is specifically trained and improves with continued application. These limitations and further expectations of the present study are aims of ongoing studies.
In conclusion, artificial intelligence techniques provide a new strategy to solve the problems of available ultrasonic follicular monitoring methods, especially across multiple follicular cycles. Compared with conventional two-dimensional follicular diameter measurements, a cut-off criterion of 3.0 cm3 leading follicle volume for ovulation triggering and 0.5 cm3 leading follicle volume for predicting the number of mature oocytes retrieved using a 3D-US follicle volume biomarker improved oocyte retrieval outcome. Clinical applications of artificial intelligence techniques in the field of reproductive medicine may ultimately encourage doctors to return to the further use of 3D-US in assisted reproduction. Larger prospective randomized controlled trial studies using this methodology are now required.
DATA AVAILABILITY
All data are available from the corresponding author upon reasonable request. All code is available from the corresponding author upon reasonable request.
Funding
This work was supported by Clinical Research 4310 Program of the First Affiliated Hospital of the University of South China (grant number 4310-2021-K06), National Natural Science Foundation of China (grant number 82102054), Major Research Projects of Universities in Guangdong Province (grant number 2019KZDZX1032).
References
Abbara A.
Vuong L.N.
Ho V.N.A.
Clarke S.A.
Jeffers L.
Comninos A.N.
Salim R.
Ho T.M.
Kelsey T.W.
Trew G.H.
Humaidan P.
Dhillo W.S.
Follicle Size on Day of Trigger Most Likely to Yield a Mature Oocyte.
Luteal phase support in in-vitro fertilization using gonadotrophin releasing hormone analogue before ovarian stimulation: a prospective randomized study of human chorionic gonadotrophin versus intramuscular progesterone.
Power doppler ultrasound assessment of the relationship between age and ovarian perifollicular blood flow in women undergoing in vitro fertilization treatment.
CR-Unet-Based Ultrasonic Follicle Monitoring to Reduce Diameter Variability and Generate Area Automatically as a Novel Biomarker for Follicular Maturity.
Three-Dimensional Automated Volume Calculation (Sonography-Based Automated Volume Count) versus Two-Dimensional Manual Ultrasonography for Follicular Tracking and Oocyte Retrieval in Women Undergoing in vitro Fertilization-Embryo Transfer: A Randomized Controlled Trial.
Correlation of Anti-Mullerian Hormone Level and Antral Follicle Count with Oocyte Number in A Fixed-Dose Controlled Ovarian Hyperstimulation of Patients of In Vitro Fertilization Program.
Tur-Kaspa I. A critical review of bi-dimensional and three-dimensional ultrasound techniques to monitor follicle growth: do they help improving IVF outcome?.
Fertility Treatment Outcomes After Follicle Tracking With Standard 2-Dimensional Sonography Versus 3-Dimensional Sonography-Based Automated Volume Count: Prospective Study.
Retrospective cohort study: AMH is the best ovarian reserve markers in predicting ovarian response but has unfavorable value in predicting clinical pregnancy in GnRH antagonist protocol.
Melatonin concentration in follicular fluid is correlated with antral follicle count (AFC) and in vitro fertilization (IVF) outcomes in women undergoing assisted reproductive technology (ART) procedures.
Xiaowen Liang completed her Bachelor of clinical medicine in 2015 and her Master of imaging and nuclear medicine in 2018 at Guangzhou Medical University. She is now working in The Department of Ultrasound Medicine of the Third Affiliated Hospital of Guangzhou Medical University.
Key message
Artificial intelligence is being investigated as a promising means for reproductive medicine. A novel biomarker for oocyte maturity evaluation using artificial intelligence-based three-dimensional ultrasound quantification can provide an accurate strategy to solve the problems of available ultrasonic follicular monitoring methods.
Article info
Publication history
Published online: July 27, 2022
Accepted:
July 18,
2022
Received in revised form:
July 11,
2022
Received:
March 8,
2022
Declaration: The authors report no financial or commercial conflicts of interest.