If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
The combination of morphology and proteomic profile is a powerful tool for selecting embryos.
•
IL-6 and MMP-1 concentrations in culture medium are helpful to identify successful embryos.
•
Artificial neural networks can detect the best embryos from a euploid cohort.
•
Artificial intelligence is effective in predicting live birth from IVF treatments.
ABSTRACT
Research question
The study aimed to develop an artificial intelligence model based on artificial neural networks (ANNs) to predict the likelihood of achieving a live birth using the proteomic profile of spent culture media and blastocyst morphology.
Design
This retrospective cohort study included 212 patients who underwent single blastocyst transfer at IVI Valencia. A single image of each of 186 embryos was studied, and the protein profile was analysed in 81 samples of spent embryo culture medium from patients included in the preimplantation genetic testing programme. The information extracted from the analyses was used as input data for the ANN. The multilayer perceptron and the back-propagation learning method were used to train the ANN. Finally, predictive power was measured using the area under the curve (AUC) of the receiver operating characteristic curve.
Results
Three ANN architectures classified most of the embryos correctly as leading (LB+) or not leading (LB–) to a live birth: 100.0% for ANN1 (morphological variables and two proteins), 85.7% for ANN2 (morphological variables and seven proteins), and 83.3% for ANN3 (morphological variables and 25 proteins). The artificial intelligence model using information extracted from blastocyst image analysis and concentrations of interleukin-6 and matrix metalloproteinase-1 was able to predict live birth with an AUC of 1.0.
Conclusions
The model proposed in this preliminary report may provide a promising tool to select the embryo most likely to lead to a live birth in a euploid cohort. The accuracy of prediction demonstrated by this software may improve the efficacy of an assisted reproduction treatment by reducing the number of transfers per patient. Prospective studies are, however, needed.
). Non-invasive methods (morphological and morphokinetic) as well as invasive methods (genetic testing) are currently used in IVF laboratories in embryo selection. However, new approaches to select embryos are still being investigated due to the limited improvement in live birth rate over the last few years (
New non-invasive methods based on “-omic” sciences, such as metabolomics and proteomics, have emerged to assess embryo viability and improve clinical outcomes (
). There is evidence for the importance of soluble ligands and receptors in both the embryo and the female reproductive tract during the embryonic preimplantation stage (
Human chorionic gonadotropin (hCG) in the secretome of cultured embryos: Hyperglycosylated hCG and hCG-free beta subunit are potential markers for infertility management and treatment.
). Methods for protein analysis have improved in terms of sensitivity, leading to the simultaneous study of multiple proteins (e.g. using mass spectrometry techniques or microarray technology). This has made it possible to observe associations among protein expression patterns, developmental stages and embryo morphology (
Analysis of the embryo culture medium has revealed the secretion and consumption of several proteins by the embryo. The increase in the culture medium of TNF (tumour necrosis factor) R1, IL (interleukin) 10, IL-6, VEGFA (vascular endothelial growth factor A), EMMPRIN (extracellular matrix metalloproteinase inducer), PLGF (placental growth factor), EpCAM (epithelial cell adhesion molecule), caspase-3, HE-4 (Human epididymis protein 4) and IL-6 meant that the embryo had secreted those proteins (
). Similarly, a decrease in CXCL (C-X-C motif chemokine ligand) 13, SCF (stem cell factor), TRAIL-R3 (receptor for the cytotoxic ligand TRAIL), MIP-1β (macrophage inflammatory protein-1β) and MSP-α (macrophage stimulating protein-α) in the culture medium indicated that the embryo had consumed or metabolized those proteins (
). Furthermore, different protein patterns were observed according to implantation potential and embryo quality. Embryos that implanted consumed more CXCL13 and GM-CSF than non-implanted ones (
Embryologic outcome and secretome profile of implanted blastocysts obtained after coculture in human endometrial epithelial cells versus the sequential system.
;). In addition, culture medium from blastocysts showed higher concentrations of EMMPRIN than samples from arrested embryos. Higher concentrations of caspase-3 were found in culture medium from good quality blastocysts (
), which shows the potential of using artificial intelligence to improve the efficiency of assisted reproductive treatments.
It has recently been shown that artificial intelligence is a non-invasive tool with a high potential for predicting a live birth due to its capacity for recognizing patterns (
How much information about embryo implantation potential is included in morphokinetic data? A prediction model based on artificial neural networks and principal component analysis.
). All these techniques use artificial neural networks as the basis of the model.
ANNs are computational techniques inspired by the functioning of the human brain whose main objective is to learn tasks that seek to solve complex problems (
). This technique simulates a biological neuron and imitates its ability to learn through trial and error. The characterization of an ANN is achieved via its self-learning through training. A commonly used algorithm for supervised learning is back-propagation (
Nowadays, the potential of ANNs is being increasingly explored mainly by associating other techniques, such as genetic algorithms, to optimize the results. The latter are evolutionary algorithms based on both Darwin's theory of evolution and Gregor Mendel's laws of genetics. This methodology generates a random population of individuals that is evaluated during the evolutionary process. In this process, the most qualified individuals are maintained in the next generations. After the creation of a new population, the process is repeated until a satisfactory solution is found (
The introduction of artificial intelligence to IVF laboratories would allow the analysis of raw embryo images without previous manual annotations. Even though the incorporation of continuous monitoring increases the objectivity of embryo assessment, manual annotations are subject to disparity among embryologists (
). Recently, several approaches using artificial intelligence techniques for embryo classification and prediction of clinical outcomes have been published (see, for example,
Application of artificial intelligence technology to increase the efficacy of embryo selection and prediction of live birth using human blastocysts cultured in a time-lapse incubator.
The objective of the current study was to develop a combinative predictive model based on artificial intelligence. First, morphological data from several embryos were used to create an ANN capable of predicting live birth. Donated oocytes were used, assuming that, as they were obtained from young women, they would result in mostly euploid embryos (
;). Second, information from blastocyst images and proteomic information was used to predict the potential of a euploid embryo to lead to a live birth. To the authors’ knowledge, this is the first approach to predict the likelihood of achieving a live birth by relying on machine learning and proteomic data.
Materials and methods
Study design and participants
In this single-centre project, two populations were enrolled (Figure 1): 131 recipients of the oocyte donation programme of IVI Valencia (no preimplantation genetic testing [PGT]) and 81 women using autologous eggs in the PGT for aneuploidies (PGT-A) programme. The resulting embryos were individually cultured up to blastocyst stage in a continuous monitoring system (EmbryoScope; Vitrolife, Denmark).
Figure 1The study design shown as a flow diagram representing the distribution of embryos in the project. ANN, artificial neural network; PGT, preimplantation genetic testing.
A single time-lapse image from each embryo acquired at 111.5 ± 1.5 h of development was analysed using computational vision to extract information (as described below). Additionally, 20 µl of culture medium (Gems; Genea Biomedx, Australia) was collected for the proteomic analysis from the biopsied embryos and eight control samples (medium in which no embryos had been cultured). Samples were obtained on day 5 of development and were stored at –80°C until the proteomic analysis. Only the medium from euploid embryos was analysed after single-embryo transfer.
A total of 212 embryos was selected for application of the artificial intelligence technique: the first group consisted of 131 embryos obtained from the oocyte donation programme, and the second group included 81 embryos from autologous treatments with proteomic information. After the image analysis, 26 embryos were excluded from the second group: 19 blastocysts were outside the zona pellucida at 111 ± 1.5 h of development, which made image analysis difficult and not comparable with the 186 images finally remaining, and seven embryos did not reach the blastocyst stage at the proposed time and were discarded due to their early developmental stage. Therefore, the second group included 55 embryos for analysis, of which 11 were used for the blind test. Thus, the database totalled 186 embryos to undergo application of the artificial intelligence technique.
Ovarian stimulation in treatments with donated oocytes
Donors received stimulatory treatment using the conventional ovarian stimulation protocol with gonadotrophin-releasing hormone (GnRH) agonist treatment. GnRH agonist (Decapeptyl; Ipsen Pharma, Spain) was administered by intramuscular injection until more than eight follicles had reached a mean diameter of 18 mm or more. Transvaginal oocyte retrieval was scheduled for 36 h later. Endometrial preparation was undertaken using hormone replacement therapy as described by Cerrillo and colleagues (
). After embryo transfer, oocyte recipients received a daily dose of 400 mg of vaginal micronized progesterone (Progeffik; Lab. Effik, Spain) every 12 h as luteal phase support.
Ovarian stimulation in treatments with autologous oocytes
GnRH-antagonist treatments were applied, the GnRH-agonist being administered when at least three leading follicles had reached a mean diameter of 18 mm. Transvaginal oocyte retrieval was scheduled for 36 h later through follicular aspiration, and oocytes were washed in gamete medium (Cook Medical, Australia).
Oocyte retrieval and embryo incubation
Oocytes were cultured in fertilization medium (Origio; CooperSurgical, Denmark) in 5% CO2 and 5% O2 at 37°C. Denudation was carried out just before intracytoplasmic sperm injection (ICSI), 4 h after oocyte retrieval, using mechanical and chemical procedures (pipetting in 40 IU/ml hyaluronidase). ICSI was performed in a HEPES-buffered gamete medium at × 400 magnification using an Olympus IX7 microscope (Olympus Corporation, Japan). Finally, oocytes were placed in EmbryoSlides (Vitrolife, Denmark) pre-equilibrated to blastocyst stage with 28 µl of single-step medium (Gems; Genea Biomedx, Australia) and 1.6 ml of mineral oil.
Embryos were cultured individually up to the fifth or sixth day of development in the time-lapse system EmbryoScope (Vitrolife, Denmark). Successful fertilization was assessed at 16–19 h after ICSI.
Embryo morphology was evaluated on day 3 (62–72 h after ICSI) based on digital images, taking into consideration the number, the symmetry of the blastomeres, the percentage of fragmentation and the degree of compaction. Blastocysts were scored on day 5 (120 h after ICSI) based on the Association for the Study of Biology of Reproduction (ASEBIR) criteria (Supplementary Tables 1, 2 and 3) and the KIDScore Day 5 (with EmbryoViewer software; Vitrolife, Denmark). Embryologists annotated the morphokinetic parameters: the timings of cell divisions to the 2-cell (t2), 3-cell (t3), 4-cell (t4), 5-cell (t5), 8-cell (t8), 9-cell (t9), compaction (CP), morula (tM), start of blastulation (tSB), blastocyst (tB) and expanding blastocyst (tEB) were calculated, with the start of ICSI being used as t0. The durations of the cell cycle intervals t3–t2 (cc2), t4–t3 (s2), t5–t2, t5–t3 (cc3) and t8–t5 (s3) were also calculated. If there was more than one embryo of the same morphological quality, the score provided by the KIDScore Day 5 decided which would be transferred.
PGT-A
Embryos were taken out of the incubator on day 3 of development to undergo a small laser incision traversing the zona pellucida (assisted hatching) using a Lykos laser (Hamilton Thorne, USA). This procedure made the trophectoderm biopsy on day 5 of culture easier, when a biopsy pipette was used to remove approximately 5 cells. Chromosome analysis was performed using next-generation sequence technology (Thermo Fisher Scientific, USA).
Embryo transfer and clinical outcome
A single-embryo transfer of one blastocyst was performed for all patients; for those in the PGT-A programme, the blastocyst had previously been vitrified and warmed using the Cryotop method (Kitazato Biopharma, Japan). Embryo selection for transfer was based on chromosomal status, morphology and morphokinetics. The β-HCG concentration was determined 10 days after embryo transfer, and clinical pregnancy was confirmed by the presence of gestational sac at the fifth week of pregnancy. Finally, patients inform about live birth after the delivery.
Protein analysis in spent culture media
The relative concentrations of 92 proteins from 81 samples of spent embryo culture medium and eight control samples (medium in which no embryos had been cultured) were analysed using Proseek Multiplex Assays (Olink Bioscience, Sweden) based on proximity extension assay (PEA) technology.
Proteins were measured using the Olink Inflammation panel (Olink Proteomics, Sweden) according to the manufacturer's instructions. The PEA technology used for the Olink protocol has been well described (
); it enables 92 proteins to be analysed simultaneously, using 1 µl of each sample. In brief, pairs of oligonucleotide-labelled antibody probes bind to their target protein. If the two probes are closely located, the oligonucleotides will hybridize in a pairwise manner. The addition of a DNA polymerase leads to a proximity-dependent DNA polymerization event, generating a unique polymerase chain reaction (PCR) target sequence. The resulting DNA sequence is subsequently detected and quantified using a microfluidic real-time PCR instrument (Biomark HD; Fluidigm, USA). Data are then quality controlled and normalized using an internal extension control and an inter-plate control, to adjust for intra- and inter-run variation. The final assay readout is presented as Normalized Protein eXpression (NPX) values, which are arbitrary units on a log2 scale, where a high value corresponds to a higher protein expression. Hence, a 1 NPX difference means a doubling of protein concentration. All assay validation data (detection limits, intra- and inter-assay precision data, reproducibility and specificity) are available on manufacturer's website (www.olink.com).
Artificial intelligence model: image analysis, collinearity analysis, ANN and genetic algorithms
Standardization of the 186 blastocyst images was necessary before applying the artificial intelligence technique. The methodology used was as described by Rocha and colleagues (
). The images were initially imported automatically into Matlab software (MathWorks, USA) and a standardization algorithm was run to normalize them in terms of contrast and resolution (
): the area of the expanded blastocyst, the inner cell mass and the trophectoderm. Finally, the algorithm obtained 33 mathematical variables by using the measurement of the area, the number of pixels present in each segmented portion of the blastocyst image, the binary patterns (
). These variables were chosen seeking to represent all the relevant characteristics of the blastocyst for embryo quality assessment and live birth prediction through artificial intelligence.
After the embryo images had been standardized and isolated, and the mathematical variables had been extracted, a collinearity analysis was used to check if the variables were correlated with one another based on the variance inflation factor (VIF). The VIF represents the degree of independence or non-redundancy between a variable and another independent variable. According to previous studies, collinear variables were considered to be those with VIF values higher than 10 (
); removal of collinear variables reduced the number of variables representing the human embryo to 20 (Supplementary Table 4).
Likewise, a collinearity analysis was performed to discriminate the independent and non-redundant proteins. As it is shown in detail in ‘Collinearity analysis for proteins’, below, seven proteins were suitable for using as input to the ANN, in conjunction with the 20 morphological variables. The artificial intelligence technique associated ANNs (multilayer perceptron) with the genetic algorithm by using the back-propagation learning algorithm (
) for the training phase. The genetic algorithm used the ANNs as individuals in a population, which, over generations, end up selecting the best ANN (i.e. the one with the highest accuracy for live birth prediction).
The dataset of 131 embryos was randomly divided into 70% for training, 15% for validation and 15% for testing the ANN. Of the dataset of 55 embryos, 20% were used for the blind test, and the remaining 44 embryos were randomly divided into 68% for training, 16% for validation and 16% for testing the ANN (
Statistical tests were applied to probe significant differences in the values of each protein in conditioned compared with control media and in conditioned media from implanted compared with non-implanted embryos. A t-test was used for parameters with a normal distribution, and a Wilcoxon rank sum test for those with a non-normal distribution. Values of P that were <0.05 were considered statistically significant.
The image analysis and the final model were tested using two techniques. First, receiver operating characteristic (ROC) curves were used to analyse the artificial intelligence results for pattern recognition. The resulting graph represents the ratio of true test positives to total positives (the sensitivity) per the false positive fraction, i.e. the ratio of false test positives to total negatives (1 – specificity). An area under the ROC curve (AUC) of greater than 0.5 might indicate a predictive power to identify embryos that lead (LB+) or do not lead (LB–) to a live birth. The greater the AUC, the more favourable the compensation between sensitivity and specificity. It tells how much the model is capable of distinguishing among LB+ and LB– embryos. Second, confusion matrix methodology was used to analyse the intersection between the data provided by the model (the artificial intelligence system) and the real results. The authors considered as true positive the number of embryos that achieved a live birth, the model classifying them as positive, and as true negatives those embryos that did not achieve a live birth, the model classifying them as negative. The embryos wrongly classified as leading to a positive or negative live birth were described as false positive and false negative, respectively.
Ethical approval
An Institutional Review Board (IRB reference 1802-VLC-012-MM), which regulates and approves database analysis and clinical IVF procedures for research at IVI, approved the procedure and protocol on 10 April 2018. Additionally, the project complies with the Spanish law governing assisted reproductive technologies (14/2006).
Results
The mean age of the patients included in the study was 41.6 years, with a mean body mass index (BMI) of 23.2 kg/m3 for the women receiving autologous oocytes. Regarding the clinical outcome, this group showed a positive β-HCG of 63.0%, an implantation rate of 56.8% and a live birth rate of 47.0%. The patients included in the oocyte donation programme had a mean age of 37.9 years with a mean BMI of 22.9 kg/m3 and their treatments achieved a positive β-HCG of 68.7%, an implantation rate of 54.20% and a live birth rate of 40.5%.
Proteomic profile of preimplantation embryos
Of the total of 92 proteins, 67 had identical NPX values in all the samples analysed (conditioned and control media). The lack of variation in the signal for each sample was decisive for not including these proteins in the following analyses. Only 25 of the total protein samples analysed had different NPX values. The means of the NPX values for these 25 proteins in control and conditioned medium are shown in Figure 2. Higher concentrations of three proteins were detected in the spent embryo culture media compared with background concentrations in control media. These proteins were IL-8 (P = 0.025), IL-6 (P = 0.001) and uPA (urokinase-type plasminogen activator; P = 0.006). Furthermore, lower concentrations of 14 proteins were detected in spent embryo culture media compared with background concentrations. These proteins were DNER (Delta/Notch-like EGF-related receptor; P < 0.001), CSF-1 (macrophage colony-stimulating factor 1; P < 0.001), Flt3L (FMS-like tyrosine kinase 3 ligand; P < 0.001), SCF (P < 0.001), CD40 (P < 0.001), MCP-1 (methyl-accepting chemotaxis protein; P < 0.001), CX3CL1 (P < 0.001), CD6 (P < 0.001), TRAIL (P = 0.002), TNFRSF9 (tumour necrosis factor receptor superfamily member 9; P < 0.001), CD244 (P < 0.001), IL-18 (P < 0.001), CCL23 (P < 0.001) and IL-18R1 (P < 0.001).
Figure 2The grouped column chart represents the average of the Normalized Protein eXpression (NPX) value obtained using the proximity extension assay technique for the 25 useful proteins analysed in culture medium from 81 embryos generated from autologous eggs. Grey columns represent control media (n = 8), and blue columns represent conditioned media collected on day 5 of embryo culture.
The only protein with a different NPX value in implanted (NPX value 2.44) and non-implanted embryos (NPX value 2.76) was VEGFA (P = 0.017).
Collinearity analysis for proteins
The collinearity analysis of the 25 proteins demonstrated that most of them were highly correlated with one another (data not shown). Thus, after correcting the collinearity, seven independent and non-redundant proteins remained for use in the ANN: matrix metalloproteinase-1 (MMP-1), IL-6, VEGFA, uPA, TNF-related activation-induced cytokine (TRANCE), Flt3L and DNER. The relative NPX values for each protein are shown in Table 1.
Table 1NPX values obtained using the PEA technique for the seven independent proteins resulting from the collinearity analysis
Protein
NPX value (mean ± SD)
LB+ (n = 38)
LB– (n = 43)
P-value
MMP-1
–0.39 ± 0.83
–0.49 ± 0.53
0.579
IL-6
0.94 ± 0.53
0.75 ± 0.67
0.261
VEGFA
2.40 ± 0.37
2.59 ± 0.53
0.137
uPA
0.67 ± 0.42
0.61 ± 0.52
0.634
TRANCE
0.19 ± 0.51
–0.05 ± 0.15
0.023
Flt3L
5.87 ± 0.20
6.00 ± 0.27
0.051
DNER
4.57 ± 0.29
4.71 ± 0.27
0.077
LB+, positive for live birth; LB–, negative for live birth; NPX, Normalized Protein eXpression; PEA, proximity extension assay; SD, standard deviation.
An extraction of relevant variables from the 131 blastocyst images included in the first population was required to design the ANN using morphological data. The training performed with 70% of the images was capable of correctly classifying 89% of the embryos as either LB+ or LB– (true positive, 33; true negative, 48; false positive, 6; false negative, 4), with 95% correctly classified in the test (true positive, 6; true negative, 13; false positive, 1; false negative, 0).
Regarding the validation of the ANN with the second embryo population and the additional proteomic data, the three most efficient architectures obtained using the genetic algorithm technique are shown in Table 2. Considering only the test data, the ANN was successful in predicting positive and negative live birth (mean of total success 89.67%).
Table 2Test accuracies of the three most efficient ANN architectures in live birth prediction
Regarding the test dataset, the ROC curves for the three architectures are shown in Figure 3. The architecture developed using IL-6 and MMP-1 achieved a correct classification of all the embryos as a positive or negative live birth in the training, validation and test phases (total success 100%). The resulting AUC to predict the positive and negative live birth reached the highest value, 1.0.
Figure 3Receiver operating characteristic (ROC) curves for live birth (LB) prediction using the testing dataset (n = 7) and the artificial neural network with architecture 1 (A), architecture 2 (B) and architecture 3 (C) (see Table 2 for information on the architectures). The y-axis represents the sensitivity, and the x-axis refers to 1 – Specificity. Class 1, positive live birth; Class 2, negative live birth.
The blind test for architecture 1 was performed with 11 embryos that had not previously been used, and reached an accuracy of prediction of 72.7% (Figure 4). It correctly classified eight embryos out of the total number (true positive 4, true negative 4, false positive 1 and false negative 2).
Figure 4Confusion matrix for the blind testing dataset using the artificial neural network (ANN) with architecture 1. The y-axis refers to the output value predicted by the ANN, and the x-axis to the real value. Green areas represent the embryos classified correctly, and orange ones the embryos classified incorrectly. Dark blue areas represent the total hits (72.7%) and total mistakes (27.3%), and light blue areas provide the right:wrong ratio for each column. LB+, positive for live birth; LB–, negative for live birth.
Information from the blastocyst images and proteomic information gained from the analysis of the embryonic secretome were used to predict the potential of a euploid embryo to lead to a live birth.
The proteomic analysis of the culture media showed that, out of 92 proteins measured, only IL-6, uPA and IL-8 were differentially secreted by the developing human embryos. Previous studies revealed that the concentration of IL-6 in the culture medium could be useful in selecting the embryo for implantation (
). In addition, there is evidence about the secretion of IL-6 by the endometrial epithelial cells that emphasizes the importance of this cytokine in embryo development (
Embryologic outcome and secretome profile of implanted blastocysts obtained after coculture in human endometrial epithelial cells versus the sequential system.
). The other prominent proteins found in the current work to be differentially secreted have also previously been detected. Whereas the first evidence of uPA in embryo culture media was in 1996 (
At the end of the collinearity analysis, seven proteins remained for use in the artificial intelligence model. According to the current study's results, two of these proteins (IL-6 and uPA) were secreted by the embryos, two others (DNER and Flt3L) were the ones consumed, and one protein (VEGFA) was related to poor implantation. Regarding the other two proteins, the authors consider that they have passed the collinearity analysis due to their exclusive characteristics: MMP-1 was the only one with a negative mean value in both control medium and medium from the embryos; and TRANCE was not found in control media, but was present in all media that had contained an embryo (see Figure 2). Furthermore, TRANCE was the only one out of the seven proteins with significant difference in relative concentration between LB+ and LB– samples (Table 1).
The image analysis performed in the present research on the blastocyst pictures has been proven in a previous study that showed good results in classifying the quality of bovine blastocysts (
). This software considered that 20 morphological variables (Supplementary Table 4) of human embryos were enough to predict the likelihood of achieving a live birth. These parameters were combined with the data from the proteomic analysis to develop the current artificial intelligence model of prediction.
The current results demonstrated that the accuracy of prediction was higher as the ANN architecture improved. First, the model created using 25 proteins correctly classified 83.3% of the embryos used in the test. Second, the collinearity analysis was shown to be efficient as the predictive power improved to 85.7% when only the resulting seven independent proteins were used. Finally, the artificial intelligence model achieved the highest accuracy in predicting live birth (AUC = 1) when considering IL-6, MMP-1 and the 20 morphological variables. It is reported that these proteins play an important role in reproductive function. IL-6 is relevant for embryonic development (
Embryologic outcome and secretome profile of implanted blastocysts obtained after coculture in human endometrial epithelial cells versus the sequential system.
A recent publication demonstrated that, for embryo selection an objective time-lapse imaging algorithm is superior to the subjective blastocyst morphological scoring system (
) and obtained an AUC of 67.43% for live birth prediction, compared with 61.74% using the blastocysts’ morphological grade. The accuracy of embryo selection using the current proposed model is higher than using standard morphological selection alone, as well as using algorithms developed with time-lapse images.
This study differs from previously published approaches to artificial intelligence in terms of the experimental design, including the proteomic analysis, and the outcome. The first application of ANN to predict the outcome of an IVF treatment achieved an accuracy of 59% (
). Since then, several predictive models have been developed based on different populations of patients, such as in cycles associated with male factor infertility (
How much information about embryo implantation potential is included in morphokinetic data? A prediction model based on artificial neural networks and principal component analysis.
). Machine learning methods have been used to combine morphokinetic algorithms and factors of infertility such as sperm motility, anti-Müllerian hormone (AMH) concentration and blastomere size on day 3 (
Embryo images provided by time-lapse systems have now become the subject of studies based on artificial intelligence. Deep learning techniques have been used to predict blastocyst quality and select the most appropriate embryo to transfer (
). Embryos classified as good quality by these authors’ deep neural network, called STORK, showed higher probabilities of leading to a live birth than those classified as bad quality (61.4% and 50.9%, respectively). Nevertheless, STORK cannot estimate the pregnancy rate, although it showed a very high AUC (0.98) in predicting blastocyst quality. Another deep learning model was recently developed by Tran and colleagues (
) that used the complete video of embryo development to predict the likelihood of pregnancy, with an AUC of 0.93 in 5-fold stratified cross-validation. The likelihood of an IVF treatment ending up with a live birth has recently been analysed considering patient and treatment characteristics (
Limitations of the current study that should be considered are reflected in the design of the research. This model was trained with proteomic data and morphological variables from the image analysis of one time-lapse system. The total population of embryos was distributed into two groups with different phases (training, validation and testing), resulting in a small number of embryos in each. Additionally, only one laboratory and a unique culture medium were involved in this study. This could be considered as an advantage in the proteomic analysis, but applicability to other laboratories remains unclear. In addition, there is evidence of inter-batch protein and pH variability with the same medium (
). In general, the models developed with ANNs could also be affected by the overfitting phenomenon. The current study tried to avoid this by defining the input and output variables. The overall success on the blind test may be considered as evidence (72.7% for the blind test versus 100% for training, validation and testing). It is also necessary to highlight that the model with autologous oocytes was built using only euploid embryos, so the clinical value lies in distinguishing the most viable embryo among those that test as euploid. Further studies should have a large sample size and multicentric nature, and should include data from different time-lapse systems to standardize the artificial intelligence model and globalize its use.
In conclusion, the introduction of artificial intelligence to IVF laboratories would help embryologists to predict the success of an embryo for achieving a live birth. The combination of proteomic analysis of the embryo culture medium and morphological information from the blastocyst images has never previously been assessed using artificial intelligence techniques. The present preliminary research has shown the predictive power of this combination. The ANN achieved excellent accuracy for detecting euploid embryos capable of resulting in a live birth, especially in terms of IL-6 and MMP-1. In fact, the model proposed in this manuscript is a promising tool to select the most successful embryo of a euploid cohort. In further studies, the ANN should be retrospectively tested with an appropriately sized study to confirm the effectiveness of this innovative method before its prospective application.
Acknowledgements
The authors acknowledge the embryologists and technicians of the IVF laboratory at IVI-RMA Global Valencia for their clinical support. The authors also thank the contribution of the IVI Foundation and the State University of São Paolo in this project. This work was supported by the Ministry of Science and Universities CDTI (IDI-20191102) awarded to M.M.; grants # 2017/19323-5, 2018/19371-2 and 2018/24252-2 from the São Paulo Research Foundation (FAPESP); and from the Spanish Ministry of Economy and Competitiveness through the Miguel Servet programme [CPII018/00002] awarded to F.D.
Human chorionic gonadotropin (hCG) in the secretome of cultured embryos: Hyperglycosylated hCG and hCG-free beta subunit are potential markers for infertility management and treatment.
Embryologic outcome and secretome profile of implanted blastocysts obtained after coculture in human endometrial epithelial cells versus the sequential system.
How much information about embryo implantation potential is included in morphokinetic data? A prediction model based on artificial neural networks and principal component analysis.
Application of artificial intelligence technology to increase the efficacy of embryo selection and prediction of live birth using human blastocysts cultured in a time-lapse incubator.
Lorena Bori is doctoral researcher at IVIRMA Valencia, Spain. She received her biology degree in 2016 and her Master's degree in the biotechnology of human assisted reproduction in 2018 from the University of Valencia. Her primary field of research is embryo evaluation and selection using non-invasive methodologies, especially artificial intelligence.
Key Message
An artificial intelligence model was designed using proteomic and morphological data from blastocysts. The algorithm, based on artificial neural networks, is capable of discriminating with high accuracy between euploid embryos that lead to a live birth and those which do not.
Article info
Publication history
Published online: October 07, 2020
Accepted:
September 30,
2020
Received in revised form:
September 17,
2020
Received:
May 11,
2020
Declaration: The authors report no financial or commercial conflicts of interest