Wenyuan Chen’s research while affiliated with University of Toronto and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (20)


An interpretable artificial intelligence approach to differentiate between blastocysts with similar or same morphological grades
  • Article

April 2025

·

6 Reads

Human Reproduction

Hang Liu

·

Longbin Chen

·

·

[...]

·

STUDY QUESTION Can a quantitative method be developed to differentiate between blastocysts with similar or same inner cell mass (ICM) and trophectoderm (TE) grades, while also reflecting their potential for live birth? SUMMARY ANSWER We developed BlastScoringNet, an interpretable deep-learning model that quantifies blastocyst ICM and TE morphology with continuous scores, enabling finer differentiation between blastocysts with similar or same grades, with higher scores significantly correlating with higher live birth rates. WHAT IS KNOWN ALREADY While the Gardner grading system is widely used by embryologists worldwide, blastocysts having similar or same ICM and TE grades cause challenges for embryologists in decision-making. Furthermore, human assessment is subjective and inconsistent in predicting which blastocysts have higher potential to result in live birth. STUDY DESIGN, SIZE, DURATION The study design consists of three main steps. First, BlastScoringNet was developed using a grading dataset of 2760 blastocysts with majority-voted Gardner grades. Second, the model was applied to a live birth dataset of 15 228 blastocysts with known live birth outcomes to generate blastocyst scores. Finally, the correlation between these scores and live birth outcomes was assessed. The blastocysts were collected from patients who underwent IVF treatments between 2016 and 2018. For external application study, an additional grading dataset of 1455 blastocysts and a live birth dataset of 476 blastocysts were collected from patients who underwent IVF treatments between 2021 and 2023 at an external IVF institution. PARTICIPANTS/MATERIALS, SETTING, METHODS In this retrospective study, we developed BlastScoringNet, an interpretable deep-learning model which outputs expansion degree grade and continuous scores quantifying a blastocyst’s ICM morphology and TE morphology, based on the Gardner grading system. The continuous ICM and TE scores were calculated by weighting each base grade’s predicted probability and summing the predicted probabilities. To represent each blastocyst’s overall potential for live birth, we combined the ICM and TE scores using their odds ratios (ORs) for live birth. We further assessed the correlation between live birth rates and the ICM score, TE score, and the OR-combined score (adjusted for expansion degree) by applying BlastScoringNet to blastocysts with known live birth outcomes. To test its generalizability, we also applied BlastScoringNet to an external IVF institution, accounting for variations in imaging conditions, live birth rates, and embryologists’ experience levels. MAIN RESULTS AND THE ROLE OF CHANCE BlastScoringNet was developed using data from 2760 blastocysts with majority-voted grades for expansion degree, ICM, and TE. The model achieved mean area under the receiver operating characteristic curve values of 0.997 (SD 0.004) for expansion degree, 0.903 (SD 0.031) for ICM, and 0.943 (SD 0.040) for TE, based on predicted probabilities for each base grade. From these predicted probabilities, BlastScoringNet generated continuous ICM and TE scores, as well as expansion degree grades, for an additional 15 228 blastocysts with known live birth outcomes. Higher ICM and TE scores, along with their OR-combined scores, were significantly correlated with increased live birth rates (P < 0.0001). By fine-tuning, BlastScoringNet was applied to an external IVF institution, where higher OR-combined ICM and TE scores also significantly correlated with increased live birth rates (P = 0.00078), demonstrating consistent results across both institutions. LIMITATIONS, REASONS FOR CAUTION This study is limited by its retrospective nature. Further prospective randomized trials are required to confirm the clinical impact of BlastScoringNet in assisting embryologists in blastocyst selection. WIDER IMPLICATIONS OF THE FINDINGS BlastScoringNet provides an interpretable and quantitative method for evaluating blastocysts, aligned with the widely used Gardner grading system. Higher OR-combined ICM and TE scores, representing each blastocyst’s overall potential for live birth, were significantly correlated with increased live birth rates. The model’s demonstrated generalizability across two institutions further supports its clinical utility. These findings suggest that BlastScoringNet is a valuable tool for assisting embryologists in selecting blastocysts with the highest potential for live birth. The code and pre-trained models are publicly available to facilitate further research and widespread implementation. STUDY FUNDING/COMPETING INTEREST(S) This work was supported by the Vector Institute and the Temerty Faculty of Medicine at the University of Toronto, Toronto, Ontario, Canada, via a Clinical AI Integration Grant, and the Natural Science Foundation of Hunan Province of China (2023JJ30714). The authors declare no competing interests. TRIAL REGISTRATION NUMBER N/A.


Automated Sperm Tracking and Immobilization With a Clinically-Compatible XYZ Stage

January 2025

·

20 Reads

IEEE Transactions on Automation Science and Engineering

Automated positioning systems play a pivotal role in micro-scale cell manipulation. In clinical intracytoplasmic sperm injection (ICSI) for infertility treatment, a motile sperm needs to be immobilized by glass micropipette tapping for subsequent surgical steps. The process requires accurate tracking of the target sperm and precise alignment between the sperm tail and the micropipette. Manual sperm immobilization suffers from inconsistent success rates, and current robotic systems developed for the task fail to comply with the standard clinical setup. Instead of using a motorized micromanipulator as in existing robotic systems, this paper presents an automated and compact three-dimensional (3-D) positioning stage for sperm immobilization that can be seamlessly integrated into standard clinical platforms. To tackle the challenge of accurately tracking the target sperm with degraded detection quality due to the complex 3-D motion of the positioning stage, a multi-stage sperm tracking scheme is designed for detection-to-tracklet association. To prevent physical contact between the sperm head and the micropipette, an adaptive tail-tapping planning strategy based on the sperm head orientation analysis is established. A visual servo controller equipped with a dynamic sperm motion observer is further employed to achieve precise positioning of the target sperm during the immobilization process. Experimental results demonstrated that the proposed system achieved a sperm tracking accuracy of 88.12%, and a sperm positioning accuracy of 2.3 ± 1.2 μm. Further experiments revealed the system achieved a success rate of 93.5% and a time cost of 5.5 s for automated sperm immobilization.


Automated Parts Segmentation of Sperm via a Contrastive Learning-Based Part Matching Network

January 2025

·

7 Reads

IEEE Transactions on Automation Science and Engineering

Sperm morphology measurement is vital for diagnosing male infertility, which involves quantification of multiple subcellular parts for each sperm. Instance-aware part segmentation networks have been introduced to address this task by automatically identifying individual sperm and segmenting their subcellular parts. However, major limitations of state-of-the-art instance-aware part segmentation networks include: 1) they are time-consuming and computational expensive due to sequential processing and multi-stage frameworks; 2) they perform poorly for densely packed sperm that overlap or cross over one another. To overcome these challenges, this paper proposes 1) integrating instance identification and subcellular part segmentation within a single-stage framework to save inference time and memory usage; 2) dividing a sperm target into simpler components (head and tail) to improve prediction accuracy, followed by a contrastive learning-based matching method to pair the head and tail. Experimental results on our clinically collected human sperm dataset demonstrated that the proposed network not only outperformed state-of-the-art CP-Net (by 3.5% APp vol) but also achieved realtime inference (48.0 frames per second), effectively meeting the clinical requirements for automated parts segmentation of sperm. final part segmentation results. 2) Since the sperm head and tail have simpler shapes, they are detected separately to improve segmentation accuracy. A contrastive learning-based method is then designed to pair head and tail based on similarity of feature embeddings extracted from the proposed instance prediction branch. The proposed method significantly outperformed existing networks, particularly in handling densely packed sperm. The presented method has applicability to analyzing sperm and more broadly other cell types.




Fig. 1. (a) An image of stained sperm. Each sperm is to be segmented into five parts: acrosome, vacuole, nucleus, midpiece and tail. (b) Instanceaware part segmentation that not only distinguishes different sperm, but also segments parts for each sperm. (c) Ellipse and rectangle fitting for measuring sperm head and midpiece morphology parameters. (d) Centerline fitting for measuring sperm tail morphology parameters.
Fig. 2. (a) Context loss due to bounding box cropping. (b) Feature distortion due to resizing in ROI Align. (c) Endpoints mislocated in Steger-based methods. (d) The normal of endpoints is mislocated due to the influence of gradient from the intersecting edge.
Fig. 3. The structure of our proposed attention-based instance-aware part segmentation network. The convolutional backbone and FPN first extracts features from the input image then rescales extracted features to multiscale. Next, the preliminary segmentation module generates instance-level parsing masks as in top-down methods. Finally, the refinement module refines preliminary generated masks by merging features extracted by FPN through the attention mechanism. Besides, the edge information is utilized to better separate boundary between adjacent sperm.
Fig. 5. Qualitative comparisons of instance-aware part segmentation networks. PGN (bottom-up method) has low instance distinction and cannot separate intersecting sperm parts. RP-R-CNN (top-down method) has distorted features and contexts outside a bounding box are cropped out. In comparison, our proposed network achieves the best prediction by refining preliminary segmented masks to retrieve lost contexts outside the bounding box and fix distorted features.
Fig. 6. (a) & (b) Qualitative comparison results for Steger-based methods and our proposed method. The proposed method can effectively fix mislocated endpoints. (c) Quantitative comparison of errors for measuring tail length, width and curvature with Steger-based methods and the proposed method. The experiment was conducted on 50 sperm with manual benchmarking.
Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation
  • Preprint
  • File available

July 2024

·

52 Reads

Traditional sperm morphology analysis is based on tedious manual annotation. Automated morphology analysis of a high number of sperm requires accurate segmentation of each sperm part and quantitative morphology evaluation. State-of-the-art instance-aware part segmentation networks follow a "detect-then-segment" paradigm. However, due to sperm's slim shape, their segmentation suffers from large context loss and feature distortion due to bounding box cropping and resizing during ROI Align. Moreover, morphology measurement of sperm tail is demanding because of the long and curved shape and its uneven width. This paper presents automated techniques to measure sperm morphology parameters automatically and quantitatively. A novel attention-based instance-aware part segmentation network is designed to reconstruct lost contexts outside bounding boxes and to fix distorted features, by refining preliminary segmented masks through merging features extracted by feature pyramid network. An automated centerline-based tail morphology measurement method is also proposed, in which an outlier filtering method and endpoint detection algorithm are designed to accurately reconstruct tail endpoints. Experimental results demonstrate that the proposed network outperformed the state-of-the-art top-down RP-R-CNN by 9.2% [AP]_vol^p, and the proposed automated tail morphology measurement method achieved high measurement accuracies of 95.34%,96.39%,91.2% for length, width and curvature, respectively.

Download


Flowchart of study design, including experiments and samples used in each section
Ablation studies were performed to investigate how model generalizability is affected by imaging magnification, imaging mode, and sample preprocessing protocols. A-D In the ablation experiment, each investigated factor was removed from the training dataset and the model was re-trained to compare the precision and recall. The detection result images and visualization heatmap are also shown. Example raw sample images are shown in (B), and example processed sample images are shown in (A), (C), (D). Each scale bar represents 10μm\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu m$$\end{document}. Each error bar represents the standard deviation of repeatedly training the model on the same dataset by three times. E, F The decrease in precision and recall caused by each factor was ranked. Removing raw sample images from the training dataset caused the largest drop in model precision, whereas removing 20×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document} images caused the largest drop in model recall. (*p<0.1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p<0.1$$\end{document}, **p<0.01\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p<0.01$$\end{document}, ***p<0.001\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p<0.001$$\end{document}, ****p<0.0001\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p<0.0001$$\end{document})
Testing the hypothesis in three clinics. The model precision and recall were tested using both raw samples and processed samples in two clinical applications. There was no significant difference in model precision and recall among three clinics as compared to the performance tested in during model training. (ns: not significant, p>0.05\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p>0.05$$\end{document})
Architecture of the deep learning-based sperm detection model. The model takes a single microscopic image as input (raw sample in this example), then uses image preprocessing to normalize image luminance, resolution, and color. Sperm is detected using Yolo v5, one of the state-of-the-art convolutional neural networks for object detection. The model outputs the image of the detected with anchor box markers (bounding boxes) and coordinates of each sperm
Testing the generalizability and effectiveness of deep learning models among clinics: sperm detection as a pilot study

May 2024

·

48 Reads

·

3 Citations

Reproductive Biology and Endocrinology

Background Deep learning has been increasingly investigated for assisting clinical in vitro fertilization (IVF). The first technical step in many tasks is to visually detect and locate sperm, oocytes, and embryos in images. For clinical deployment of such deep learning models, different clinics use different image acquisition hardware and different sample preprocessing protocols, raising the concern over whether the reported accuracy of a deep learning model by one clinic could be reproduced in another clinic. Here we aim to investigate the effect of each imaging factor on the generalizability of object detection models, using sperm analysis as a pilot example. Methods Ablation studies were performed using state-of-the-art models for detecting human sperm to quantitatively assess how model precision (false-positive detection) and recall (missed detection) were affected by imaging magnification, imaging mode, and sample preprocessing protocols. The results led to the hypothesis that the richness of image acquisition conditions in a training dataset deterministically affects model generalizability. The hypothesis was tested by first enriching the training dataset with a wide range of imaging conditions, then validated through internal blind tests on new samples and external multi-center clinical validations. Results Ablation experiments revealed that removing subsets of data from the training dataset significantly reduced model precision. Removing raw sample images from the training dataset caused the largest drop in model precision, whereas removing 20x images caused the largest drop in model recall. by incorporating different imaging and sample preprocessing conditions into a rich training dataset, the model achieved an intraclass correlation coefficient (ICC) of 0.97 (95% CI: 0.94-0.99) for precision, and an ICC of 0.97 (95% CI: 0.93-0.99) for recall. Multi-center clinical validation showed no significant differences in model precision or recall across different clinics and applications. Conclusions The results validated the hypothesis that the richness of data in the training dataset is a key factor impacting model generalizability. These findings highlight the importance of diversity in a training dataset for model evaluation and suggest that future deep learning models in andrology and reproductive medicine should incorporate comprehensive feature sets for enhanced generalizability across clinics.




Citations (10)


... Images after data augmentation 5 8 11 15 26 28 37 39 48 50 Pyriform 5 15 21 24 25 35 39 46 48 57 Tapered 6 14 23 27 32 37 41 48 50 52 Chenwy Sperm-Dataset contains 320 RGB images with a resolution of 1280 × 1024 [26]. The authors split the data into training and testing sets at an 8:2 ratio. ...

Reference:

Automated Deep Learning Model for Sperm Head Segmentation, Pose Correction, and Classification
Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation
  • Citing Conference Paper
  • May 2024

... Therefore, accurate segmentation of unstained sperm is not only technically challenging but also clinically important, as it better reflects real-world applications. In addition, current research often focuses on individual algorithms, stained datasets, or specific regions, which makes it challenging to evaluate the performance of different segmentation techniques under standardized conditions [31][32][33][34][35][36]. Consequently, current studies lack comprehensive segmentation of all sperm components (head, acrosome, nucleus, neck, and tail) within a unified framework, as well as systematic comparisons of various segmentation methods applied to unstained live human sperm datasets. ...

CP-Net: Instance-aware part segmentation network for biological cell parsing
  • Citing Article
  • June 2024

Medical Image Analysis

... Uma ampla gama de tarefas é coberta por esses modelos, entre elas prever resultados de estimulação ovariana [10] , otimização da dosagem hormonal [11] , segmentação 4 semântica de imagens de oócitos [12] , predição da qualidade do sêmen [13] e segmentação inicial de embriões [14] . No total, 19 artigos foram selecionados para inclusão nesta revisão, incluindo onze estudos [15,25] relacionados a estimulação ovariana controlada e termos correlacionados, três artigos [26,28] que dizem respeito análise de oócitos e termos correlacionados e cinco estudos [13,29,32] relacionados a análise de sêmen e termos correlacionados. ...

Testing the generalizability and effectiveness of deep learning models among clinics: sperm detection as a pilot study

Reproductive Biology and Endocrinology

... Kim [91] 2555, day5 image (static)+1 tabular data 2024 Image segmentation+CNN+MLP AUC=74.10% Liu [92] 17580, day5 image (static)+16 tabular data 2022 CNN+MLP AUC=77.00% Kim [93] 3286, day1-day5 image (temporal)+ unknown tabular data 2024 Transformer AUC=68.80% ...

Development and evaluation of a live birth prediction model for evaluating human blastocysts: a retrospective study

eLife

... After segmentation, addressing measurement errors caused by low-resolution nonstained sperm images is essential for accurate morphology evaluation. Key parameters of the sperm head, midpiece, and tail are highly sensitive to such errors (e.g., the head's length, width, etc.) [27,28], with minor deviations causing significant inaccuracies (see Figure 1c). Currently, methods to improve measurement accuracy under low-resolution conditions can be broadly classified into three categories: super-resolution reconstruction, multi-frame image fusion, and image enhancement techniques. ...

Staining-free, Automated Sperm Analysis for In Vitro Fertilization Lab Use
  • Citing Article
  • September 2022

The Journal of Urology

... Since the projector cannot image directly, it is usually regarded as an inverse camera, and the feature point coordinates of the projector are determined through the conversion relationship between phase and coordinates [19]. However, factors such as the nonlinear distortion of the projector [20], the reflectivity of the object surface [21], and other variables typically result in significant phase errors. Wang et al. [22] adopted a planar model to fit the phase to correct phase errors, but a complex iterative process is required to remove outliers. ...

Automated Exposures Selection for High Dynamic Range Structured-Light 3D Scanning
  • Citing Article
  • January 2022

IEEE Transactions on Industrial Electronics

... Robot localization is defined as the problem of determining the pose of a robot given a map of the environment [1][2][3], which can be divided into three sub-problems: pose tracking [4,5], global localization [6,7] and the kidnapped robot problem [6,[8][9][10]. The sensors commonly used in robot localization problems are 2D laser scanners [11][12][13], 3D LiDar [14,15], monocular camera [16,17], binocular camera [18] and depth camera [19,20], which is used to obtain environmental information near the robot position. In most case, the robot does not know its initial pose when starting to work. ...

Automated Eye-in-Hand Robot-3D Scanner Calibration for Low Stitching Errors
  • Citing Conference Paper
  • May 2020

... Recent algorithmic advancements focus on improving accuracy and efficiency. Liu et al. [21] optimize camera orientation and suggest that the optimal camera pair viewing angle lies between 20 • and 60 • ; however, this still leaves a considerable range for further refinement. Zimiao et al. [22] introduce a non-iterative line-constraint approach, while Zheng et al. [23] enhance stability with affine constraints. ...

Camera Orientation Optimization in Stereo Vision Systems for Low Measurement Error
  • Citing Article
  • August 2020

IEEE/ASME Transactions on Mechatronics

... The eye-in-hand configuration, with a scanner mounted on the end effector of the robotic arm, was employed, resulting in high accuracy and efficiency. This system uses a camera attached to the end of the robot arm to monitor movements in the work area [19], [20]. The experiments demonstrate the rationality of the vision system, but these methods have some limitations: using expensive cameras, the systems are often validated in simulation. ...

Fast Eye-in-Hand 3D Scanner-Robot Calibration for Low Stitching Errors
  • Citing Article
  • July 2020

IEEE Transactions on Industrial Electronics

... In a deep learning-based method, Zhang et al. [26] designed a specialized convolutional neural network that takes high dynamic range (HDR) fringe patterns with three-step phase shifting as input, enabling accurate extraction of phase information in both low signal-to-noise ratio (SNR) and HDR scenes. Liu et al. [27] proposed a Skip Pyramid Context Aggregation Network (SP-CAN) to enhance fringe images captured synchronously by a single-exposure camera, while precisely preserving encoded phase details near edges and corners. Shen et al. [28] employed an improved UNet-based deep neural network to establish a "many-to-one" mapping, utilizing π-phase-shifted binary fringes to acquire more saturated fringe information, thereby enabling fast and accurate retrieval of wrapped phase maps for HDR objects. ...

Optical Measurement of Highly Reflective Surfaces From a Single Exposure
  • Citing Article
  • April 2020

IEEE Transactions on Industrial Informatics