Fenglin Bi’s research while affiliated with East China Normal University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (5)


Assessing Maintainability Risks in the Open Source Software Supply Chain: An Empirical Quality Approach
  • Conference Paper

July 2024

·

7 Reads

Fenglin Bi

·

Sijia Zhao

·

Wei Wang

·

Songlin Wu


BotHawk: An Approach for Bots Detection in Open Source Software Projects

July 2023

·

61 Reads

Social coding platforms have revolutionized collaboration in software development, leading to using software bots for streamlining operations. However, The presence of open-source software (OSS) bots gives rise to problems including impersonation, spamming, bias, and security risks. Identifying bot accounts and behavior is a challenging task in the OSS project. This research aims to investigate bots' behavior in open-source software projects and identify bot accounts with maximum possible accuracy. Our team gathered a dataset of 19,779 accounts that meet standardized criteria to enable future research on bots in open-source projects. We follow a rigorous workflow to ensure that the data we collect is accurate, generalizable, scalable, and up-to-date. We've identified four types of bot accounts in open-source software projects by analyzing their behavior across 17 features in 5 dimensions. Our team created BotHawk, a highly effective model for detecting bots in open-source software projects. It outperforms other models, achieving an AUC of 0.947 and an F1-score of 0.89. BotHawk can detect a wider variety of bots, including CI/CD and scanning bots. Furthermore, we find that the number of followers, number of repositories, and tags contain the most relevant features to identify the account type.



Figure 1. Proposed Methodology.
Figure 7. Accuracy graph of CT scan images during training phase: (a) Fold 1, (b) Fold 2, (c) (d) Fold 4, and (e) Fold 5.
Figure 10. Confusion Metrics for X-ray images on 5-fold: (a) Fold 1, (b) Fold 2, (c) Fold 3, (d) Fold 4, and (e) Fold 5.
Model classification performance.
Novel Light Convolutional Neural Network for COVID Detection with Watershed Based Region Growing Segmentation
  • Article
  • Full-text available

February 2023

·

76 Reads

·

12 Citations

A rapidly spreading epidemic, COVID-19 had a serious effect on millions and took many lives. Therefore, for individuals with COVID-19, early discovery is essential for halting the infection’s progress. To quickly and accurately diagnose COVID-19, imaging modalities, including computed tomography (CT) scans and chest X-ray radiographs, are frequently employed. The potential of artificial intelligence (AI) approaches further explored the creation of automated and precise COVID-19 detection systems. Scientists widely use deep learning techniques to identify coronavirus infection in lung imaging. In our paper, we developed a novel light CNN model architecture with watershed-based region-growing segmentation on Chest X-rays. Both CT scans and X-ray radiographs were employed along with 5-fold cross-validation. Compared to earlier state-of-the-art models, our model is lighter and outperformed the previous methods by achieving a mean accuracy of 98.8% on X-ray images and 98.6% on CT scans, predicting the rate of 0.99% and 0.97% for PPV (Positive predicted Value) and NPV (Negative predicted Value) rate of 0.98% and 0.99%, respectively.

Download

Citations (2)


... To focus our analysis in RQ3 on widely adopted libraries, we filtered the 546 ML libraries with 2,436 bindings based on the number of stars and selected 127 ML libraries with more than 1,000 stars. Though we acknowledge that stars do not provide a complete picture of real-world usage, they are commonly seen as a proxy for the popularity of a project within the software engineering domain (Borges et al. 2016;Fang et al. 2022;Han et al. 2019;Wolter et al. 2023;Xia et al. 2023). For instance, TensorFlow's binding tfjs has gained over 17,000 stars on GitHub, 9 suggesting significant attention from developers. ...

Reference:

Bridging the language gap: an empirical study of bindings for open source machine learning libraries across software package ecosystems
Understanding the Archived Projects on GitHub
  • Citing Conference Paper
  • March 2023

... The COVID-19 pandemic underscored the importance of DL in medical image analysis. Automated evaluation of CT images using DL techniques enabled rapid and accurate differentiation of COVID-19 from other clinical conditions [134]- [136]. Similarly, DL has significantly advanced the detection of cervical and breast cancers, facilitating early diagnosis and improving treatment outcomes. ...

Novel Light Convolutional Neural Network for COVID Detection with Watershed Based Region Growing Segmentation