Yi Liu’s research while affiliated with Beijing Institute of Technology and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (324)


Application of multiple inflammatory markers combined with PIVKA-II in differential diagnosis of AFP-NHCC
  • Article

November 2024

Gene & Protein in Disease

Wen-Tan Hu

·

Xin-Ying Ji

·

Zhi-Liang Jiang

·

[...]

·

Ning Luo

Alpha-fetoprotein (AFP) is a proven blood biomarker widely used in clinical detection of liver cancer, and its concentration increases immediately after liver injury, with high diagnostic specificity and low sensitivity. However, in patients with AFP-negative hepatocellular carcinoma (AFP-NHCC), also known as small liver cancer, their early stage of AFP expression is usually low or close to the normal range. Compared with AFP-positive HCC, AFP-NHCC is more likely to be subjected to missed diagnosis due to solely dependence on the measurement of blood AFP. Therefore, it is necessary to find a reliable, effective, and economical detection method for the early diagnosis of AFP-NHCC. PIVKA-II is closely related to the occurrence, development, invasion, and metastasis of liver cancer, and is also a new hematological marker widely used in the diagnosis of liver cancer in recent years. PIVKA-II effectively makes up for the limitation of negative AFP. 63.2% – 76.3% of patients with negative AFP showed positive PIVKA-II, and if only PIVKA-II detection was relied on, there would still be missed diagnosis in early patients with AFP-NHCC. In this retrospective study, we selected several commonly used inflammatory indicators to explore the diagnostic efficacy of PIVKA-II, an abnormal form of prothrombin, combined with inflammatory indicators in patients with early-stage AFP-NHCC and analyzed the relationship between some clinical features and hematological indicators in patients with AFP-NHCC. Serum levels of high-sensitivity C-reactive protein (hs-CRP), prealbumin (PA), neutrophil/lymphocyte ratio (NLR), and PIVKA-II were compared among three groups (AFP-NHCC group, benign lesion group, and healthy subjects). The diagnostic efficacy of PIVKA-II alone for AFP-NHCC and the diagnostic efficacy of three inflammatory indicators combined with PIVKA-II for AFP-NHCC were calculated, and their diagnostic specificity and sensitivity were compared. Compared with the other two groups, the AFP-NHCC group showed significant changes in three inflammatory markers and PIVKA-II, which may be related to the inflammatory progression of the tumor. Therefore, we recommend the establishment of a laboratory-based detection approach for diagnosing early-stage AFP-NHCC by combining PIVKA-II with inflammatory indicators hs-CRP, PA, and NLR, to facilitate diagnosis, treatment, and prognosis of AFP-NHCC.


Study flowchart. iNPH, idiopathic normal pressure hydrocephalus; MRI, magnetic resonance imaging.
Distribution of CSVD imaging markers in patients with iNPH. For lacunes and CMBs, a score of 0 indicated absence, and a score of 1 indicated the presence of lacunes or CMBs. For PWMH and DMWH, scores ranging from 0 to 3 correspond to Fazekas scores. For BG‐EPVS, 0 = no EPVS, 1 ≤ 10 EPVS, 2 = 11 to 20 EPVS, 3 = 21 to 40 EPVS, and 4 ≥ 40 EPVS. As for CSVD burden, scores from 0 to 4 reflect the overall burden of cerebral small vessel disease in the patient. BG‐EPVS, basal ganglia enlarged perivascular spaces; CMBs, cerebral microbleeds; CSVD, cerebral small vessel disease; DWMH, deep white matter hyperintensity; iNPH, idiopathic normal pressure hydrocephalus; PWMH, periventricular white matter hyperintensity.
Comparison of baseline characteristics among the different CSVD burden groups. Patients were categorized into four groups based on total CSVD score quartiles in this cohort (0 = no CSVD burden, 1 = low CSVD burden, 2 = moderate CSVD burden, and 3–4 = high CSVD burden): (A) age, (B) Trail‐Making Test Part A, (C) Trail‐Making Test Part B, (D) stride length, (E) step height, (F) gait velocity, (G) Timed Up and Go Test time, (H) iNPH grading scale–urinary function, (I) DESH score. ANOVA, analysis of variance; CSVD, cerebral small vessel disease; DESH, disproportionately enlarged subarachnoid hydrocephalus; iNPH, idiopathic normal pressure hydrocephalus.
Associations between CSVD imaging markers and cognitive and motor functions in patients with iNPH. A, Heatmap of Spearman correlation between the clinical outcomes and CSVD imaging markers. * Statistically significant. B, Results of multivariable linear regressions are presented in the forest plot adjusted for age, sex, and confounding vascular risk factors. AVLT‐T, Auditory Verbal Learning Test–Total score; BG‐EPVS, basal ganglia enlarged perivascular spaces; BNT, Boston Naming Test; CMBs, cerebral microbleeds; CSVD, cerebral small vessel disease; DWMH, deep white matter hyperintensity; MMSE, Mini‐Mental State Examination; MoCA, Montreal Cognitive Assessment; PWMH, periventricular white matter hyperintensity; TMT, Trail‐Making Test; TUG, Timed Up and Go test.
The contribution of cerebral small vessel disease in idiopathic normal pressure hydrocephalus: Insights from a prospective cohort study
  • Article
  • Full-text available

November 2024

·

19 Reads

INTRODUCTION Idiopathic normal pressure hydrocephalus (iNPH) and cerebral small vessel disease (CVSD) are age‐related diseases, but their prevalence and clinical relationship are unclear. METHODS This prospective cohort study enrolled 95 patients with probable iNPH in China and evaluated their CSVD burden using magnetic resonance imaging. Linear regression models were used to analyze the association between CSVD scores and clinical outcomes. RESULTS The results showed 78% of the patients had at least one CSVD imaging marker, and higher total CSVD scores were significantly associated with declines in attention, executive function, psychomotor speed, and gait performance after multivariate adjustments. However, the preoperative CSVD score did not affect the post‐shunt improvement in modified Rankin scale or iNPH grading scale scores. DISCUSSION Our findings suggest that CSVD is prevalent in patients with iNPH and is associated with more severe symptoms, but it may not affect shunt outcomes. Future studies are needed to elucidate the underlying mechanisms. Highlights We found that 78% of the patients with idiopathic normal pressure hydrocephalus (iNPH) had at least one type of cerebral small vessel disease (CSVD) imaging marker. The CSVD burden aggravates cognitive and gait impairments in patients with iNPH but may not affect shunt outcomes. The effects of different imaging markers of CSVD on cognition and gait are different and worthy of further investigation.

Download

The flow diagram of the study. Abbreviations: AI-cTB, MRI artificial intelligence-guided cognitive fusion targeted biopsy; cTB, cognitive fusion targeted biopsy
Study flowchart for AI-cTB versus routine cTB for PCa diagnosis. Abbreviations: AI, artificial intelligence; AI-cTB, MRI artificial intelligence-guided cognitive fusion targeted biopsy; csPCa, clinically significant prostate cancer; cTB, cognitive fusion targeted biopsy; DRE, digital rectal examination; MRI, magnetic resonance imaging; PI-RADS, Prostate Imaging Reporting and Data System; PSA, prostate-specific antigen; SB: systematic biopsy; TB, targeted biopsy; TRUS, transrectal ultrasound
Histogram of ISUP. a ISUP of patient biopsy specimens (n = 380). b ISUP of lesion biopsy specimens (n = 444). c ISUP of RP specimens (n = 69). Abbreviations: AI-cTB, MRI artificial intelligence-guided cognitive fusion targeted biopsy; cTB, cognitive fusion targeted biopsy; ISUP, International Society of Urological Pathology; RP, radical prostatectomy
Comparison of MRI artificial intelligence-guided cognitive fusion-targeted biopsy versus routine cognitive fusion-targeted prostate biopsy in prostate cancer diagnosis: a randomized controlled trial

November 2024

·

11 Reads

BMC Medicine

Background Cognitive fusion MRI-guided targeted biopsy (cTB) has been widely used in the diagnosis of prostate cancer (PCa). However, cTB relies heavily on the operator’s experience and confidence in MRI readings. Our objective was to compare the cancer detection rates of MRI artificial intelligence-guided cTB (AI-cTB) and routine cTB and explore the added value of using AI for the guidance of cTB. Methods This was a prospective, single-institution randomized controlled trial (RCT) comparing clinically significant PCa (csPCa) and PCa detection rates between AI-cTB and cTB. A total of 380 eligible patients were randomized to the AI-cTB group (n = 191) or the cTB group (n = 189). The AI-cTB group underwent AI-cTB plus systematic biopsy (SB) and the cTB group underwent routine cTB plus SB. The primary outcome was the detection rate of csPCa. The reference standard was the pathological results of the combination of TB (AI-cTB/cTB) and SB. Comparisons of detection rates of csPCa and PCa between groups were performed using the chi-square test or Fisher’s exact test. Results The overall csPCa and PCa detection rates of the whole inclusion cohort were 58.8% and 61.3%, respectively. The csPCa detection rates of TB combined with SB in the AI-cTB group were significantly greater than those in the cTB group at both the patient level (58.64% vs. 46.56%, p = 0.018) and per-lesion level (61.47% vs. 47.79%, p = 0.004). Compared with cTB, the AI-cTB could detect a greater proportion of patients with csPCa at both the per-patient level (69.39% vs. 49.71%, p < 0.001) and per-lesion level (68.97% vs. 48.57%, p < 0.001). Multivariate logistic analysis indicated that compared with the cTB, the AI-cTB significantly improved the possibility of detecting csPCa (p < 0.001). Conclusions AI-cTB effectively improved the csPCa detection rate. This study successfully integrated AI with TB in the routine clinical workflow and provided a research paradigm for prospective AI-integrated clinical studies. Trial registration ClinicalTrials.gov, NCT06362291.


The schematic diagram of correlation coefficient. (A): The correlation coefficient among the NLR, PLR and NAR; (B): The correlation coefficient between the NAR and outcome. NAR, neutrophil-albumin ratio; NLR, neutrophil-lymphocyte ratio; PLR: platelet-lymphocyte ratio.
ROC curve of NAR, NLR and serum cholesterol concentration at TBI patient admission. NAR, neutrophil-albumin ratio; NLR, neutrophil-lymphocyte ratio; AUC: area under curve.
The boxplots of the NAR, NLR and PLR grouped based on survival and prognosis. (A) The mean NAR of survivors group was higher than that in non-survivors group; (B) The mean NLR of survivors group was higher than that in non-survivors group; (C) The mean PLR of survivors group was lower than that in non-survivors group; (D) The mean NAR of poor-outcome group was higher than that in good-outcome group ; (E) The mean NLR of poor-outcome group was higher than that in good-outcome group ; (F) The mean PLR of poor-outcome group was lower than that in good-outcome group. NAR, neutrophil-albumin ratio; NLR, neutrophil-lymphocyte ratio; PLR: platelet-lymphocyte ratio.
Neutrophil-albumin ratio serves as a superior prognostic biomarker for traumatic brain injury

November 2024

·

19 Reads

·

1 Citation

Traumatic brain injury (TBI) represents a common and severe medical condition necessitating prompt risk stratification to enhance patient outcomes. Although substantial research has been conducted on the prognostic utility of various biomarkers for TBI, no single biomarker has been definitively recognized as the most precise predictor of disease outcomes. In comparison to other markers, the neutrophil-albumin ratio (NAR) has emerged as a cost-effective and reproducible inflammatory biomarker, demonstrating potential in evaluating the severity of inflammation and prognosticating outcomes in infections and cerebrovascular diseases. This study evaluated the prognostic significance of the NAR in comparison to two other readily accessible and cost-effective composite indices: the Neutrophil-Lymphocyte Ratio (NLR) and the Platelet-Lymphocyte Ratio (PLR) in individuals with TBI. We conducted a retrospective cohort analysis involving 297 hospitalized TBI patients, gathering comprehensive demographic, anthropometric, medical, clinical, laboratory, and imaging data to assess the expression changes of these biomarkers. Our findings suggest that both the NAR and the NLR possess predictive value regarding prognosis following TBI. However, receiver operating characteristic (ROC) curve analysis revealed that NAR outperformed NLR as a prognostic predictor. In conclusion, our examination of blood biochemistry composite indicators indicates that, while both NAR and NLR serve as significant prognostic markers, NAR is a more effective predictor of outcomes in patients with TBI.



Drowzee: Metamorphic Testing for Fact-Conflicting Hallucination Detection in Large Language Models

October 2024

·

3 Reads

·

6 Citations

Proceedings of the ACM on Programming Languages

Large language models (LLMs) have revolutionized language processing, but face critical challenges with security, privacy, and generating hallucinations — coherent but factually inaccurate outputs. A major issue is fact-conflicting hallucination (FCH), where LLMs produce content contradicting ground truth facts. Addressing FCH is difficult due to two key challenges: 1) Automatically constructing and updating benchmark datasets is hard, as existing methods rely on manually curated static benchmarks that cannot cover the broad, evolving spectrum of FCH cases. 2) Validating the reasoning behind LLM outputs is inherently difficult, especially for complex logical relations. To tackle these challenges, we introduce a novel logic-programming-aided metamorphic testing technique for FCH detection. We develop an extensive and extensible framework that constructs a comprehensive factual knowledge base by crawling sources like Wikipedia, seamlessly integrated into Drowzee. Using logical reasoning rules, we transform and augment this knowledge into a large set of test cases with ground truth answers. We test LLMs on these cases through template-based prompts, requiring them to provide reasoned answers. To validate their reasoning, we propose two semantic-aware oracles that assess the similarity between the semantic structures of the LLM answers and ground truth. Our approach automatically generates useful test cases and identifies hallucinations across six LLMs within nine domains, with hallucination rates ranging from 24.7% to 59.8%. Key findings include LLMs struggling with temporal concepts, out-of-distribution knowledge, and lack of logical reasoning capabilities. The results show that logic-based test cases generated by Drowzee effectively trigger and detect hallucinations. To further mitigate the identified FCHs, we explored model editing techniques, which proved effective on a small scale (with edits to fewer than 1000 knowledge pieces). Our findings emphasize the need for continued community efforts to detect and mitigate model hallucinations.



Integrative Analysis of Transcriptome and Metabolome Reveals the Pivotal Role of the NAM Family Genes in Oncidium hybridum Lodd. Pseudobulb Growth

September 2024

·

7 Reads

International Journal of Molecular Sciences

Oncidium hybridum Lodd. is an important ornamental flower that is used as both a cut flower and a potted plant around the world; additionally, its pseudobulbs serve as essential carriers for floral organs and flower development. The NAM gene family is crucial for managing responses to various stresses as well as regulating growth in plants. However, the mechanisms by which NAM genes regulate the development of pseudobulbs remain unclear. In this study, a total of 144 NAM genes harboring complete structural domains were identified in O. hybridum. The 144 NAM genes were systematically classified into 14 distinct subfamilies via phylogenetic analysis. Delving deeper into the conserved motifs revealed that motifs 1–6 exhibited remarkable conservation, while motifs 7–10 presented in a few NAM genes only. Notably, NAM genes sharing identical specific motifs were classified into the same subfamily, indicating functional relatedness. Furthermore, the examination of occurrences of gene duplication indicated that the NAM genes display 16 pairs of tandem duplications along with five pairs of segmental duplications, suggesting their role in genetic diversity and potential adaptive evolution. By conducting a correlation analysis integrating transcriptomics and metabolomics at four stages of pseudobulb development, we found that OhNAM023, OhNAM030, OhNAM007, OhNAM019, OhNAM083, OhNAM047, OhNAM089, and OhNAM025 exhibited significant relationships with the endogenous plant hormones jasmonates (JAs), hinting at their potential involvement in hormonal signaling. Additionally, OhNAM089, OhNAM025, OhNAM119, OhNAM055, and OhNAM136 showed strong links with abscisic acid (ABA) and abscisic acid glucose ester (ABA-GE), suggesting the possible regulatory function of these NAM genes in plant growth and stress responses. The 144 NAM genes identified in this study provide a basis for subsequent research and contribute to elucidating the intricate molecular mechanisms of NAM genes in Oncidium and potentially in other species.


Figure 1. Color changes of leaves at different heat stress times. The horizontal axis represents different heat stress times from 0 to 11 days, and the vertical axis represents the color parameters of L value, a value, b value, c value, and h value, respectively. The L value represents the brightness of the sample, with higher values indicating a brighter color. The a value represents the red-green color degree of the sample, with positive values indicating a red bias and negative values indicating a green bias. The b value represents the yellow-blue color degree of the sample, with positive values indicating a yellow bias and negative values indicating a blue bias. The c value represents the chroma of the sample, with higher values indicating a more saturated color. The h value represents the color phase of the sample, with numerical values indicating color angles.
Figure 2. Results of gene expression patterns and KEGG enrichment analyses: (a) KEGG enrichment analysis plot for H0 vs. H1 comparison group. According to the KEGG enrichment results, the degree of enrichment is measured by the Rich factor, FDR value, and the number of genes enriched on this pathway. Among them, the Rich factor refers to the ratio of the number of enriched differentially expressed genes in the pathway to the number of annotated differentially expressed genes. The larger the Rich factor, the greater the degree of enrichment becomes. The general range of FDR values is 0-1, and the closer it is to zero, the more significant the enrichment; (b) Venn diagram of differentially expressed genes between H0 vs. H1 and H1 vs. H2 groups; (c) Bar graph of differential transcription factors. The left graph represents the H0 vs. H1 group and the right graph represents the H1 vs. H2 group. The horizontal axis represents different transcription factor families, and the vertical axis represents the number of genes belonging to each transcription factor family.
Figure 3. Quality control of metabolomics data and content of metabolites: (a) Heatmap of the dem cluster analysis results. In the matrix, columns represent samples and rows represent metabolites. The clustering tree on the left displays the clustering of different metabolites, while the clustering tree at the top represents the clustering of samples. The gradient colors indicate the magnitude of the quantitative values; the deeper the red, the higher the expression level, while the deeper the blue, the lower the expression level. Metabolite names are not displayed when the number of metabolites exceeds 150; (b) DEM principal component analysis; (c) DEM analysis based on PLS-DA score. The x-axis (PC1) represents the scores of the first principal component, and the y-axis (PC2) represents the scores of the second principal component. Each point symbolizes a sample, the shaded area denotes the 95% confidence interval, and the colors indicate different groups.
Figure 4. Common metabolites: (a) Venn diagram of high-temperature metabolites in H0 vs. H1 and H1 vs. H2 groups; (b) changes in the content of common metabolites of H0 vs. H1 and H1 vs. H2 comparison groups over time under heat stress. The abscissa represents the change time of metabolite content from 0 to 2d, and the ordinate represents the relative content of metabolites. "*" indicates statistical significance, p ≤ 0.05. "**" indicates stronger statistical significance, p ≤ 0.01. While "ns" denotes non-significance, p > 0.05.
H0 vs. H1, changes in the common genes of the common metabolite Trehalose and Sinapoyl aldehyde under the H1 vs. H2 comparison group.
Combined Analysis of Transcriptome and Metabolome Provides Insights in Response Mechanism under Heat Stress in Avocado (Persea americana Mill.)

September 2024

·

16 Reads

International Journal of Molecular Sciences

Plants generate a range of physiological and molecular responses to sustain their growth and development when suffering heat stress. Avocado is a type of tropical fruit tree with high economic value. Most avocado cultivars delete, wither, or even die when exposed to heat stress for a long time, which seriously restricts the introduction and cultivation of avocados. In this study, samples of a heat-intolerant variety (‘Hass’) were treated under heat stress, and the transcriptomics and metabolomics were analyzed, with the expectation of providing information on the variety improvement and domestication of avocados. The differentially expressed genes identified using transcriptome analysis mainly involved metabolic pathways such as plant hormone signal transduction, plant–pathogen interaction, and protein processing in the endoplasmic reticulum. Combined transcriptome and metabolome analysis indicated that the down-regulation of Hass.g03.10206 and Hass.g03.10205 in heat shock-like proteins may result in the reduced Trehalose and Sinapoyl aldehyde content. Metabolomics analysis results indicated that the decrease in Trehalose and Sinapoyl aldehyde content may be an important factor for heat intolerance. These results provide important clues for understanding the physiological mechanisms of adaptation to heat stress in avocados.



Citations (39)


... Serum GFAP and UCH-L1 have also been identified as biomarkers for brain injury [19]. Recently, the neutrophil-albumin ratio has been suggested as a superior prognostic biomarker for traumatic brain injury [20]. Unfortunately, S100B, serum GFAP or UCH-L1, and albumin are not routinely measured in poisoned patients in our ED. ...

Reference:

Munich cCT Rule for Patients with Recreational Drug and Ethanol Poisoning
Neutrophil-albumin ratio serves as a superior prognostic biomarker for traumatic brain injury

... The alternative is Retrieval Augmented Generation (RAG) [6], which is better suited for tasks requiring evolving knowledge, such as integration of the latest industry news. However, both methods do not fully circumvent the inherent limitation of LLMs -hallucination, which manifests in inconsistent or fabricated claims [7] that can be subtle and phrased confidently even if factually incorrect [8]. For highly sensitive working environments such as financial institutions, the inability to ensure faithful LLM outputs can be one of the biggest limitations to widespread adoption of LLM applications [9]. ...

Drowzee: Metamorphic Testing for Fact-Conflicting Hallucination Detection in Large Language Models
  • Citing Article
  • October 2024

Proceedings of the ACM on Programming Languages

... Regarding the security risks, allowing users to fine-tune the LLMs with customized datasets will introduce additional security risks and challenges, e.g., safety alignment breaking, backdoor attack, and hallucination [46,54,58], even though the malicious adversaries are limited in their ability to finetune details. Firstly, safety alignment can be vulnerable when facing malicious fine-tuning [54,57]. ...

A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
  • Citing Conference Paper
  • January 2024

... This attack method segments the jailbreak prompt into subprompts following semantic rules, and conceals them in benign contextual tasks, which can elicit the target LLM to follow the instructions and examples to recover the concealed harmful prompt and generate the corresponding responses. Besides, Chang et al. [12] develop Puzzler, which provides clues about the jailbreak objective by first querying LLMs about their defensive strategies, and then acquiring the offensive methods from LLMs. After that, Puzzler encourages LLMs to infer the true intent concealed within the fragmented information and generate malicious responses. ...

Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
  • Citing Conference Paper
  • January 2024

... Filter können etwa bestimmte Schlüsselwörter oder Themengebiete erkennen, die gegen Richtlinien verstoßen, und die Generierung entsprechender Inhalte blockieren oder modifizieren. Dabei ist inzwischen ein Wettstreit zwischen Anbietern und Angreifern entstanden [17], da Angreifer die Filter durch gezielte Formulierungen zu umgehen versuchen. So könnte bspw. ...

A Hitchhiker’s Guide to Jailbreaking ChatGPT via Prompt Engineering
  • Citing Conference Paper
  • July 2024

... To answer this, consider the following two hypothetical settings where this question might be asked: (1) Internal Audit: At the new start-up Chasm Intellect, the LM alignment team is looking for a way to trigger an alarm when LM behavior changes. They know that LM model behavior can change unexpectedly (Li et al., 2024b). For example, fine-tuning a model that has undergone safety evaluation (e.g., GPT-3.5 Turbo (OpenAI, 2023)) can cause it to become less safe, even when using a benign fine-tuning dataset (e.g., Alpaca (Taori et al., 2023); (Qi et al., 2023)). ...

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection
  • Citing Article
  • July 2024

... There is a positive correlation between organic carbon and microbial biomass carbon (Kamala Haripal et al.). Liao et al (2024) found that MBC increases with organic inputs, especially in soils under agricultural use. Sahoo et al (2019) confirmed that organic amendments improve microbial biomass, enhancing long-term soil productivity and resilience. ...

Contribution and control of microbial necromass carbon in wetland soils
  • Citing Article
  • July 2024

Aquatic Sciences

... Furthermore, healthy D. citri prefer feeding on CLas-infected citrus plants, resulting in the rapid spread of HLB. [5][6][7][8][9][10] Therefore, controlling the occurrence of D. citri and reducing its ability to acquire CLas are currently the main strategies for preventing and controlling the spread of HLB. 11 Chemical 12 and microbial [13][14][15] pesticides are the main tools used to control D. citri. ...

Candidatus Liberibacter asiaticus influences the emergence of the Asian citrus psyllid Diaphorina citri by regulating key cuticular proteins
  • Citing Article
  • June 2024

Insect Science

... The tokens transformed from the training corpus form the vocabulary dictionary of LLMs, and the vocabulary dictionary in turn determines the capacity of LLMs to produce diverse and comprehensive output. The rapid advancement of LLMs has brought attention to various anomalous phenomena [10,11,18,19,21,23,39], one of which is the existence of "glitch tokens". These tokens exhibit anomalies in constructing the expected semantics, and are subsequently reflected in the abnormal and unexpected decoding in the LLM's output. ...

MeTMaP: Metamorphic Testing for Detecting False Vector Matching Problems in LLM Augmented Generation
  • Citing Conference Paper
  • June 2024

... In recent years, the Bird's Eye View (BEV) representation has shown significant potential in the field of multi-modal fusion [5]. By integrating various sensors under the BEV representation, a unified 3D perception space is created, allowing for seamless integration with downstream tasks such as(planning [6] [7], map building [8] [9] and semantic understanding [10]). ...

Perception and Planning of Intelligent Vehicles Based on BEV in Extreme Off-Road Scenarios
  • Citing Article
  • April 2024

IEEE Transactions on Intelligent Vehicles