Xuan Gong’s research while affiliated with Zhejiang University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (2)


Figure 1: Trends in GPT model parameter size and growth multiplier. Data Source: https://huggingface. co/docs/transformers/en/index, designed by authors.
Figure 3: Composite risks and governance gaps for AI training data utilized by pre-trained and posttrained models. Source: Designed by authors.
Figure 4: Lifecycle-based AI training data governance mechanism for blended models. Source: Designed by authors.
Big Data Versus Big GPU: Evolving Requirements and Governance Dynamics of AI Training Data
  • Article
  • Full-text available

April 2025

·

3 Reads

International Journal of Digital Law and Governance

Le Cheng

·

Xuan Gong

·

Yun Zhao

Pre-trained large language models (LLMs), epitomized by ChatGPT, have leveraged a cornucopia of “big data” to attain substantial leaps in artificial intelligence (AI). Whereas the diminishing returns from pre-training and the depletion of available training data have become evident, the post-training scaling law bolstered by “big GPU” has surfaced as an overriding strategy. Since 2024, post-trained models exemplified by o1 and DeepSeek-R1 have been widely acclaimed as successes in logic-intensive fields like advanced scientific problem-solving, serving as a bellwether for artificial general intelligence (AGI). Driven by the two cardinal elements of computing power and task-specific datasets, the data training processes of post-trained models exhibit more erratic and uncontrollable tendencies, which may be a menace to core societal domains and precipitate systemic friction vis-à-vis the existing data governance derived from pre-trained models. At this watershed moment, this article aims to conduct a comprehensive comparison of training data paradigms between pre-trained and post-trained models and to further develop cogent and favorable governance responses to mitigate emerging risks. Consequently, data security must be established as a prerequisite for AI development, and a lifecycle-based governance framework for AI training data in blended models can be introduced in the metamorphosis toward “bigger AI models”.

Download

An illustrative schema of the interrelation of ANI, AGI, and ASI.
An illustrative diagram of the upsides and downsides of ANI, AGI, and ASI.
An illustrative figure of the regulatory approaches of technical, legal, and ethical levels during the whole operational process of AGI.
Appraising Regulatory Framework Towards Artificial General Intelligence (AGI) Under Digital Humanism

December 2024

·

76 Reads

·

2 Citations

International Journal of Digital Law and Governance

The explosive advancement of contemporary artificial intelligence (AI) technologies, typified by ChatGPT, is steering humans towards an uncontrollable trajectory to artificial general intelligence (AGI). Against the backdrop of a series of transformative breakthroughs, big tech companies such as OpenAI and Google have initiated an “AGI race” on a supranational level. As technological power becomes increasingly absolute, structural challenges may erupt with an unprecedented velocity, potentially resulting in disorderly expansion and even malignant development of AI technologies. To preserve the dignity and safety of human-beings in a brand-new AGI epoch, it is imperative to implement regulatory guidelines to limit the applications of AGI within the confines of human ethics and rules to further counteract the potential downsides. To promote the benevolent evolution of AGI, the principles of Humanism should be underscored and the connotation of Digital Humanism should be further enriched. Correspondingly, the current regulatory paradigm for generative AI may also be overhauled under the tenet of Digital Humanism to adapt to the quantum leaps and subversive shifts produced by AGI in the future. Positioned at the nexus of legal studies, computer science, and moral philosophy, this study therefore charts a course for a synthetic regulation framework of AGI under Digital Humanism.

Citations (1)


... Shortly after DeepSeek's launch, its online service was subjected to large-scale attacks, with its cloud servers experiencing one major API disruption in December 2024, two in January 2025, and daily disruptions lasting five to 6 h from January 27 to February 5, 2025. 3 Data security and privacy protection are critical metrics in the competitive computing market (Cheng and Gong 2024). They not only determine whether Chinese AI enterprises can successfully expand internationally but also serve as common leverage used by Western countries. ...

Reference:

Opportunities, Challenges, and Regulatory Responses to China’s AI Computing Power Development under DeepSeek’s Changing Landscape
Appraising Regulatory Framework Towards Artificial General Intelligence (AGI) Under Digital Humanism

International Journal of Digital Law and Governance