Yao Xuan’s research while affiliated with University of California, Santa Barbara and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (11)


Figure 2: Applying location encoders to differentiate two visually similar species ((a)-(j)) or two visually similar land use types ((k)-(t)). Arctic fox and bat-eared fox might look very similar visually as shown in (a) and (f). However, they have different spatial distributions. (b) and (g) show their distinct patterns in species image locations. (c)-(e): The predicted distributions of Arctic fox from different location encoders (without images as input). (h)-(j): The predicted distributions of bat-eared fox. Similarly, it might be hard to differentiate factories/powerplants from multi-unit residential buildings only based on their overhead satellite imgeries as shown in (k) and (p). However, as shown in (l) and (q), they have very different global spatial distributions. (m)-(o) and (r)-(t) show the predicted spatial distributions of factories/powerplants and multi-unit residential buildings from different location encoders. We can see that while í µí±¤í µí±Ÿí µí±Ží µí± (Mac Aodha et al., 2019) produces a over-generalized spatial distribution, í µí± í µí±ℎí µí±’í µí±Ÿí µí±’í µí° ¶+ and
Figure 3: An illustration for map projection distortion: (a)-(d): Tissot indicatrices for four projections. The equal area circles are putted in different locations to show how the map distortion affect its shape.
Figure 5: The data distributions of four synthetic datasets (U1, U2, U3, and U4) generated from the uniform sampling method. (e) shows the U4 dataset in a 3D Euclidean space. We can see that if we treat these datasets as 2D data points as í µí±”í µí±Ÿí µí±–í µí±‘ and
Table 5
Figure 6: The data distributions of four synthetic datasets (S1.3, S2.3, S3.3, and S4.3) generated from the stratified sampling method with í µí¼ í µí±ší µí±Ží µí±¥ = 64. We can see that when í µí± í µí¼‡ increases, a more fine-grain stratified sampling is carried out. The resulting dataset has a larger data bias toward the polar areas.

+7

Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions
  • Preprint
  • File available

June 2023

·

235 Reads

·

Yao Xuan

·

·

[...]

·

Generating learning-friendly representations for points in space is a fundamental and long-standing problem in ML. Recently, multi-scale encoding schemes (such as Space2Vec and NeRF) were proposed to directly encode any point in 2D/3D Euclidean space as a high-dimensional vector, and has been successfully applied to various geospatial prediction and generative tasks. However, all current 2D and 3D location encoders are designed to model point distances in Euclidean space. So when applied to large-scale real-world GPS coordinate datasets, which require distance metric learning on the spherical surface, both types of models can fail due to the map projection distortion problem (2D) and the spherical-to-Euclidean distance approximation error (3D). To solve these problems, we propose a multi-scale location encoder called Sphere2Vec which can preserve spherical distances when encoding point coordinates on a spherical surface. We developed a unified view of distance-reserving encoding on spheres based on the DFS. We also provide theoretical proof that the Sphere2Vec preserves the spherical surface distance between any two points, while existing encoding schemes do not. Experiments on 20 synthetic datasets show that Sphere2Vec can outperform all baseline models on all these datasets with up to 30.8% error rate reduction. We then apply Sphere2Vec to three geo-aware image classification tasks - fine-grained species recognition, Flickr image recognition, and remote sensing image classification. Results on 7 real-world datasets show the superiority of Sphere2Vec over multiple location encoders on all three tasks. Further analysis shows that Sphere2Vec outperforms other location encoder models, especially in the polar regions and data-sparse areas because of its nature for spherical surface distance preservation. Code and data are available at https://gengchenmai.github.io/sphere2vec-website/.

Download

Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions

June 2023

·

693 Reads

·

44 Citations

ISPRS Journal of Photogrammetry and Remote Sensing

We propose a general-purpose spherical location encoder, Sphere2Vec, which, as far as we know, is the first location encoder which aims at preserving spherical distance. • We provide a theoretical proof about the spherical-distance-kept nature of Sphere2Vec. • We provide theoretical proof to show why the previous 2D location encoders and NeRF-style 3D location encoders cannot model spherical distance correctly. • We first construct 20 synthetic datasets based on the mixture of von Mises-Fisher (MvMF) distributions and show that Sphere2Vec can outperform all baseline models including the state-of-the-art (SOTA) 2D location encoders and NeRF-style 3D location encoders on all these datasets with an up to 30.8% error rate reduction. • Next, we conduct extensive experiments on seven real-world datasets for three geo-aware image classification tasks. Results show that Sphere2Vec outperforms all baseline models on all datasets. • Further analysis shows that Sphere2Vec is able to produce finer-grained and compact spatial distributions, and does significantly better than 2D and 3D Euclidean location encoders in the polar regions and areas with sparse training samples. A B S T R A C T Generating learning-friendly representations for points in space is a fundamental and long-standing problem in machine learning. Recently, multi-scale encoding schemes (such as Space2Vec and NeRF) were proposed to directly encode any point in 2D or 3D Euclidean space as a high-dimensional vector, and has been successfully applied to various (geo)spatial prediction and generative tasks. However, all current 2D and 3D location encoders are designed to model point distances in Euclidean space. So when applied to large-scale real-world GPS coordinate datasets (e.g., species or satellite images taken all over the world), which require distance metric learning on the spherical surface, both types of models can fail due to the map projection distortion problem (2D) and the spherical-to-Euclidean distance approximation error (3D). To solve these problems, we propose a multi-scale location en-coder called Sphere2Vec which can preserve spherical distances when encoding point coordinates on a spherical surface. We developed a unified view of distance-reserving encoding on spheres based on the Double Fourier Sphere (DFS). We also provide theoretical proof that the Sphere2Vec encoding preserves the spherical surface distance between any two points, while existing encoding schemes such as Space2Vec and NeRF do not. Experiments on 20 synthetic datasets show that Sphere2Vec can outperform all baseline models including the state-of-the-art (SOTA) 2D location encoder (i.e., Space2Vec) and 3D encoder NeRF on all these datasets with up to 30.8% error rate reduction. We then apply Sphere2Vec to three geo-aware image classification tasks-fine-grained species recognition, Flickr image recognition, and remote sensing image classification. Results on 7 real-world datasets show the superiority of Sphere2Vec over multiple 2D and 3D location encoders on all three tasks. Further analysis shows that Sphere2Vec outperforms other location encoder models, especially in the polar regions and data-sparse areas because of its nature for spherical surface distance preservation. Code and data of this work are available at https://gengchenmai.github.io/sphere2vec-website/.


Machine learning and polymer self-consistent field theory in two spatial dimensions

April 2023

·

33 Reads

·

6 Citations

A computational framework that leverages data from self-consistent field theory simulations with deep learning to accelerate the exploration of parameter space for block copolymers is presented. This is a substantial two-dimensional extension of the framework introduced in the work of Xuan et al. [J. Comput. Phys. 443, 110519 (2021)]. Several innovations and improvements are proposed. (1) A Sobolev space-trained, convolutional neural network is employed to handle the exponential dimension increase of the discretized, local average monomer density fields and to strongly enforce both spatial translation and rotation invariance of the predicted, field-theoretic intensive Hamiltonian. (2) A generative adversarial network (GAN) is introduced to efficiently and accurately predict saddle point, local average monomer density fields without resorting to gradient descent methods that employ the training set. This GAN approach yields important savings of both memory and computational cost. (3) The proposed machine learning framework is successfully applied to 2D cell size optimization as a clear illustration of its broad potential to accelerate the exploration of parameter space for discovering polymer nanostructures. Extensions to three-dimensional phase discovery appear to be feasible.


Machine Learning and Polymer Self-Consistent Field Theory in Two Spatial Dimensions

December 2022

·

1,041 Reads

A computational framework that leverages data from self-consistent field theory simulations with deep learning to accelerate the exploration of parameter space for block copolymers is presented. This is a substantial two-dimensional extension of the framework introduced in [1]. Several innovations and improvements are proposed. (1) A Sobolev space-trained, convolutional neural network (CNN) is employed to handle the exponential dimension increase of the discretized, local average monomer density fields and to strongly enforce both spatial translation and rotation invariance of the predicted, field-theoretic intensive Hamiltonian. (2) A generative adversarial network (GAN) is introduced to efficiently and accurately predict saddle point, local average monomer density fields without resorting to gradient descent methods that employ the training set. This GAN approach yields important savings of both memory and computational cost. (3) The proposed machine learning framework is successfully applied to 2D cell size optimization as a clear illustration of its broad potential to accelerate the exploration of parameter space for discovering polymer nanostructures. Extensions to three-dimensional phase discovery appear to be feasible.



Towards general-purpose representation learning of polygonal geometries

October 2022

·

451 Reads

·

42 Citations

GeoInformatica

Neural network representation learning for spatial data (e.g., points, polylines, polygons, and networks) is a common need for geographic artificial intelligence (GeoAI) problems. In recent years, many advancements have been made in representation learning for points, polylines, and networks, whereas little progress has been made for polygons, especially complex polygonal geometries. In this work, we focus on developing a general-purpose polygon encoding model, which can encode a polygonal geometry (with or without holes, single or multipolygons) into an embedding space. The result embeddings can be leveraged directly (or finetuned) for downstream tasks such as shape classification, spatial relation prediction, building pattern classification, cartographic building generalization, and so on. To achieve model generalizability guarantees, we identify a few desirable properties that the encoder should satisfy: loop origin invariance, trivial vertex invariance, part permutation invariance, and topology awareness. We explore two different designs for the encoder: one derives all representations in the spatial domain and can naturally capture local structures of polygons; the other leverages spectral domain representations and can easily capture global structures of polygons. For the spatial domain approach we propose ResNet1D, a 1D CNN-based polygon encoder, which uses circular padding to achieve loop origin invariance on simple polygons. For the spectral domain approach we develop NUFTspec based on Non-Uniform Fourier Transformation (NUFT), which naturally satisfies all the desired properties. We conduct experiments on two different tasks: 1) polygon shape classification based on the commonly used MNIST dataset; 2) polygon-based spatial relation prediction based on two new datasets (DBSR-46K and DBSR-cplx46K) constructed from OpenStreetMap and DBpedia. Our results show that NUFTspec and ResNet1D outperform multiple existing baselines with significant margins. While ResNet1D suffers from model performance degradation after shape-invariance geometry modifications, NUFTspec is very robust to these modifications due to the nature of the NUFT representation. NUFTspec is able to jointly consider all parts of a multipolygon and their spatial relations during prediction while ResNet1D can recognize the shape details which are sometimes important for classification. This result points to a promising research direction of combining spatial and spectral representations.


Towards General-Purpose Representation Learning of Polygonal Geometries

September 2022

·

245 Reads

Neural network representation learning for spatial data is a common need for geographic artificial intelligence (GeoAI) problems. In recent years, many advancements have been made in representation learning for points, polylines, and networks, whereas little progress has been made for polygons, especially complex polygonal geometries. In this work, we focus on developing a general-purpose polygon encoding model, which can encode a polygonal geometry (with or without holes, single or multipolygons) into an embedding space. The result embeddings can be leveraged directly (or finetuned) for downstream tasks such as shape classification, spatial relation prediction, and so on. To achieve model generalizability guarantees, we identify a few desirable properties: loop origin invariance, trivial vertex invariance, part permutation invariance, and topology awareness. We explore two different designs for the encoder: one derives all representations in the spatial domain; the other leverages spectral domain representations. For the spatial domain approach, we propose ResNet1D, a 1D CNN-based polygon encoder, which uses circular padding to achieve loop origin invariance on simple polygons. For the spectral domain approach, we develop NUFTspec based on Non-Uniform Fourier Transformation (NUFT), which naturally satisfies all the desired properties. We conduct experiments on two tasks: 1) shape classification based on MNIST; 2) spatial relation prediction based on two new datasets - DBSR-46K and DBSR-cplx46K. Our results show that NUFTspec and ResNet1D outperform multiple existing baselines with significant margins. While ResNet1D suffers from model performance degradation after shape-invariance geometry modifications, NUFTspec is very robust to these modifications due to the nature of the NUFT.


Figure 2: Schematic plot of fictitious play: each player derives optimal policies at stage m + 1 assuming other players take optimal strategies at stage m.
Figure 3: Illustration of one stage of enhanced deep fictitious play. At the (m + 1) th stage, one needs to solve the PDEs (8), which is approximated by solving the BSDEs (9)-(10). Then with the help of neural networks, one solves the variational problem (VP) given by Equation (11) to get the optimal strategy.
Figure 4: Plots of optimal policies (top-left), Susceptibles (top-right), Exposed (bottom-left) and Infectious (bottom-right) for three states: New York (blue), New Jersey (orange) and Pennsylvania (green). The shaded areas depict the mean and 95% confidence interval over 256 sample paths. Choices of parameters are a = 100 and θ = 0.99.
Pandemic Control, Game Theory and Machine Learning

August 2022

·

143 Reads

Game theory has been an effective tool in the control of disease spread and in suggesting optimal policies at both individual and area levels. In this AMS Notices article, we focus on the decision-making development for the intervention of COVID-19, aiming to provide mathematical models and efficient machine learning methods, and justifications for related policies that have been implemented in the past and explain how the authorities' decisions affect their neighboring regions from a game theory viewpoint.


Figure 9: Embedding clusterings of iNat2018 models. (a) wrap˚withwrap˚with 4 hidden ReLU layers of 256 neurons; (d) rbf with the best kernel size σ " 1 and number of anchor points m " 200; (b)(c)(e)(f) are Space2V ec models [25] with different min scale rmin " t10´6t10´6 , 10´310´3 u. a (g)-(l) are Sphere2V ec models with different min scale. b
Sphere2Vec: Multi-Scale Representation Learning over a Spherical Surface for Geospatial Predictions

January 2022

·

147 Reads

Generating learning-friendly representations for points in a 2D space is a fundamental and long-standing problem in machine learning. Recently, multi-scale encoding schemes (such as Space2Vec) were proposed to directly encode any point in 2D space as a high-dimensional vector, and has been successfully applied to various (geo)spatial prediction tasks. However, a map projection distortion problem rises when applying location encoding models to large-scale real-world GPS coordinate datasets (e.g., species images taken all over the world) - all current location encoding models are designed for encoding points in a 2D (Euclidean) space but not on a spherical surface, e.g., earth surface. To solve this problem, we propose a multi-scale location encoding model called Sphere2V ec which directly encodes point coordinates on a spherical surface while avoiding the mapprojection distortion problem. We provide theoretical proof that the Sphere2Vec encoding preserves the spherical surface distance between any two points. We also developed a unified view of distance-reserving encoding on spheres based on the Double Fourier Sphere (DFS). We apply Sphere2V ec to the geo-aware image classification task. Our analysis shows that Sphere2V ec outperforms other 2D space location encoder models especially on the polar regions and data-sparse areas for image classification tasks because of its nature for spherical surface distance preservation.


Deep learning and self-consistent field theory: A path towards accelerating polymer phase discovery

June 2021

·

122 Reads

·

22 Citations

Journal of Computational Physics

A new framework that leverages data obtained from self-consistent field theory (SCFT) simulations with deep learning to accelerate the exploration of parameter space for block copolymers is presented. Deep neural networks are adapted and trained in Sobolev space to better capture the saddle point nature of the SCFT approximation. The proposed approach consists of two main problems: 1) the learning of an approximation to the effective Hamiltonian as a function of the average monomer density fields and the relevant physical parameters and 2) the prediction of saddle density fields given the polymer parameters. There is an additional challenge: the effective Hamiltonian has to be invariant under shifts (and rotations in 2D and 3D). A data-enhancing approach and an appropriate regularization are introduced to effectively achieve said invariance. In this first study, the focus is on one-dimensional (in physical space) systems to allow for a thorough exploration and development of the proposed methodology.


Citations (5)


... • Geo-tagged Imagery refers to images linked with metadata from platforms like social media (e.g., YouTube, Facebook), open mapping services (e.g., Google Maps), which contains geographic coordinates, timestamps, identifiers, etc., [65,66,107,109]. • Descriptive Texts sourced from online encyclopedias (e.g., websites [94], Wikipedia [84]), and generative models [30,103], provide contextual details about locations and entities [78]. ...

Reference:

Unlocking Location Intelligence: A Survey from Deep Learning to The LLM Era
Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions

ISPRS Journal of Photogrammetry and Remote Sensing

... Progress has also been made in introducing deep learning methods for the SCFT simulation of BCPs. Xuan et al. 40,41 trained a surrogate model for the Hamiltonian in Sobolev space, which predicts the Hamiltonian using the average monomer density and related physical parameters. They subsequently utilized this surrogate model to identify the density field that minimizes the Hamiltonian given physical parameters. ...

Machine learning and polymer self-consistent field theory in two spatial dimensions
  • Citing Article
  • April 2023

... A Nash equilibrium computed from such a game will definitely provide some qualitative guidance and insights for policymakers on the impact of certain policies. However, even with only three states (New York, New Jersey, and Pennsylvania) and a simple stochastic SEIR model as in [239,240], this problem's state space is already twelve dimensions. Figure 1 below showcases the equilibrium lockdown policy corresponding to the multi-region SEIR model solved by a deep learning algorithm proposed in [122] (see Section 3.1.2) ...

Pandemic Control, Game Theory, and Machine Learning
  • Citing Article
  • December 2022

Notices of the American Mathematical Society

... Learning spatial representations and encoding geographic locations is an active area of work within GeoAI, with many recent approaches and even libraries to standardize them [21,37]. Broadly, these works have focused on the following aspects: encoding a point or a spatial object through analytical functions [22,23], using aerial and street-level imagery at a location for learning imagebased representations [5,16], and encoding various task-relevant properties at given geographic coordinates [3,27,36]. Many of these approaches have proved effective on a range of geospatial classification and regression tasks. ...

Towards general-purpose representation learning of polygonal geometries

GeoInformatica

... Progress has also been made in introducing deep learning methods for the SCFT simulation of BCPs. Xuan et al. 40,41 trained a surrogate model for the Hamiltonian in Sobolev space, which predicts the Hamiltonian using the average monomer density and related physical parameters. They subsequently utilized this surrogate model to identify the density field that minimizes the Hamiltonian given physical parameters. ...

Deep learning and self-consistent field theory: A path towards accelerating polymer phase discovery
  • Citing Article
  • June 2021

Journal of Computational Physics