About
Activity
-
Happy to share that I recently joined Meta to work on GenAI models for Wearable devices (smart glasses). Excited to work with the team and deliver a…
Happy to share that I recently joined Meta to work on GenAI models for Wearable devices (smart glasses). Excited to work with the team and deliver a…
Liked by Fei Xia
-
Exciting Updates on My New Journey! 🚀 It’s hard to believe it’s been almost four months since my last update—time truly flies! Many friends have…
Exciting Updates on My New Journey! 🚀 It’s hard to believe it’s been almost four months since my last update—time truly flies! Many friends have…
Liked by Fei Xia
Experience & Education
Licenses & Certifications
-
-
Successful Negotiation: Essential Strategies and Skills
Coursera Verified Certificates
-
Publications
-
Block-coordinate Frank-Wolfe Optimization for Counting Objects in Images
Neural Information Processing Systems (NeurIPS) Optimization Workshop
We develop an optimization method to count objects in images. To avoid the detection of individual objects, which is computationally expensive and relies heavily on image quality, we model the object density as a linear transformation of each pixel feature, and obtain the object count by integrating density over the image. Learning such linear transformation is formulated as the minimization of a regularized quadratic function. Solving this optimization problem is highly nontrivial because it…
We develop an optimization method to count objects in images. To avoid the detection of individual objects, which is computationally expensive and relies heavily on image quality, we model the object density as a linear transformation of each pixel feature, and obtain the object count by integrating density over the image. Learning such linear transformation is formulated as the minimization of a regularized quadratic function. Solving this optimization problem is highly nontrivial because it has exponentially large number of constraints. To cope with this challenge, inspired by the structural support vector machine (SVM), we explore the Block-Coordinate Frank-Wolfe (BCFW) algorithm, which is a state-of-the-art algorithm to solve structural SVM. However, BCFW cannot be directly applied to our problem. We derive the dual of our optimization problem and solve it by BCFW with modifications. Experiments show that BCFW solves our problem with lower iteration cost, faster convergence, and decent error rate.
Other authorsSee publication -
Max-margin Latent Feature Relational Models for Entity-Attribute Networks
International Joint Conference on Neural Networks (IJCNN)
Link prediction is a fundamental task in statistical analysis of network data. Though much research has concentrated on predicting entity-entity relationships in homogeneous networks, it has attracted increasing attentions to predict relationships in heterogeneous networks, which consist of multiple types of nodes and relational links. Existing work on heterogeneous network link prediction mainly focuses on using input features that are explicitly extracted by humans. This paper presents…
Link prediction is a fundamental task in statistical analysis of network data. Though much research has concentrated on predicting entity-entity relationships in homogeneous networks, it has attracted increasing attentions to predict relationships in heterogeneous networks, which consist of multiple types of nodes and relational links. Existing work on heterogeneous network link prediction mainly focuses on using input features that are explicitly extracted by humans. This paper presents an
approach to automatically learn latent features from partially observed heterogeneous networks, with a particular focus on entity-attribute networks (EANs), and making predictions for
unseen pairs. To make the latent features discriminative, we adopt the max-margin idea under the framework of maximum entropy discrimination (MED). Our maximum entropy discrimination joint relational model (MED-JRM) can jointly predict entity-entity relationships as well as the missing attributes of entities in EANs. Experimental results on several real networks demonstrate
that our model has improved performance over state-of-the-art homogeneous and heterogeneous network link prediction algorithms.Other authorsSee publication -
Parameter Server for Distributed Machine Learning
Neural Information Processing Systems (NeurIPS Workshop)
We propose a parameter server framework to solve distributed machine learning problems. Both data and workload are distributed into client nodes, while server nodes maintain globally shared parameters, which are represented as sparse vectors and matrices. The framework manages asynchronous data communications between clients and servers. Flexible consistency models, elastic scalability and fault tolerance are supported by this framework. We present algorithms and theoretical analysis for…
We propose a parameter server framework to solve distributed machine learning problems. Both data and workload are distributed into client nodes, while server nodes maintain globally shared parameters, which are represented as sparse vectors and matrices. The framework manages asynchronous data communications between clients and servers. Flexible consistency models, elastic scalability and fault tolerance are supported by this framework. We present algorithms and theoretical analysis for challenging nonconvex and nonsmooth problems. To demonstrate the scalability of the proposed framework, we show experimental results on real data with billions of parameters.
Other authorsSee publication -
Generalized Relational Topic Models with Data Augmentation
International Joint Conferences on Artificial Intelligence (IJCAI)
Relational topic models have shown promise on analyzing document network structures and discovering latent topic representations. This paper presents three extensions: 1) unlike the common link likelihood with a diagonal weight matrix that allows the-same-topic interactions only, we generalize it to use a full weight matrix that captures all pairwise topic interactions and is applicable to asymmetric networks; 2) instead of doing standard Bayesian inference, we perform regularized Bayesian…
Relational topic models have shown promise on analyzing document network structures and discovering latent topic representations. This paper presents three extensions: 1) unlike the common link likelihood with a diagonal weight matrix that allows the-same-topic interactions only, we generalize it to use a full weight matrix that captures all pairwise topic interactions and is applicable to asymmetric networks; 2) instead of doing standard Bayesian inference, we perform regularized Bayesian inference with a regularization parameter to deal with the imbalanced link structure issue in common real networks; and 3) instead of doing variational approximation with strict mean-field assumptions, we present a collapsed Gibbs sampling algorithm for the generalized relational topic models without making restricting assumptions. Experimental results demonstrate the significance of these extensions on improving the prediction performance, and the time efficiency can be dramatically improved with a simple fast approximation method.
Other authorsSee publication -
Discriminative Relational Topic Models
Submitted to Pattern Analysis and Machine Intelligence (PAMI)
Many scientific and engineering fields involve analyzing network data. For document networks, relational topic models (RTMs) provide a probabilistic generative process to describe both the link structure and document contents, and they have shown promise on predicting network structures and discovering latent topic representations. However, existing RTMs have limitations in both the restricted model expressiveness and incapability of dealing with imbalanced network data. To expand the scope and…
Many scientific and engineering fields involve analyzing network data. For document networks, relational topic models (RTMs) provide a probabilistic generative process to describe both the link structure and document contents, and they have shown promise on predicting network structures and discovering latent topic representations. However, existing RTMs have limitations in both the restricted model expressiveness and incapability of dealing with imbalanced network data. To expand the scope and improve the inference accuracy of RTMs, this paper presents three extensions: 1) unlike the common link likelihood with a diagonal weight matrix that allows the-same-topic interactions only, we generalize it to use a full weight matrix that captures all pairwise topic interactions and is applicable to asymmetric networks; 2) instead of doing standard Bayesian inference, we perform regularized Bayesian inference (RegBayes) with a regularization parameter to deal with the imbalanced link structure issue in common real networks and improve the discriminative ability of learned latent representations; and 3) instead of doing variational approximation with strict mean-field assumptions, we present collapsed Gibbs sampling algorithms for the generalized relational topic models by exploring data augmentation without making restricting assumptions. Under the generic RegBayes framework, we carefully investigate two popular discriminative loss functions, namely, the logistic log-loss and the max-margin hinge loss. Experimental results on several real network datasets demonstrate the significance of these extensions on improving the prediction performance, and the time efficiency can be dramatically improved with a simple fast approximation method.
Other authorsSee publication
Projects
-
Location Recommendation Tool
- Present
1. Aim to deliver a geographic information analysis and recommendation tool for PNC Bank
2. Mine information from Foursquare and Twitter data
3. Develop a website under Django framework and with spatial database (PostgreSQL + PostGIS)Other creators -
-
Search Engine System
-
1. Implemented a search engine system with multiple retrieval algorithms, such as Boolean Retrieval model, BM25 and Indri, to retrieve Wikipedia documents indexed by LUCENE
2. Optimized the system’s performance through using various techniques, such as multiple representation model, sequential dependency model and pseudo relevance feedback
Languages
-
Chinese
Native or bilingual proficiency
-
English
Full professional proficiency
More activity by Fei
-
It took two years to have this breakthrough: announcing our paper to appear in COLING 2025 Industry Track "(ID: 107) LLM-Friendly Knowledge…
It took two years to have this breakthrough: announcing our paper to appear in COLING 2025 Industry Track "(ID: 107) LLM-Friendly Knowledge…
Liked by Fei Xia
-
Excited to share this as my last project in Waymo! It's been great pleasure to work with a group of talented researchers & engineers to explore the…
Excited to share this as my last project in Waymo! It's been great pleasure to work with a group of talented researchers & engineers to explore the…
Liked by Fei Xia
-
It was a great pleasure to meet industry friends in #recsys24 largerecsys+videorecsys and to share TikTok's effort in building trustworthy…
It was a great pleasure to meet industry friends in #recsys24 largerecsys+videorecsys and to share TikTok's effort in building trustworthy…
Liked by Fei Xia
-
Earlier this month, I joined OpenAI post-training team. Our team not only conduct scientific researches on alignment algorithms (e.g, PPO, etc.), but…
Earlier this month, I joined OpenAI post-training team. Our team not only conduct scientific researches on alignment algorithms (e.g, PPO, etc.), but…
Liked by Fei Xia
-
After almost 8 incredible years at Uber, I’m excited to share that I’m starting a new chapter with Lightspeed Venture Partners as a Senior Staff Data…
After almost 8 incredible years at Uber, I’m excited to share that I’m starting a new chapter with Lightspeed Venture Partners as a Senior Staff Data…
Liked by Fei Xia
-
My talk at the LTI Colloquium titled "From words to molecules: Harnessing Generative AI for Breakthroughs Language and Molecule Design" is online…
My talk at the LTI Colloquium titled "From words to molecules: Harnessing Generative AI for Breakthroughs Language and Molecule Design" is online…
Liked by Fei Xia
-
Last week, I concluded my journey at Waymo—an experience I am incredibly proud of. Reflecting back on January 2020 when I first joined Waymo…
Last week, I concluded my journey at Waymo—an experience I am incredibly proud of. Reflecting back on January 2020 when I first joined Waymo…
Liked by Fei Xia
-
I’m visiting University of Illinois Urbana-Champaign on 9/11. Hope to meet the talented students and professors. Drop me a line if you are around.
I’m visiting University of Illinois Urbana-Champaign on 9/11. Hope to meet the talented students and professors. Drop me a line if you are around.
Liked by Fei Xia
Other similar profiles
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content