Brief Biography
I am an Assistant Professor at the Data Science and Analytics Thrust, Information Hub, The Hong Kong University of Science and Technology (Guangzhou).
I also hold an affiliated position at the Hong Kong University of Science and Technology, the Clear Water Bay campus at Hong Kong.
I received my Ph.D. degree in Computer Science from Tsinghua University in 2023, under the supervision of Prof. Guoliang Li.
My current research interests include AI for Data Analytics (e.g., NL2SQL, TableQA, AI4VIS) and Data-centric AI.
I am actively seeking self-motivated PhD students (Spring/Fall 2025). If you are interested in working with me, please send an email with your CV and transcripts for all degrees.
Preprints or Survey
-
A Survey of NL2SQL with Large Language Models: Where are we, and where are we going?
[ NL2SQL Handbook], [ Slides/PPT (to appear)]
-
AFlow: Automating Agentic Workflow Generation
Jiayi Zhang, Jinyu Xiang, et al. Yuyu Luo*, Chenglin Wu*
-
Generative AI for visualization: State of the art and future directions
Yilin Ye, Jianing Hao, Yihan Hou, Zhan Wang, Shishi Xiao, Yuyu Luo, Wei Zeng
Visual Informatics 2024
Selected Publications
Year 2024
-
The Dawn of Natural Language to SQL: Are We Fully Ready?
Boyan Li, Yuyu Luo*, Chengliang Chai, Guoliang Li, Nan Tang
VLDB 2024. [Homepage]
-
HAIChart: Human and AI Paired Visualization System
Yupeng Xie, Yuyu Luo*, Guoliang Li, Nan Tang
VLDB 2024. [Code]
-
Are Large Language Models Good Statisticians?
Yizhang Zhu, Shiyin Du, Boyan Li, Yuyu Luo*, Nan Tang
NeurIPS 2024 [ Dataset]
-
VerifAI: Verified Generative AI
Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Halevy
CIDR 2024.
-
Data Playwright: Authoring Data Videos with Annotated Narration
Leixian Shen, Haotian Li, Yun Wang, Tianqi Luo, Yuyu Luo, Huamin Qu
TVCG 2024. [Homepage]
-
ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering
Yifan Wu, Lutao Yan, Leixian Shen, Yunhai Wang, Nan Tang, Yuyu Luo*
EMNLP 2024.
[ Dataset]
-
Fast, Robust and Interpretable Participant Contribution Estimation for Federated Learning
Yong Wang, Yuyu Luo, Kaiyu Li, Guoliang Li, Yunyan Guo, Zhuo Wang
ICDE 2024.
-
Mitigating Data Scarcity in Supervised Machine Learning through Reinforcement Learning Guided Data Generation
Chengliang Chai, Kaisen Jin, Nan Tang, Ju Fan, Lianpeng Qiao, Yu-Ping Wang, Yuyu Luo, Ye Yuan, Guoren Wang
ICDE 2024.
-
CoInsight: Visual Storytelling for Hierarchical Tables with Connected Insights
Guozheng Li, Runfei Li, Yunshan Feng, Yu Zhang, Yuyu Luo*, Chi Harold Liu
TVCG 2024.
Year 2023
-
Learned Data-aware Image Representations of Line Charts for Similarity Search
Yuyu Luo, Yihui Zhou, Nan Tang, Guoliang Li, Chengliang Chai, Leixian Shen
SIGMOD 2023. [Slides]
-
GoodCore: Coreset Selection over Incomplete Data for Data-effective and Data-efficient Machine Learning
Chengliang Chai, Jiabin Liu, Nan Tang, Ju Fan, Dongjing Miao, Jiayi Wang, Yuyu Luo, Guoliang Li
SIGMOD 2023. (Best of SIGMOD 2023 Papers) [Slides]
-
Demystifying Artificial Intelligence for Data Preparation
Chengliang Chai, Nan Tang, Ju Fan, Yuyu Luo
SIGMOD 2023. [Tutorial Slides: Part1, Part2, Part3]
Year 2022
-
Steerable Self-driving Data Visualization.
Yuyu Luo, Xuedi Qin, Chengliang Chai, Nan Tang, Guoliang Li, Wenbo Li.
IEEE TKDE 2022.
-
Sevi: Speech-to-Visualization through Neural Machine Translation
Jiawei Tang, Yuyu Luo*, Mourad Ouzzani, Guoliang Li, Hongyang Chen.
ACM SIGMOD 2022 (Demo Track).
-
Data Management for Machine Learning: A Survey
Chengliang Chai, Jiayi Wang, Yuyu Luo*, Zeping Niu, Guoliang Li.
IEEE TKDE 2022.
-
Towards Natural Language Interfaces for Data Visualization: A Survey
Leixian Shen, Enya Shen, Yuyu Luo, Xiaocong Yang, Xuming Hu, Xiongshuai Zhang, Zhiwei Tai, Jianmin Wang.
IEEE TVCG 2022.
-
Selective Data Acquisition in the Wild for Model Charging
Chengliang Chai, Jiabin Liu, Nan Tang, Guoliang Li, Yuyu Luo.
VLDB 2022.
-
Feature Augmentation with Reinforcement Learning
Jiabin Liu, Chengliang Chai, Yuyu Luo, Yin Lou, Jianhua Feng, Nan Tang.
ICDE 2022.
-
RW-Tree: A Learned Workload-aware Framework for R-tree Construction
Haowen Dong, Chengliang Chai, Yuyu Luo, Jiabin Liu, Jianhua Feng, Chaoqun Zhan.
ICDE 2022.
-
Interactively Discovering and Ranking Desired Tuples by Data Exploration
Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani.
The VLDB Journal 2022.
-
GALVIS: Visualization Construction through Example-Powered Declarative Programming.
Leixian Shen, Enya Shen, Zhiwei Tai, Yun Wang, Yuyu Luo, Jianmin Wang.
CIKM 2022 (Best Demo Paper Honorable Mention).
Year 2021
-
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks
Yuyu Luo, Nan Tang, Guoliang Li, Chengliang Chai, Wenbo Li, Xuedi Qin
ACM SIGMOD 2021
[Project Page]
-
Natural Language to Visualization by Neural Machine Translation
Yuyu Luo, Nan Tang, Guoliang Li, Jiawei Tang, Chengliang Chai, Xuedi Qin
IEEE VIS 2021
[Code] [Poster]
-
nvBench: A Large-Scale Synthesized Dataset for Cross-Domain Natural Language to Visualization Task
Yuyu Luo, Jiawei Tang, Guoliang Li
Workshop on NL VIZ 2021 at IEEE VIS 2021
Year 2020
-
DeepTrack: Monitoring and Exploring Spatio-Temporal Data
– A Case of Tracking COVID-19 –
Yuyu Luo, Wenbo Li, Guoliang Li, Nan Tang
VLDB 2020.
-
VisClean: Interactive Cleaning for Progressive Visualization.
Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li.
VLDB 2020.
[Video Demonstration]
-
Interactive Cleaning for Progressive Visualization through Composite Questions.
Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li.
IEEE ICDE 2020.
[Video]
-
Human-in-the-loop Outlier Detection
Chengliang Chai, Lei Cao, Guoliang Li, Jian Li, Yuyu Luo, Samuel Madden.
ACM SIGMOD 2020.
-
Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries.
Xuedi Qin, Chengliang Chai, Yuyu Luo, Nan Tang, Guoliang Li.
ACM SIGMOD 2020. [Video Demonstration]
-
DEEPEYE: A Data Science System for Monitoring and Exploring COVID-19 Data.
Yuyu Luo, Nan Tang, Guoliang Li, Tianyu Zhao, Wenbo Li, Xiang Yu.
IEEE Data Engineering Bulletin, 2020. (invited)
-
CrowdChart: Crowdsourced-based Data Extraction from Visualization Chart.
Chengliang Chai, Guoliang Li, Ju Fan, Yuyu Luo.
IEEE TKDE 2020.
Year 2019
-
Making Data Visualization More Efficient and Effective: A Survey.
Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li.
The VLDB Journal.
-
MathGraph: A Knowledge Graph for Automatically Solving Mathematical Exercises.
Tianyu Zhao, Yan Huang, Songfan Yang, Yuyu Luo, et al.
DASFAA 2019. (Best Paper Award)
Year 2018
-
DeepEye: Towards Automatic Data Visualization.
ICDE 2018 Highly Cited Papers Top-2
Yuyu Luo, Xuedi Qin, Nan Tang, Guoliang Li.
IEEE ICDE 2018.
[DeepEye-APIs (Python3.6)]
-
DeepEye: Creating Good Data Visualizations by Keyword Search (Demo).
Yuyu Luo, Xuedi Qin, Nan Tang, Guoliang Li, Xinran Wang.
ACM SIGMOD 2018.
[Online Demo]
PhD Students
2023
- Tianqi Luo (M.S. from Johns Hopkins University)
- Xinyu Liu (B.S. from Northeastern University, China)
- Yao Shi (co-advised with Nan Tang, B.S. from Univ. of Elec. Sci. and Tech. of China)
2024
- Changlun Li (co-advised with Nan Tang, B.S. from CUHK)
- Jiayi Zhang (from Renmin University of China)
- Boyan Li (from HKUST(GZ))
- Shuyu Shen (from HKUST(GZ))
Selected Awards
- 2023 - Forbes China 30 Under 30 List (入选2023福布斯中国30 Under 30榜单)
- 2023 - CCF Doctoral Dissertation Nomination Award
- 2023 - Best of SIGMOD 2023 Papers
- 2023 - Distinguished Doctoral Dissertation Award of Tsinghua University (清华优秀博士学位论文)
- 2023 - Rising Star in Data Visualization, CSIG VIS
- 2022 - CIKM 2022 Best Paper Honorable Mention (Demo Track)
- 2021 - Zhejiang Lab’s International Talent Fund for Young Professionals
- 2020 - Tsinghua Top Grade Scholarship (清华大学特等奖学金)
(The highest award in the Tsinghua Univ.)
- 2020 - Zhong Shimo Scholarship (钟士模奖学金), Tsinghua University
(The highest award in the Dept. of CST)
- 2020 - China National Scholarship, Ministry of Education of China
- 2019 - Excellent Paper Award – Big Data Mining and Analytics
- 2019 - DASFAA 2019 Best Student Paper Award
Professional Services
- Session Chair: VLDB 2024
- PC Member: VLDB 2023-2025, ICDE 2024-2025, ICLR 2025
- Conference Reviewer: IEEE VIS 2021-2024, CHI 2022-2025
- Journal Reviewer: ACM Transactions on Database Systems, TVCG, ACM/IMS TDS, Data Science and Engineering
- Conference Volunteer: SIGMOD 2021/2023