Wensheng Gan


Ph.D. Assistant Professor

School of Computer Science and Technology
Harbin Institute of Technology (Shenzhen), Shenzhen, China

Office: L1714, Research Building
Email:  wsgan001 at gmail dot com


About me

Dr. Vincent W. Gan (甘文生) is currently an Assistant Professor of computer science. He was a visiting scholar at the Department of Computer Science, University of Illinois at Chicago, IL, USA, under the supervision of Prof. Philip S. Yu (ACM Fellow and IEEE Fellow) from September 2017 to March 2019. He received the M.S. and Ph.D. degrees in computer science from Harbin Institute of Technology (Shenzhen), Shenzhen, China, in January 2016 and December 2019, respectively, under the supervision of Prof. Jerry Chun-Wei Lin (IET Fellow). He received the B.S. degree in computer science from South China Normal University, Guangdong, China, in 2013. His research interests include data mining, utility mining, security and privacy preservation, fuzzy theory, and big data technologies. He has published more than 50 research papers in peer-reviewed journals and international conferences, which had received more than 700 citations as of 2019/12/31.

Research

News!!! I will release all code and datasets for my papers. If you are interested in any of them, please feel free to contact me (wsgan001@gmail.com).

My principal research interest is Data Science and Engineering (DSE), e.g., big data mining on large-scale information and social data. More generally, I work on data mining, big data analysis, statistics, machine learning, and network science, with a focus on modeling novel problems and proposing scalable algorithms for large-scale, real-world applications, including but not limited to pattern mining, utility mining, complex sequence processing, graph data mining, utility computing with different learning models, and intelligent systems with big data.

In particular, I have extended conventional studies of Utility Mining: Theory, Techniques, and Applications. I have proposed a series of new models and algorithms to capture and predict high-utility patterns and knowledge in different types of data, and have also studied how to exploit both the different patterns of information diffusion and the utility properties of patterns (i.e., itemsets, rules, sequences, episodes, and graphs) to better infer the hidden structure and knowledge in rich data.

My recent research focuses on utility mining and computation with multi-source data, uncertain data, and complex events.

News

  • 2019.10: The 2nd International Workshop on Utility-driven Mining (UDM 2019) in conjunction with ICDM 2019, Beijing, China.
  • 2019.9: The special issue on Utility-driven Mining (submission details) in the SCI journal IEEE Access (SCI, IF:4.02, JCR Q2).
  • 2018.7: The 1st International Workshop on Utility-driven Mining (UDM 2018) in conjunction with KDD 2018, London, UK.
  • 2018.4: A tutorial on Utility-driven Pattern Mining. Download slide decks for free!

Research Topics

A. Data/Utility mining on transaction data

B. Data/Utility mining on sequence data

C. Data/Utility mining on complex event sequences

Publications

[DBLP] [Google Scholar]

Survey Papers

  1. Wensheng Gan et al., “A Survey of Utility-Oriented Pattern Mining,” IEEE TKDE or arXiv 2018 [PDF].
  2. Wensheng Gan et al., “A Survey of Parallel Sequential Pattern Mining,” ACM TKDD or arXiv 2018 [PDF].
  3. Wensheng Gan et al., “Privacy Preserving Utility Mining: A Survey,” IEEE BigData, pp. 2617-2626, 2018 [PDF].
  4. Wensheng Gan et al., “A Survey of Incremental High‐Utility Itemset Mining,” Wiley DMKD, 8(2), 2018 [PDF].
  5. Wensheng Gan et al., “Data Mining in Distributed Environment: A Survey,” Wiley DMKD, 7(6), 2018 [PDF].

Journals

  1. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Vincent S. Tseng, and Philip S. Yu, “A Survey of Utility-Oriented Pattern Mining,” IEEE Transactions on Knowledge and Data Engineering, early access, 2019. (SCI, IF:3.438, JCR Q1) [PDF]
  2. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, and Philip S. Yu, “A Survey of Parallel Sequential Pattern Mining,” ACM Transactions on Knowledge Discovery from Data, 13(3): 25:1-25:34, 2019. (SCI, IF:2.50, JCR Q2) [PDF]
  3. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, and Philip S. Yu, “HUOPM: High Utility Occupancy Pattern Mining,” IEEE Transactions on Cybernetics, early access, 2019. (SCI, IF:10.387, JCR Q1) [PDF] [Code & Datasets]
  4. Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, Philippe Fournier-Viger, Han-Chieh Chao, and Philip S. Yu, “Fast Utility Mining on Sequence Data,” IEEE Transactions on Cybernetics, early access, 2020. (SCI, IF:10.387, JCR Q1) [PDF] [Code & Datasets]
  5. Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, and Philip S. Yu, “Utility Mining Across Multi-Sequences with Individualized Thresholds,” ACM Transactions on Data Science, 2019. [PDF] [Code & Datasets]
  6. Wensheng Gan, Jerry Chun-Wei Lin, Han-Chieh Chao, Athanasios V. Vasilakos, and Philip S. Yu, “Utility-driven Data Analytics on Uncertain Data,” IEEE Systems Journal, 2019. (SCI, IF:4.324, JCR Q1) [PDF] [Code & Datasets]
  7. Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, Han-Chieh Chao, Hamido Fujita, and Philip S. Yu, “ProUM: Projection-based Utility Mining on Sequence Data,” Information Sciences, online, 2019. (SCI, IF:5.524, JCR Q1) [PDF] [Code & Datasets]
  8. Wensheng Gan, Jerry Chun-Wei Lin, Han-Chieh Chao, Hamido Fujita, and Philip S. Yu, “Utility-based Correlated Pattern Mining,” Information Sciences, vol. 504, pp. 470-486, 2019. (SCI, IF:5.524, JCR Q1) [PDF] [Code & Datasets]
  9. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Tzung-Pei Hong, and Hamido Fujita, “A Survey of Incremental High-Utility Itemset Mining,” Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery, vol. 8(2), 2018 (SCI, IF:2.541, JCR Q1/Q2) [PDF]
  10. Wensheng Gan, Jerry Chun-Wei Lin, Han-Chieh Chao, and Justin Zhan, “Data Mining in Distributed Environment: A Survey,” Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery, vol. 7(6), 2017 (SCI, IF:2.541, JCR Q1/Q2) [PDF]
  11. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Justin Zhan, and Ji Zhang, “Exploiting Highly Qualified Pattern with Frequency and Weight Occupancy,” Knowledge and Information Systems, vol. 56(1), pp. 165-196, 2018 (SCI, IF:2.397, JCR Q2) [PDF] [Code & Datasets]
  12. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Justin Zhan, and Ji Zhang, “Extracting Non-Redundant Correlated Purchase Behaviors by Utility Measure,” Knowledge-Based Systems, vol. 143, pp. 30-41, 2018 (SCI, IF:5.101, JCR Q1) [PDF] [Code & Datasets]
  13. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Jimmy Ming-Thai Wu, and Justin Zhan, “Extracting Recent Weighted-based Patterns from Uncertain Temporal Databases,” Engineering Applications of Artificial Intelligence, vol. 61, pp. 161-172, 2017 (SCI, IF:2.894, JCR Q1) [PDF] [Code & Datasets]
  14. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, and Justin Zhan, “Mining of Frequent Patterns with Multiple Minimum Supports,” Engineering Applications of Artificial Intelligence, vol. 60, pp. 83-96, 2017 (SCI, IF:2.894, JCR Q1) [PDF] [Code & Datasets]
  15. Jerry Chun-Wei Lin, Wensheng Gan, Tzung-Pei Hong, and Vincent S. Tseng, “Efficient algorithms for mining up-to-date high-utility patterns,” Advanced Engineering Informatics, vol. 29(3), pp. 648-661, 2016 (SCI, IF:2.00, JCR Q1) [PDF] [Code & Datasets]
  16. Jerry Chun-Wei Lin, Wensheng Gan, Philippe Fournier-Viger, Tzung-Pei Hong, and Vincent S. Tseng, “Efficient algorithms for mining high-utility itemsets in uncertain databases,” Knowledge-Based Systems, vol. 96, pp. 171-187, 2016 (SCI, IF:4.529, JCR Q1) [PDF] [Code & Datasets]
  17. Jerry Chun-Wei Lin, Wensheng Gan, Philippe Fournier-Viger, Tzung-Pei Hong, and Vincent S. Tseng, “Fast algorithms for mining high-utility itemsets with various discount strategies,” Advanced Engineering Informatics, vol. 30(2), pp. 109-126, 2016 (SCI, IF:2.00, JCR Q1) [PDF] [Code & Datasets]

Conferences

  1. Wensheng Gan, Jerry Chun-Wei Lin, Han-Chieh Chao, Hamido Fujita, and Philip S. Yu, “ProUM: High Utility Sequential Pattern Mining,” IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 777-783, 2019. (EI) [PDF]
  2. Wensheng Gan, Jerry Chun-Wei Lin, Han-Chieh Chao, Shyue-Liang Wang, and Philip S. Yu, “Privacy Preserving Utility Mining: A Survey,” IEEE International Conference on Big Data (Big Data), pp. 2617-2626, 2018 (EI) [PDF]
  3. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, and Han-Chieh Chao, “Exploiting High Utility Occupancy Patterns,” APWeb-WAIM, pp. 239-247, 2017. (EI) [PDF]
  4. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, and Han-Chieh Chao, “Extracting Non-Redundant Correlated High Utility Purchase Behaviors by Utility Measure,” The 18th International Conference on Big Data Analytics and Knowledge Discovery (DaWaK), pp. 433-446, 2017. (EI) [PDF]
  5. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, and Vincent S. Tseng, “Mining High-Utility Itemsets with Both Positive and Negative Unit Profits from Uncertain Databases,” The 21st Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pp. 434-446, 2017. (EI) [PDF]
  6. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, and Han-Chieh Chao, “More Efficient Algorithms for Mining High-Utility Itemsets with Multiple Minimum Utility Thresholds,” The 27th International Conference on Database and Expert Systems Applications (DEXA), pp. 71-87, 2016. (EI) [PDF]
  7. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, and Han-Chieh Chao, “Mining Recent High-Utility Patterns from Temporal Databases with Time-Sensitive Constraint,” The 17th International Conference on Big Data Analytics and Knowledge Discovery (DaWaK), pp. 3-18, 2016. (EI) [PDF]
  8. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, and Han-Chieh Chao, “Mining Recent High Expected Weighted Itemset from Uncertain Databases,” The 18th Asia Pacific Web Conference (APWeb), pp. 581-593, 2016. (EI) [PDF]
  9. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, and Han-Chieh Chao, “More Efficient Algorithm for Mining Frequent Patterns with Multiple Minimum Supports,” The 17th International Conference on Web-Age Information Management (WAIM), pp. 3-16, 2016. (EI) [PDF]

Professional Services

Publish Editor

TPC Member & Reviewer

  • Utility-Driven Mining Workshop (UDM 2018) with SIGKDD, 2018
  • Utility-Driven Mining Workshop (UDM 2019) with IEEE ICDM, 2019
  • International Conference ICGEC 2017, 2018, 2019
  • International Conference Data Analytics 2017, 2018, 2019
  • IADIS International Conference on Information Systems 2018, 2019
  • International Conference GraphSM 2018, 2019
  • ASPAI 2019

Journal Reviewer

  • ACM Transactions on Knowledge Discovery from Data (TKDD, SCI, IF:2.538, top journal)
  • IEEE Transactions on Cybernetics (TCYB, SCI, IF:10.387, JCR Q1)
  • IEEE Transactions on Neural Networks and Learning Systems (TNNLS, SCI, IF:12.05, JCR Q1)
  • IEEE Transactions on Industrial Electronics (TIE, SCI, IF:6.30, JCR Q1)
  • IEEE Journal of Biomedical and Health Informatics (JBHI, SCI, IF:3.850, JCR Q1)
  • Knowledge Based Systems (KBS, SCI, IF:5.10, JCR Q1, CCF C)
  • Pattern Recognition (SCI, IF:4.582, JCR Q1, CCF B)
  • Future Generation Computer Systems (FGCS, SCI, IF:5.768, JCR Q1, CCF C)
  • Applied Intelligence (APIN, SCI, IF:2.882, JCR Q2, CCF C)
  • IEEE Access (SCI, IF:4.000, JCR Q1)
  • World Wide Web Journal (SCI, IF:1.771, JCR Q3, CCF B)
  • Frontiers of Computer Science (SCI, IF:1.129, JCR Q3)

Awards

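  • 2017.05 The 4th "Li Chang Award for Outstanding Students," Harbin Institute of Technology (the university's highest honor, 1/5)
  • 2017.11 National Scholarship for Doctoral Students, Ministry of Education of China
  • 2018.04 The 3rd "Chunhui Innovation Achievement Award," Harbin Institute of Technology
  • 2016.01 Nomination Award, the 7th "Top Ten Outstanding Graduate Students," Harbin Institute of Technology (1/20)
  • 2015.10 IEEE SMC international conference student grant award (1/10)
  • 2015.08 The 17th APWeb international conference student grant award (1/20)
  • 2015.12 Outstanding Graduate of Harbin Institute of Technology (Gold Award)
  • 2015.12 Outstanding Graduation Thesis Award, Harbin Institute of Technology
  • 2015.04 Merit Student of Heilongjiang Province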

Other information

Don't worry about people stealing an idea. If it's original, you will have to ram it down their throats. - Howard Aiken

Science without religion is lame. Religion without science is blind. - Albert Einstein

Your time is limited, so don't waste it living someone else's life. - Steve Jobs

Copyright © 2013~2019 Wensheng Gan