Data Mining, Databases, and Geographical Information Systems

CS&E makes big showing at IEEE BigData 2021
Seven papers from Computer Science & Engineering faculty, students, and alumni were accepted to the top tier research conference in Big Data.

University of Minnesota is part of $25M AI-based climate modeling center
The new NSF-funded Learning the Earth with Artificial Intelligence and Physics (LEAP) center will leverage big data and machine learning to improve climate projections.

New machine learning methods could improve environmental predictions
Vipin Kumar is on the team that recently published a new study on predicting flow and temperature in river networks.

Cloud Connected Delivery Vehicles project receives award from CTS
CS&E professor Shashi Shekhar and Ph.D. student Yan Li were on the research team that received the 2021 Robert C. Johns Research Partnership Award.

Karypis wins PAKDD Distinguished Contributions Award
This is the highest award presented by the PAKDD, and Karypis was selected for his outstanding contributions to the field of knowledge discovery and data mining.

An epidemiology-inspired model for false information mitigation in social networks
Recent Ph.D. grad Bhavtosh Rath’s final dissertation focused on the very relevant issue of ‘fake news’ spreading.

Mokbel named 2020 IEEE Fellow
He is being recognized with this esteemed award for his contributions to building spatially- and privacy-aware systems.
Research in this area explores efficient storage, retrieval, analysis, and visualization of data for analysis and pattern discovery. This encompasses a wide range of topics including improved indexing and query languages, data compression, multimedia storage and retrieval, data clustering, pattern matching, and high-dimensional data modeling. Specific research thrusts in the department include geographical databases, mapping models, anomaly and pattern detection, query processing, and spatial and scientific data mining for application domains like bioinformatics, cyber security, sensor networks, transportation, and the Web. This group has extensive connections with industry and national labs, providing a rich source of problems, datasets, and experience for students.
Faculty








Labs and selected projects
- Data Mining George Karypis
- Database and Spatial Data Mining Research Group Shashi Shekhar
- Discovery of Patterns in the Global Climate System using Data Mining George Karypis, Vipin Kumar, Shashi Shekhar
- Karypis Lab George Karypis
- Unsupervised Document Set Exploration Using Divisive Partitioning Dan Boley
Related centers and programs
Latest research projects, publications, and talks

MC-DGCNN: A Novel DNN Architecture for Multi-Category Point Set Classification [preprint]
Posted December 22, 2021
Majid Farhadloo (Ph.D. student), Carl Molnar (M.S. student), Gaoxiang Luo (undergraduate research assistant), Yan Li (Ph.D. student), Shashi Shekhar (professor), Rachel L Maus, Svetomir N Markovic, Raymond Moore, Alexey Leontovich

A Label Correction Algorithm Using Prior Information for Automatic and Accurate Geospatial Object Recognition [conference paper]
Posted December 15, 2021
Weiwei Duan, Yao-Yi Chiang (associate professor), Stefan Leyk, Johannes H. Uhl, Craig A. Knoblock
IEEE International Conference on Big Data (IEEE BigData)

Guided Generative Models using Weak Supervision for Detecting Object Spatial Arrangement in Overhead Images [conference paper]
Posted December 15, 2021
Weiwei Duan, Yao-Yi Chiang (associate professor), Stefan Leyk, Johannes H. Uhl, Craig A. Knoblock
IEEE International Conference on Big Data (IEEE BigData)

Spatial Variability Aware Deep Neural Networks (SVANN): A General Approach [journal]
Posted November 30, 2021
Jayant Gupta (Ph.D. student), Carl Molnar (M.S. student), Yiqun Xie (Ph.D. 2020), Joe Knight, Shashi Shekhar (professor)
ACM Transactions on Intelligent Systems and Technology (TIST)

Significant DBSCAN+: Statistically Robust Density-based Clustering [journal]
Posted November 24, 2021
Yiqun Xie (Ph.D. 2020), Xiaowei Jia (Ph.D. 2020), Shashi Shekhar (professor), Han Bao, Xun Zhou (Ph.D. 2014)
ACM Transactions on Intelligent Systems and Technology (TIST)

Strategies for building robust prediction models using data unavailable at prediction time [journal]
Posted November 19, 2021
Haoyu Yang (Ph.D. student), Roshan Tourani, Ying Zhu, Vipin Kumar (professor), Genevieve B Melton, Michael Steinbach (researcher), Gyorgy Simon
Journal of the American Medical Informatics Association

A Data-Driven Intervention Framework for Improving Adherence to Growth Hormone Therapy Based on Clustering Analysis and Traffic Light Alerting Systems [journal]
Posted November 18, 2021
Matheus Araújo (Ph.D. student), Paula van Dommelen, Jaideep Srivastava (professor), Ekaterina Koledova
Studies in Health Technology and Informatics

Satellite image classification across multiple resolutions and time using ordering constraint among instances [patent]
Posted November 18, 2021
Ankush Khandelwal (Ph.D. 2019), Anuj Karpatne (Ph.D. 2017), Vipin Kumar (professor)

Hierarchical clustering by aggregating representatives in sub-minimum-spanning-trees [preprint]
Posted November 11, 2021
Wen-Bo Xie, Zhen Liu, Jaideep Srivastava (professor)

SRC: Incorporating Geographic Information for Building a Location-based Recommendation System [conference paper]
Posted November 2, 2021
Yuankun Jiao, Yao-Yi Chiang (associate professor)
ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL)
More About Research areas
- Architectures, Compiler Optimization, and Embedded Systems
- Bioinformatics and Computational Biology
- Graphics and Immersive Computing
- High Performance Computing
- Human Computer Interaction (HCI)
- Networks, Distributed Systems, and Security
- Robotics and Artificial Intelligence
- Software Engineering and Programming Languages
- Theoretical Foundations