Research Interests
Databases, data mining, and data science
Algorithmic, stochastic, AI and machine learning techniques for data science and data analytics, with topics including data streams and sequence mining, machine learning models for data processing, graphs and social networks, Internet of Things, and data security and privacy.
Education
- PhD: Computer Science, (2009), Brown University - Providence, Rhode Island
- MS: Computer Science, (2006), Brown University - Providence, Rhode Island
- MS: Computer Science, (1998), University of California, Davis - Davis, California
- BS: Computer Science and Engineering, (1994), Tsinghua University - Beijing, China
Selected Awards and Honors
- CAREER Award (2012), Scholarship/Research - National Science Foundation
Selected Publications
- Liu, X., Ge, T., Wu, Y. (2019). Finding Densest Lasting Subgraphs in Dynamic Graphs: A Stochastic Approach (pp. 782–793).
- Song, C., Ge, T., Ge, Y., Zhang, H., Yuan, X. (2019). Labeled graph sketches: Keeping up with real-time graph streams. Information Sciences, 503 469–492.
- Song, C., Ge, T. (2019). New Progress in Data Stream and Time Series Analytics. Higher Education Press
- Song, C., Liu, X., Ge, T., Ge, Y. (2019). Top-k frequent items and item frequency tracking over sliding windows of any size. Information Sciences, 475 100–120.
- Ge, T., Li, Y., Chen, C. (2018). Complex Event and Pattern Models in Sequence Data Processing (3:). Higher Education Press
- Song, C., Ge, T. (2018). Labeled Graph Sketches (pp. 1312–1315).
- Tang, B., Tang, H., Dong, X., Jin, B., Ge, T. (2018). On Real-time Detecting Passenger Flow Anomalies (pp. 1053–1062).
- Cui, Y., Jin, B., Zhang, F., Ge, T. (2018). Towards Adaptive Sensory Data Fusion for Detecting Highway Traffic Conditions in Real Time (pp. 336–352).
- Li, Y., Ge, T., Chen, C. (2018). VTeller: Telling the Values Somewhere, Sometime in a Dynamic Network of Urban Systems (pp. 577–586).
- Wan, L., Ge, T. (2018). When Optimizer Chooses Table Scans: How to Make Them More Responsive (pp. 1333–1342).
- Wan, L., Ge, T. (2016). Event Regularity and Irregularity in a Time Unit. 2016 IEEE 32nd International Conference on Data Engineering (ICDE), 930.
- Su, J., Inalpolat, M., Ge, T., Esmaeilzadeh, H., Sun, H. (2016). Experimental Study and Analysis of Dropwise Condensation Using Quartz Crystal Microbalance (Vol. 1). ASME 2016 Heat Transfer Summer Conference, HT 2016, collocated with the ASME 2016 Fluids Engineering Division Summer Meeting and the ASME 2016 14th International Conference on Nanochannels, Microchannels, and Minichannels
- Li, Z., Ge, T. (2016). History Is a Mirror to the Future: Best-Effort Approximate Complex Event Matching with Insufficient Resources. Proceedings of the VLDB Endowment, 10(4) 397–408.
- Li, Z., Ge, T. (2016). Stochastic Data Acquisition for Answering Queries as Time Goes By. Proceedings of the VLDB Endowment, 10(3) 277–288.
- Zhang, F., Jin, B., Ge, T., Ji, Q., Cui, Y. (2016). Who are My Familiar Strangers? Conference on Information & Knowledge Management, 619.
- Li, Z., Ge, T. (2015). PIE: Approximate Interleaving Event Matching over Sequences.
- Ma, Y., Olendzki, B.C., Wang, J., Persuitte, G.M., Li, W., Fang, H., Merriam, P.A., Wedick, N.M., Ockene, I.S., Culver, A.L., Ge, T. (2015). Single-Component Versus Multicomponent Dietary Goals for the Metabolic Syndrome: A Randomized Trial. Annals of internal medicine, 162(4) 248–257.
- Song, C., Ge, T., Chen, C., Wang, J. (2015). Soft Quorums: A High Availability Solution for Service Oriented Stream Systems (pp. 778–779).
- Song, C., Ge, T. (2015). Window-Chained Longest Common Subsequence: Common Event Matching in Sequences.
- Song, C., Ge, T. (2014). Aroma: A New Data Protection Method with Differential Privacy and Accurate Answering. 23rd ACM International Conference on Information and Knowledge Management
- Song, C., Ge, T., Chen, C., Wang, J. (2014). Event Pattern Matching over Graph Streams. Proceedings of the VLDB Endowment, 8(4) 413–424.
- Kurunji, S., Ge, T., Fu, X., Liu, B., Kumar, A., Chen, C. (2014). Optimizing Aggregate Query Processing in Cloud Data Warehouses (pp. 1-12). The International Conference on Data Management in Cloud, Grid and P2P Systems
- Song, C., Ge, T. (2013). Discovering and Managing Quantitative Association Rules (pp. 2429–2434).
- Kurunji, S., Ge, T., Fu, X., Liu, B., Chen, C. (2013). Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses. International Journal of Grid and High Performance Computing (IJGHPC), 5(4) 113–130.
- Wang, J., Lu, J., Fang, Z., Ge, T., Chen, C. (2013). Pl-Tree: An Efficient Indexing Method for High-Dimensional Data (pp. 183–200). Springer
- Song, C., Li, Z., Ge, T., Wang, J. (2013). Query Execution Timing: Taming Real-Time Anytime Queries on Multicore Processors (pp. 2237–2242).
- Ge, T., Dekhtyar, A., Goldsmith, J. (2013). Uncertain Data: Representations, Query Processing, and Applications (304: pp. 67-108). Springer-Verlag
- Li, Z., Ge, T., Chen, C. (2013). ε-Matching: Event Processing over Noisy Sequences in Real Time (pp. 601-612). Proceedings of the ACM SIGMOD International Conference on Management of Data
- Song, C., Li, Z., Ge, T. (2013). Top-k Oracle: A New Way to Present Top-K Tuples for Uncertain Data (pp. 146-157). Proceedings - International Conference on Data Engineering
- Singhal, M., Chandrasekhar, S., Ge, T., Sandhu, R., Krishnan, R., Ahn, G., Bertino, E. (2013). Collaboration in Multicloud Computing Environments: Framework and Security Issues. Computer, 46(2) 76-84.
- Chen, C., Kurunji, S., Ge, T., Liu, B. (2012). Communication Cost Optimization for Cloud Data Warehouse Queries. IEEE CloudCom
- Ge, T., Liu, F. (2012). Accuracy-Aware Uncertain Stream Databases (pp. 174–185).
- Kurunji, S., Ge, T., Chen, C. (2012). Multi-Join Query Optimization for Read-Optimized Data Warehouse in a Cloud Environment. Citeseer.
- Li, Z., Ge, T. (2012). Online Windowed Subsequence Matching over Probabilistic Sequences (pp. 277–288).
- Ge, T., Dekhtyar, A., Goldsmith, J. (2012). Uncertain Data: Representations, Query Processing, and Applications.
- Ge, T., Li, Z. (2011). Approximate substring matching over uncertain strings. Proceedings of the VLDB Endowment, 4(11).
- Ge, T. (2011). Join Queries on Uncertain Data: Semantics and Efficient Processing (pp. 697–708).
- Ge, T., Grabiner, D., Zdonik, S. (2011). Monte Carlo Query Processing of Uncertain Multidimensional Array Data (pp. 936–947).
- Ge, T., Zdonik, S. (2010). A*-tree: A Structure for Storage and Modeling of Uncertain Multidimensional Arrays. Proceedings of the VLDB Endowment, 3(1-2) 964-974.
- Ge, T., Zdonik, S. (2009). Light-weight, Runtime Verification of Query Sources (pp. 30–41).
- Ge, T. (2009). Query Processing on Uncertain Data. Brown University
- Ge, T., Zdonik, S., Madden, S. (2009). Top-k Queries on Uncertain Data: On Score Distribution and Typical Answers (pp. 375–388).
- Ge, T., Zdonik, S. (2008). A Skip-List Approach for Efficiently Processing Forecasting Queries. Proceedings of the VLDB Endowment, 1(1) 984-995.
- Ge, T., Zdonik, S. (2008). Handling Uncertain Data in Array Database Systems (pp. 1140–1149).
- Ge, T., Zdonik, S. (2007). Answering Aggregation Queries in a Secure System Model (pp. 519–530).
- Ge, T., Zdonik, S. (2007). Fast, Secure Encryption for Indexing in a Column-Oriented DBMS (pp. 676–685).
- Keen, A.W., Ge, T., Maris, J.T., Olsson, R.A. (2004). JR: Flexible Distributed Programming in an Extended Java. ACM Transactions on Programming Languages and Systems (TOPLAS), 26(3) 578-608.
- Olsson, R.A., Benson, G.D., Ge, T., Keen, A.W. (2002). Fairness in Shared Invocation Servicing. Computer Languages, Systems & Structures, 28(4) 327-351.
Selected Presentations
- Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data - 29th International Conference on Data Engineering, April 2013 - Brisbane, Australia
- Communication Cost Optimization for Cloud Data Warehouse Queries - 4th IEEE International Conference on Cloud Computing Technology and Science, December 2012 - Taipei, Taiwan
- - UMass Amherst's Database Seminar Series, October 2012 - UMass Amherst
- - UMass Boston's Computer Science Department Colloquium Series, October 2012 - UMass Boston
- Online Windowed Subsequence Matching over Probabilistic Sequences - ACM SIGMOD International Conference on Management of Data, May 2012 - Scottsdale, Arizona
- - MIT's Database Seminar, May 2012 - MIT
- Accuracy-Aware Uncertain Stream Databases - 28th International Conference on Data Engineering, April 2012 - Washington, DC
- Join Queries on Uncertain Data: Semantics and Efficient Processing - 27th International Conference on Data Engineering, April 2011 - Hannover, Germany
- Monte Carlo Query Processing of Uncertain Multidimensional Array Data - 27th International Conference on Data Engineering, April 2011 - Hannover, Germany
- Top-k Queries on Uncertain Data: On Score Distribution and Typical Answers - ACM SIGMOD International Conference on Management of Data, June 2009 - Providence, RI
- Light-weight, Runtime Verification of Query Sources - 25th International Conference on Data Engineering, March 2009 - Shanghai, China
- Handling Uncertain Data in Array Database Systems - 24th International Conference on Data Engineering, April 2008 - Cancun, Mexico
- Answering Aggregation Queries in a Secure System Model - 33rd International Conference on Very Large Data Bases, September 2007 - Vienna, Austria
- Fast, Secure Encryption for Indexing in a Column-Oriented DBMS - 23rd International Conference on Data Engineering, April 2007 - Istanbul, Turkey
- One Size Fits All? - Part 2: Benchmarking Results - 3rd Biennial Conference on Innovative Database Systems, January 2007 - Asilomar, California
- JR: Flexible Distributed Programming in an Extended Java - 21st IEEE International Conference on Distributed Computing Systems, April 2001 - Phoenix, Arizona
Selected Contracts, Fellowships, Grants and Sponsored Research
- CAREER: MUSE: An integrated Approach to Managing Uncertain Scientific Experimenta (2012), Grant - Ge, T. (Principal)
- III:Core:Small:QUEST: An Integrated Query and Event System on Noisy Streams and T (2013), Grant - Ge, T. (Principal)
- III: Small: Rural: Querying Rich Uncertain Data in Real Time (2012), Grant - Ge, T. (Principal)