publications
2026
2025
-
[SOSP] ORQ: Complex Analytics on Private Data with Strong Security Guarantees. In Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles (SOSP’25), 2025
2024
-
[DaMoN] In situ neighborhood sampling for large-scale GNN training. In Proceedings of the 20th International Workshop on Data Management on New Hardware, 2024
2023
2022
-
[DEEM] Evaluating model serving strategies over streaming data. In Proceedings of the Sixth Workshop on Data Management for End-To-End Machine Learning, 2022
2021
2020
-
[HotStorage] In support of workload-aware streaming state management. In 12th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 20), 2020
2019
-
[BIRTE] FASTER State Management for Timely Dataflow. In Proceedings of Real-Time Business Intelligence and Analytics, 2019
2018
2016
2014
-
[GRADES-NDA] Asymmetry in large-scale graph analysis, explained. In Proceedings of Workshop on GRAph Data management Experiences and Systems, 2014
2013
-
[EuroPar] Ponic: Using Stratosphere to speed up Pig analytics. In European Conference on Parallel Processing, 2013
-
Block Sampling: Efficient Accurate Online Aggregation in MapReduce. In 5th IEEE International Conference on Cloud Computing Technology and Science (CloudCom) 2013, 2013
-
[Survey] MapReduce: Limitations, Optimizations and Open Issues. In 11th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA-13), 2013
-
m2r2: A Framework for Results Materialization and Reuse in High-Level Dataflow Systems for Big Data. In 2nd International Conference on Big Data Science and Engineering (BDSE 2013), 2013