SIGMOD accepted papers

SIGMOD '17- Proceedings of the 2017 ACM International Conference on Management of Data

Full Citation in the ACM Digital Library

SESSION: Keynote Session - Grand Challenges in Data Management: Transactions

The Next 700 Transaction Processing Engines
  • Anastasia Ailamaki
What Are We Doing With Our Lives?: Nobody Cares About Our Concurrency Control Research
  • Andrew Pavlo

SESSION: SIGMOD Session 1. Concurrency (1)

ACIDRain: Concurrency-Related Attacks on Database-Backed Web Applications
  • Todd Warszawski
  • Peter Bailis
Cicada: Dependably Fast Multi-Core In-Memory Transactions
  • Hyeontaek Lim
  • Michael Kaminsky
  • David G. Andersen
BatchDB: Efficient Isolated Execution of Hybrid OLTP+OLAP Workloads for Interactive Applications
  • Darko Makreshanski
  • Jana Giceva
  • Claude Barthels
  • Gustavo Alonso

SESSION: SIGMOD Session 2. Storage and Distribution (1)

Azure Data Lake Store: A Hyperscale Distributed File Service for Big Data Analytics
  • Raghu Ramakrishnan
  • Baskar Sridharan
  • John R. Douceur
  • Pavan Kasturi
  • Balaji Krishnamachari-Sampath
  • Karthick Krishnamoorthy
  • Peng Li
  • Mitica Manu
  • Spiro Michaylov
  • Rogério Ramos
  • Neil Sharman
  • Zee Xu
  • Youssef Barakat
  • Chris Douglas
  • Richard Draves
  • Shrikant S. Naidu
  • Shankar Shastry
  • Atul Sikaria
  • Simon Sun
  • Ramarathnam Venkatesan
OctopusFS: A Distributed File System with Tiered Storage Management
  • Elena Kakoulli
  • Herodotos Herodotou
Monkey: Optimal Navigable Key-Value Store
  • Niv Dayan
  • Manos Athanassoulis
  • Stratos Idreos

SESSION: SIGMOD Session 3. Streams

Enabling Signal Processing over Data Streams
  • Milos Nikolic
  • Badrish Chandramouli
  • Jonathan Goldstein
Complete Event Trend Detection in High-Rate Event Streams
  • Olga Poppe
  • Chuan Lei
  • Salah Ahmed
  • Elke A. Rundensteiner
LittleTable: A Time-Series Database and Its Uses
  • Sean Rhea
  • Eric Wang
  • Edmund Wong
  • Ethan Atkins
  • Nat Storer

SESSION: SIGMOD Session 4. Versions and Incremental Maintenance

Incremental View Maintenance over Array Data
  • Weijie Zhao
  • Florin Rusu
  • Bin Dong
  • Kesheng Wu
  • Peter Nugent
Incremental Graph Computations: Doable and Undoable
  • Wenfei Fan
  • Chunming Hu
  • Chao Tian
DEX: Query Execution in a Delta-based Storage System
  • Amit Chavan
  • Amol Deshpande

SESSION: SIGMOD Session 5. Parallel and Distributed Query Processing (1)

Massively Parallel Processing of Whole Genome Sequence Data: An In-Depth Performance Study
  • Abhishek Roy
  • Yanlei Diao
  • Uday Evani
  • Avinash Abhyankar
  • Clinton Howarth
  • Rémi Le Priol
  • Toby Bloom
Distributed Provenance Compression
  • Chen Chen
  • Harshal Tushar Lehri
  • Lay Kuan Loh
  • Anupam Alur
  • Limin Jia
  • Boon Thau Loo
  • Wenchao Zhou
ROBUS: Fair Cache Allocation for Data-parallel Workloads
  • Mayuresh Kunjir
  • Brandon Fain
  • Kamesh Munagala
  • Shivnath Babu

SESSION: SIGMOD Session 6. Concurrency (2)

Transaction Repair for Multi-Version Concurrency Control
  • Mohammad Dashti
  • Sachin Basil John
  • Amir Shaikhha
  • Christoph Koch
Concerto: A High Concurrency Key-Value Store with Integrity
  • Arvind Arasu
  • Ken Eguro
  • Raghav Kaushik
  • Donald Kossmann
  • Pingfan Meng
  • Vineet Pandey
  • Ravi Ramamurthy
Fast Failure Recovery for Main-Memory DBMSs on Multicores
  • Yingjun Wu
  • Wentian Guo
  • Chee-Yong Chan
  • Kian-Lee Tan
Bringing Modular Concurrency Control to the Next Level
  • Chunzhi Su
  • Natacha Crooks
  • Cong Ding
  • Lorenzo Alvisi
  • Chao Xie

SESSION: SIGMOD Session 7. Storage and Distribution (2)

Wide Table Layout Optimization based on Column Ordering and Duplication
  • Haoqiong Bian
  • Ying Yan
  • Wenbo Tao
  • Liang Jeff Chen
  • Yueguo Chen
  • Xiaoyong Du
  • Thomas Moscibroda
Query Centric Partitioning and Allocation for Partially Replicated Database Systems
  • Tilmann Rabl
  • Hans-Arno Jacobsen
Spanner: Becoming a SQL System
  • David F. Bacon
  • Nathan Bales
  • Nico Bruno
  • Brian F. Cooper
  • Adam Dickinson
  • Andrew Fikes
  • Campbell Fraser
  • Andrey Gubarev
  • Milind Joshi
  • Eugene Kogan
  • Alexander Lloyd
  • Sergey Melnik
  • Rajesh Rao
  • David Shue
  • Christopher Taylor
  • Marcel van der Holst
  • Dale Woodford

SESSION: SIGMOD Session 8. Tree & Graph Processing (1)

Landmark Indexing for Evaluation of Label-Constrained Reachability Queries
  • Lucien D.J. Valstar
  • George H.L. Fletcher
  • Yuichi Yoshida
Efficient Ad-Hoc Graph Inference and Matching in Biological Databases
  • Xiang Lian
  • Dongchul Kim
DAG Reduction: Fast Answering Reachability Queries
  • Junfeng Zhou
  • Shijie Zhou
  • Jeffrey Xu Yu
  • Hao Wei
  • Ziyang Chen
  • Xian Tang
Flexible and Feasible Support Measures for Mining Frequent Patterns in Large Labeled Graphs
  • Jinghan Meng
  • Yi-cheng Tu

SESSION: SIGMOD Session 9. New Hardware

Accelerating Pattern Matching Queries in Hybrid CPU-FPGA Architectures
  • David Sidler
  • Zsolt István
  • Muhsen Owaida
  • Gustavo Alonso
A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs
  • Elias Stehle
  • Hans-Arno Jacobsen
FPGA-based Data Partitioning
  • Kaan Kara
  • Jana Giceva
  • Gustavo Alonso
Template Skycube Algorithms for Heterogeneous Parallelism on Multicore and GPU Architectures
  • Kenneth S. Bøgh
  • Sean Chester
  • Darius Šidlauskas
  • Ira Assent

SESSION: SIGMOD Session 10. Parallel and Distributed Query Processing (2)

Heterogeneity-aware Distributed Parameter Servers
  • Jiawei Jiang
  • Bin Cui
  • Ce Zhang
  • Lele Yu
Distributed Algorithms on Exact Personalized PageRank
  • Tao Guo
  • Xin Cao
  • Gao Cong
  • Jiaheng Lu
  • Xuemin Lin
Parallelizing Sequential Graph Computations
  • Wenfei Fan
  • Jingbo Xu
  • Yinghui Wu
  • Wenyuan Yu
  • Jiaxin Jiang
  • Zeyu Zheng
  • Bohan Zhang
  • Yang Cao
  • Chao Tian

SESSION: Keynote Session - Grand Challenges in Data Management: Approximate Query Processing

Approximate Query Processing: No Silver Bullet
  • Surajit Chaudhuri
  • Bolin Ding
  • Srikanth Kandula
Approximate Query Engines: Commercial Challenges and Research Opportunities
  • Barzan Mozafari
Approximate Query Processing for Interactive Data Science
  • Tim Kraska

SESSION: SIGMOD Session 11. Interactive Data Exploration and AQP (1)

Controlling False Discoveries During Interactive Data Exploration
  • Zheguang Zhao
  • Lorenzo De Stefani
  • Emanuel Zgraggen
  • Carsten Binnig
  • Eli Upfal
  • Tim Kraska
MacroBase: Prioritizing Attention in Fast Data
  • Peter Bailis
  • Edward Gan
  • Samuel Madden
  • Deepak Narayanan
  • Kexin Rong
  • Sahaana Suri
Data Canopy: Accelerating Exploratory Statistical Analysis
  • Abdul Wasay
  • Xinding Wei
  • Niv Dayan
  • Stratos Idreos

SESSION: SIGMOD Session 12. Beliefs, Conflicts, Knowledge

Beta Probabilistic Databases: A Scalable Approach to Belief Updating and Parameter Learning
  • Niccolo' Meneghetti
  • Oliver Kennedy
  • Wolfgang Gatterbauer
Database Learning: Toward a Database that Becomes Smarter Every Time
  • Yongjoo Park
  • Ahmad Shahab Tajik
  • Michael Cafarella
  • Barzan Mozafari
Staging User Feedback toward Rapid Conflict Resolution in Data Fusion
  • Romila Pradhan
  • Siarhei Bykau
  • Sunil Prabhakar

SESSION: SIGMOD Session 13. Influence in Social Networks

Discovering Your Selling Points: Personalized Social Influential Tags Exploration
  • Yuchen Li
  • Ju Fan
  • Dongxiang Zhang
  • Kian-Lee Tan
Coarsening Massive Influence Networks for Scalable Diffusion Analysis
  • Naoto Ohsaka
  • Tomohiro Sonobe
  • Sumio Fujita
  • Ken-ichi Kawarabayashi
Debunking the Myths of Influence Maximization: An In-Depth Benchmarking Study
  • Akhil Arora
  • Sainyam Galhotra
  • Sayan Ranu

SESSION: SIGMOD Session 14. Mappings, Transformations, Pricing

Interactive Mapping Specification with Exemplar Tuples
  • Angela Bonifati
  • Ugo Comignani
  • Emmanuel Coquery
  • Romuald Thion
Foofah: Transforming Data By Example
  • Zhongjun Jin
  • Michael R. Anderson
  • Michael Cafarella
  • H. V. Jagadish
QIRANA: A Framework for Scalable Query Pricing
  • Shaleen Deep
  • Paraschos Koutris

SESSION: SIGMOD Session 15. Optimization and Performance (1)

Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe?
  • Michael S. Kester
  • Manos Athanassoulis
  • Stratos Idreos
Optimization of Disjunctive Predicates for Main Memory Column Stores
  • Fisnik Kastrati
  • Guido Moerkotte
A Top-Down Approach to Achieving Performance Predictability in Database Systems
  • Jiamin Huang
  • Barzan Mozafari
  • Grant Schoenebeck
  • Thomas F. Wenisch

SESSION: SIGMOD Session 16. Interactive Data Exploration and AQP (2)

Two-Level Sampling for Join Size Estimation
  • Yu Chen
  • Ke Yi
A General-Purpose Counting Filter: Making Every Bit Count
  • Prashant Pandey
  • Michael A. Bender
  • Rob Johnson
  • Rob Patro
BePI: Fast and Memory-Efficient Method for Billion-Scale Random Walk with Restart
  • Jinhong Jung
  • Namyong Park
  • Sael Lee
  • U Kang

SESSION: SIGMOD Session 17. User Preferences

Determining the Impact Regions of Competing Options in Preference Space
  • Bo Tang
  • Kyriakos Mouratidis
  • Man Lung Yiu
Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Representative
  • Abolfazl Asudeh
  • Azade Nazi
  • Nan Zhang
  • Gautam Das
FEXIPRO: Fast and Exact Inner Product Retrieval in Recommender Systems
  • Hui Li
  • Tsz Nam Chan
  • Man Lung Yiu
  • Nikos Mamoulis
Feedback-Aware Social Event-Participant Arrangement
  • Jieying She
  • Yongxin Tong
  • Lei Chen
  • Tianshu Song

SESSION: SIGMOD Session 18. Tree & Graph Processing (2)

Exploiting Common Patterns for Tree-Structured Data
  • Zhiyi Wang
  • Shimin Chen
Extracting and Analyzing Hidden Graphs from Relational Databases
  • Konstantinos Xirogiannopoulos
  • Amol Deshpande
TrillionG: A Trillion-scale Synthetic Graph Generator using a Recursive Vector Model
  • Himchan Park
  • Min-Soo Kim

SESSION: SIGMOD Session 19. Machine Learning

Schema Independent Relational Learning
  • Jose Picado
  • Arash Termehchy
  • Alan Fern
  • Parisa Ataei
Scalable Kernel Density Classification via Threshold-Based Pruning
  • Edward Gan
  • Peter Bailis
The BUDS Language for Distributed Bayesian Machine Learning
  • Zekai J. Gao
  • Shangyu Luo
  • Luis L. Perez
  • Chris Jermaine
A Cost-based Optimizer for Gradient Descent Optimization
  • Zoi Kaoudi
  • Jorge-Arnulfo Quiane-Ruiz
  • Saravanan Thirumuruganathan
  • Sanjay Chawla
  • Divy Agrawal

SESSION: SIGMOD Session 20. Optimization and Performance (2)

An Experimental Study of Bitmap Compression vs. Inverted List Compression
  • Jianguo Wang
  • Chunbin Lin
  • Yannis Papakonstantinou
  • Steven Swanson
Automatic Database Management System Tuning Through Large-scale Machine Learning
  • Dana Van Aken
  • Andrew Pavlo
  • Geoffrey J. Gordon
  • Bohan Zhang
Solving the Join Ordering Problem via Mixed Integer Linear Programming
  • Immanuel Trummer
  • Christoph Koch
Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases
  • Alexandre Verbitski
  • Anurag Gupta
  • Debanjan Saha
  • Murali Brahmadesam
  • Kamal Gupta
  • Raman Mittal
  • Sailesh Krishnamurthy
  • Sandor Maurice
  • Tengiz Kharatishvili
  • Xiaofeng Bao

SESSION: SIGMOD Session 21. Encryption

Fast Searchable Encryption With Tunable Locality
  • Ioannis Demertzis
  • Charalampos Papamanthou
Cryptanalysis of Comparable Encryption in SIGMOD'16
  • Caleb Horst
  • Ryo Kikuchi
  • Keita Xagawa
BLOCKBENCH: A Framework for Analyzing Private Blockchains
  • Tien Tuan Anh Dinh
  • Ji Wang
  • Gang Chen
  • Rui Liu
  • Beng Chin Ooi
  • Kian-Lee Tan

SESSION: SIGMOD Session 22. Cleaning, Versioning, Fusion (1)

Living in Parallel Realities: Co-Existing Schema Versions with a Bidirectional Database Evolution Language
  • Kai Herrmann
  • Hannes Voigt
  • Andreas Behrend
  • Jonas Rausch
  • Wolfgang Lehner
Synthesizing Mapping Relationships Using Table Corpus
  • Yue Wang
  • Yeye He
Waldo: An Adaptive Human Interface for Crowd Entity Resolution
  • Vasilis Verroios
  • Hector Garcia-Molina
  • Yannis Papakonstantinou

SESSION: SIGMOD Session 23. Tree & Graph Processing (3)

ZipG: A Memory-efficient Graph Store for Interactive Queries
  • Anurag Khandelwal
  • Zongheng Yang
  • Evan Ye
  • Rachit Agarwal
  • Ion Stoica
All-in-One: Graph Processing in RDBMSs Revisited
  • Kangfei Zhao
  • Jeffrey Xu Yu
Computing A Near-Maximum Independent Set in Linear Time by Reducing-Peeling
  • Lijun Chang
  • Wei Li
  • Wenjie Zhang

SESSION: SIGMOD Session 24. Spatial and Multidimensional Data (1)

Utility-Aware Ridesharing on Road Networks
  • Peng Cheng
  • Hao Xin
  • Lei Chen
Distance Oracle on Terrain Surface
  • Victor Junqiu Wei
  • Raymond Chi-Wing Wong
  • Cheng Long
  • David M. Mount
Efficient Computation of Top-k Frequent Terms over Spatio-temporal Ranges
  • Pritom Ahmed
  • Mahbub Hasan
  • Abhijith Kashyap
  • Vagelis Hristidis
  • Vassilis J. Tsotras

SESSION: SIGMOD Session 25. Optimization and Main Memory (1)

Optimizing Iceberg Queries with Complex Joins
  • Brett Walenz
  • Sudeepa Roy
  • Jun Yang
The Dynamic Yannakakis Algorithm: Compact and Efficient Query Processing Under Updates
  • Muhammad Idris
  • Martin Ugarte
  • Stijn Vansummeren
Revisiting Reuse in Main Memory Database Systems
  • Kayhan Dursun
  • Carsten Binnig
  • Ugur Cetintemel
  • TIm Kraska

SESSION: SIGMOD Session 26. Privacy

Pufferfish Privacy Mechanisms for Correlated Data
  • Shuang Song
  • Yizhen Wang
  • Kamalika Chaudhuri
Bolt-on Differential Privacy for Scalable Stochastic Gradient Descent-based Analytics
  • Xi Wu
  • Fengan Li
  • Arun Kumar
  • Kamalika Chaudhuri
  • Somesh Jha
  • Jeffrey Naughton
Pythia: Data Dependent Differentially Private Algorithm Selection
  • Ios Kotsogiannis
  • Ashwin Machanavajjhala
  • Michael Hay
  • Gerome Miklau
Utility Cost of Formal Privacy for Releasing National Employer-Employee Statistics
  • Samuel Haney
  • Ashwin Machanavajjhala
  • John M. Abowd
  • Matthew Graham
  • Mark Kutzbach
  • Lars Vilhuber

SESSION: SIGMOD Session 27. Cleaning, Versioning, Fusion (2)

Online Deduplication for Databases
  • Lianghong Xu
  • Andrew Pavlo
  • Sudipta Sengupta
  • Gregory R. Ganger
QFix: Diagnosing Errors through Query Histories
  • Xiaolan Wang
  • Alexandra Meliou
  • Eugene Wu
UGuide: User-Guided Discovery of FD-Detectable Errors
  • Saravanan Thirumuruganathan
  • Laure Berti-Equille
  • Mourad Ouzzani
  • Jorge-Arnulfo Quiane-Ruiz
  • Nan Tang
SLiMFast: Guaranteed Results for Data Fusion and Source Reliability
  • Theodoros Rekatsinas
  • Manas Joglekar
  • Hector Garcia-Molina
  • Aditya Parameswaran
  • Christopher Ré

SESSION: SIGMOD Session 28. Crowdsourcing

Crowdsourced Top-k Queries by Confidence-Aware Pairwise Judgments
  • Ngai Meng Kou
  • Yan Li
  • Hao Wang
  • Leong Hou U.
  • Zhiguo Gong
Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services
  • Sanjib Das
  • Paul Suganthan G.C.
  • AnHai Doan
  • Jeffrey F. Naughton
  • Ganesh Krishnan
  • Rohit Deep
  • Esteban Arcaute
  • Vijay Raghavendra
  • Youngchoon Park
CrowdDQS: Dynamic Question Selection in Crowdsourcing Systems
  • Asif R. Khan
  • Hector Garcia-Molina
CDB: Optimizing Queries with Crowd-Based Selections and Joins
  • Guoliang Li
  • Chengliang Chai
  • Ju Fan
  • Xueping Weng
  • Jian Li
  • Yudian Zheng
  • Yuanbing Li
  • Xiang Yu
  • Xiaohang Zhang
  • Haitao Yuan

SESSION: SIGMOD Session 29. Spatial and Multidimensional Data (2)

Scaling Locally Linear Embedding
  • Yasuhiro Fujiwara
  • Naoki Marumo
  • Mathieu Blondel
  • Koh Takeuchi
  • Hideaki Kim
  • Tomoharu Iwata
  • Naonori Ueda
Dynamic Density Based Clustering
  • Junhao Gan
  • Yufei Tao
Extracting Top-K Insights from Multi-dimensional Data
  • Bo Tang
  • Shi Han
  • Man Lung Yiu
  • Rui Ding
  • Dongmei Zhang
QUILTS: Multidimensional Data Partitioning Framework Based on Query-Aware and Skew-Tolerant Space-Filling Curves
  • Shoji Nishimura
  • Haruo Yokota

SESSION: SIGMOD Session 30. Optimization and Main Memory (2)

Leveraging Re-costing for Online Optimization of Parameterized Queries with Guarantees
  • Anshuman Dutt
  • Vivek Narasayya
  • Surajit Chaudhuri
Handling Environments in a Nested Relational Algebra with Combinators and an Implementation in a Verified Query Compiler
  • Joshua S. Auerbach
  • Martin Hirzel
  • Louis Mandel
  • Avraham Shinnar
  • Jérôme Siméon
From In-Place Updates to In-Place Appends: Revisiting Out-of-Place Updates on Flash
  • Sergey Hardock
  • Ilia Petrov
  • Robert Gottstein
  • Alejandro Buchmann

DEMONSTRATION SESSION: Demonstrations

Visual Graph Query Construction and Refinement
  • Robert Pienta
  • Fred Hohman
  • Acar Tamersoy
  • Alex Endert
  • Shamkant Navathe
  • Hanghang Tong
  • Duen Horng Chau
Demonstration of the Cosette Automated SQL Prover
  • Shumo Chu
  • Daniel Li
  • Chenglong Wang
  • Alvin Cheung
  • Dan Suciu
Interactive Time Series Analytics Powered by ONEX
  • Rodica Neamtu
  • Ramoza Ahsan
  • Charles Lovering
  • Cuong Nguyen
  • Elke Rundensteiner
  • Gabor Sarkozy
The VADA Architecture for Cost-Effective Data Wrangling
  • Nikolaos Konstantinou
  • Martin Koehler
  • Edward Abel
  • Cristina Civili
  • Bernd Neumayr
  • Emanuel Sallinger
  • Alvaro A.A. Fernandes
  • Georg Gottlob
  • John A. Keane
  • Leonid Libkin
  • Norman W. Paton
A Demonstration of Lusail: Querying Linked Data at Scale
  • Essam Mansour
  • Ibrahim Abdelaziz
  • Mourad Ouzzani
  • Ashraf Aboulnaga
  • Panos Kalnis
Foofah: A Programming-By-Example System for Synthesizing Data Transformation Programs
  • Zhongjun Jin
  • Michael R. Anderson
  • Michael Cafarella
  • H. V. Jagadish
Virtualized Network Service Topology Exploration Using Nepal
  • Pramod Jamkhedkar
  • Theodore Johnson
  • Yaron Kanza
  • Aman Shaikh
  • N.K. Shankarnarayanan
  • Vladislav Shkapenyuk
  • Gordon Woodhull
VisualCloud Demonstration: A DBMS for Virtual Reality
  • Brandon Haynes
  • Artem Minyaylov
  • Magdalena Balazinska
  • Luis Ceze
  • Alvin Cheung
The Best of Both Worlds: Big Data Programming with Both Productivity and Performance
  • Fan Yang
  • Yuzhen Huang
  • Yunjian Zhao
  • Jinfeng Li
  • Guanxian Jiang
  • James Cheng
In-Browser Interactive SQL Analytics with Afterburner
  • Kareem El Gebaly
  • Jimmy Lin
Debugging Big Data Analytics in Spark with BigDebug
  • Muhammad Ali Gulzar
  • Matteo Interlandi
  • Tyson Condie
  • Miryung Kim
Interactive Query Synthesis from Input-Output Examples
  • Chenglong Wang
  • Alvin Cheung
  • Rastislav Bodik
Generating Concise Entity Matching Rules
  • Rohit Singh
  • Vamsi Meduri
  • Ahmed Elmagarmid
  • Samuel Madden
  • Paolo Papotti
  • Jorge-Arnulfo Quiané-Ruiz
  • Armando Solar-Lezama
  • Nan Tang
A Demo of the Data Civilizer System
  • Raul Castro Fernandez
  • Dong Deng
  • Essam Mansour
  • Abdulhakim A. Qahtan
  • Wenbo Tao
  • Ziawasch Abedjan
  • Ahmed Elmagarmid
  • Ihab F. Ilyas
  • Samuel Madden
  • Mourad Ouzzani
  • Michael Stonebraker
  • Nan Tang
Querying and Exploring Polygamous Relationships in Urban Spatio-Temporal Data Sets
  • Yeuk-Yin Chan
  • Fernando Chirigati
  • Harish Doraiswamy
  • Cláudio T. Silva
  • Juliana Freire
Graph Data Mining with Arabesque
  • Eslam Hussein
  • Abdurrahman Ghanem
  • Vinicius Vitor dos Santos Dias
  • Carlos H.C. Teixeira
  • Ghadeer AbuOda
  • Marco Serafini
  • Georgos Siganos
  • Gianmarco De Francisci Morales
  • Ashraf Aboulnaga
  • Mohammed Zaki
Alpine: Efficient In-Situ Data Exploration in the Presence of Updates
  • Antonios Anagnostou
  • Matthaios Olma
  • Anastasia Ailamaki
OrpheusDB: A Lightweight Approach to Relational Dataset Versioning
  • Liqi Xu
  • Silu Huang
  • Sili Hui
  • Aaron J. Elmore
  • Aditya Parameswaran
doppioDB: A Hardware Accelerated Database
  • David Sidler
  • Zsolt Istvan
  • Muhsen Owaida
  • Kaan Kara
  • Gustavo Alonso
DBridge: Translating Imperative Code to SQL
  • K. Venkatesh Emani
  • Tejas Deshpande
  • Karthik Ramachandra
  • S. Sudarshan
BEAS: Bounded Evaluation of SQL Queries
  • Yang Cao
  • Wenfei Fan
  • Yanghao Wang
  • Tengfei Yuan
  • Yanchao Li
  • Laura Yu Chen
Safe Visual Data Exploration
  • Zheguang Zhao
  • Emanuel Zgraggen
  • Lorenzo De Stefani
  • Carsten Binnig
  • Eli Upfal
  • Tim Kraska
Optimizing Data-Intensive Applications Automatically By Leveraging Parallel Data Processing Frameworks
  • Maaz Bin Safeer Ahmad
  • Alvin Cheung
DIAS: Differentially Private Interactive Algorithm Selection using Pythia
  • Ios Kotsogiannis
  • Michael Hay
  • Ashwin Machanavajjhala
  • Gerome Miklau
  • Margaret Orr
Snorkel: Fast Training Set Generation for Information Extraction
  • Alexander J. Ratner
  • Stephen H. Bach
  • Henry R. Ehrenberg
  • Chris Ré
Synthesizing Extraction Rules from User Examples with SEER
  • Maeda F. Hanafi
  • Azza Abouzied
  • Laura Chiticariu
  • Yunyao Li
Scout: A GPU-Aware System for Interactive Spatio-temporal Data Visualization
  • Harshada Chavan
  • Mohamed F. Mokbel
Graphflow: An Active Graph Database
  • Chathura Kankanamge
  • Siddhartha Sahu
  • Amine Mhedbhi
  • Jeremy Chen
  • Semih Salihoglu
Demonstration: MacroBase, A Fast Data Analysis Engine
  • Peter Bailis
  • Edward Gan
  • Kexin Rong
  • Sahaana Suri
Q*cert: A Platform for Implementing and Verifying Query Compilers
  • Joshua S. Auerbach
  • Martin Hirzel
  • Louis Mandel
  • Avraham Shinnar
  • Jérôme Siméon
A Demonstration of Interactive Analysis of Performance Measurements with Viska
  • Helga Gudmundsdottir
  • Babak Salimi
  • Magdalena Balazinska
  • Dan R.K. Ports
  • Dan Suciu

TUTORIAL SESSION: Tutorials

Crowdsourced Data Management: Overview and Challenges
  • Guoliang Li
  • Yudian Zheng
  • Ju Fan
  • Jiannan Wang
  • Reynold Cheng
Data Management in Machine Learning: Challenges, Techniques, and Systems
  • Arun Kumar
  • Matthias Boehm
  • Jun Yang
Data Management Challenges in Production Machine Learning
  • Neoklis Polyzotis
  • Sudip Roy
  • Steven Euijong Whang
  • Martin Zinkevich
Differential Privacy in the Wild: A Tutorial on Current Practices & Open Challenges
  • Ashwin Machanavajjhala
  • Xi He
  • Michael Hay
Graph Querying Meets HCI: State of the Art and Future Directions
  • Sourav S. Bhowmick
  • Byron Choi
  • Chengkai Li
Graph Exploration: From Users to Large Graphs
  • Davide Mottin
  • Emmanuel Müller
Building Structured Databases of Factual Knowledge from Massive Text Corpora
  • Xiang Ren
  • Meng Jiang
  • Jingbo Shang
  • Jiawei Han
Data Profiling: A Tutorial
  • Ziawasch Abedjan
  • Lukasz Golab
  • Felix Naumann
How to Build a Non-Volatile Memory Database Management System
  • Joy Arulraj
  • Andrew Pavlo
Data Structure Engineering For Byte-Addressable Non-Volatile Memory
  • Ismail Oukid
  • Wolfgang Lehner
Natural Language Data Management and Interfaces: Recent Development and Open Challenges
  • Yunyao Li
  • Davood Rafiei
Hybrid Transactional/Analytical Processing: A Survey
  • Fatma Özcan
  • Yuanyuan Tian
  • Pinar Tözün
Query Processing Techniques for Big Spatial-Keyword Data
  • Ahmed Mahmood
  • Walid G. Aref