The thirteenth issue of the seventh volume of the Proceedings of the VLDB Endowment (PVLDB) is now available electronically at the VLDB web site:
http://www.vldb.org/pvldb/vol7.html
This issue contains the papers accepted for the Industry and Demonstration Tracks, the tutorials and Ph.D. workshop overview, the keynote and award talks, and selected papers from local industry for VLDB 2014, held in Hangzhou, China, in September 2014:
Industrial, Applications, and Experience Papers
1. MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs
Juwei Shi, Jia Zou, Jiaheng Lu, Zhao Cao, Shiqiang Li, Chen Wang
http://www.vldb.org/pvldb/vol7/p1319-shi.pdf
2. Reducing Database Locking Contention Through Multi-version Concurrency
Mohammad Sadoghi, Mustafa Canim, Bishwaranjan Bhattacharjee, Fabian Nagel, Kenneth A. Ross
http://www.vldb.org/pvldb/vol7/p1331-sadoghi.pdf
3. Changing Engines in Midstream: A Java Stream Computational Model for Big Data Processing
Xueyuan Su, Garret Swart, Brian Goetz, Brian Oliver, Paul Sandoz
http://www.vldb.org/pvldb/vol7/p1343-su.pdf
4. Joins on Encoded and Partitioned Data
Jae-Gil Lee, Gopi Attaluri, Ronald Barber, Naresh Chainani, Oliver Draese, Frederick Ho, Stratos Idreos, Min-Soo Kim, Sam Lightstone, Guy Lohman, Konstantinos Morfonios, Keshava Murthy, Ippokratis Pandis, Lin Qiao, Vijayshankar Raman, Vincent Kulandai Samy, Richard Sidle, Knut Stolz, Liping Zhang
http://www.vldb.org/pvldb/vol7/p1355-lee.pdf
5. TPC-DI: The First Industry Benchmark for Data Integration
Meikel Poess, Tilmann Rabl, Brian Caufield
http://www.vldb.org/pvldb/vol7/p1367-poess.pdf
6. Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs
Pankaj Gupta, Venu Satuluri, Ajeet Grewal, Siva Gurumurthy, Volodymyr Zhabiuk, Quannan Li, Jimmy Lin
http://www.vldb.org/pvldb/vol7/p1379-lin.pdf
7. Interval Disaggregate: A New Operator for Business Planning
Sang K. Cha, Kunsoo Park, Changbin Song, Kihong Kim, Cheol Ryu, Sunho Lee
http://www.vldb.org/pvldb/vol7/p1381-cha.pdf
8. Fuxi: a Fault-Tolerant Resource Management and Job Scheduling System at Internet Scale
Zhuo Zhang, Chao Li, Yangyu Tao, Renyu Yang, Hong Tang, Jie Xu
http://www.vldb.org/pvldb/vol7/p1393-zhang.pdf
9. Large-Scale Graph Analytics in Aster 6: Bringing Context to Big Data Discovery
David Simmen, Karl Schnaitter, Jeff Davis, Yingjie He, Sangeet Lohariwala, Ajay Mysore, Vinayak Shenoi, Mingfeng Tan, Yu Xiao
http://www.vldb.org/pvldb/vol7/p1405-simmen.pdf
10. Fast Foreign-Key Detection in Microsoft SQL Server PowerPivot for Excel
Zhimin Chen, Vivek Narasayya, Surajit Chaudhuri
http://www.vldb.org/pvldb/vol7/p1417-chen.pdf
11. Big Data Small Footprint: The Design of A Low-Power Classifier for Detecting Transportation Modes
Meng-Chieh Yu, Tong Yu, Shao-Chen Wang, Chih-Jen Lin, Edward Y. Chang
http://www.vldb.org/pvldb/vol7/p1429-yu.pdf
12. Summingbird: A Framework for Integrating Batch and Online MapReduce Computations
Oscar Boykin, Sam Ritchie, Ian O’Connell, Jimmy Lin
http://www.vldb.org/pvldb/vol7/p1441-boykin.pdf
13. Of Snowstorms and Bushy Trees
Rafi Ahmed, Rajkumar Sen, Meikel Poess, Sunil Chakkappen
http://www.vldb.org/pvldb/vol7/p1452-ahmed.pdf
14. Execution Primitives for Scalable Joins and Aggregations in Map Reduce
Srinivas Vemuri, Maneesh Varshney, Krishna Puttaswamy, Rui Liu
http://www.vldb.org/pvldb/vol7/p1462-vemuri.pdf
15. CAP Limits in Telecom Subscriber Database Design
Javier Arauz
http://www.vldb.org/pvldb/vol7/p1474-arauz.pdf
16. Advanced Join Strategies for Large-Scale Distributed Computation
Nicolas Bruno, YongChul Kwon, Ming-Chuan Wu
http://www.vldb.org/pvldb/vol7/p1484-bruno.pdf
17. DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index
Yue Liu, Songlin Hu, Tilmann Rabl, Wantao Liu, Hans-Arno Jacobsen, Kaifeng Wu, Jian Chen, Jintao Li
http://www.vldb.org/pvldb/vol7/p1496-liu.pdf
18. Error-bounded Sampling for Analytics on Big Sparse Data
Ying Yan, Liang Jeff Chen, Zheng Zhang
http://www.vldb.org/pvldb/vol7/p1508-yan.pdf
19. Indexing HDFS Data in PDW: Splitting the data from the index
Vinitha Reddy Gankidi, Nikhil Teletia, Jignesh M. Patel, Alan Halverson, David J. DeWitt
http://www.vldb.org/pvldb/vol7/p1520-gankidi.pdf
20. Chimera: Large-Scale Classification using Machine Learning, Rules, and Crowdsourcing
Chong Sun, Narasimhan Rampalli, Frank Yang, AnHai Doan
http://www.vldb.org/pvldb/vol7/p1529-sun.pdf
Demonstrations
1. Interactive Join Query Inference with JIM
Angela Bonifati, Radu Ciucanu, Slawek Staworko
http://www.vldb.org/pvldb/vol7/p1541-bonifati.pdf
2. MESA: A Map Service to Support Fuzzy Type-ahead Search over Geo-Textual Data
Yuxin Zheng, Zhifeng Bao, Lidan Shou, Anthony K. H. Tung
http://www.vldb.org/pvldb/vol7/p1545-zheng.pdf
3. R3: A Real-Time Route Recommendation System
Henan Wang, Guoliang Li, Huiqi Hu, Shuo Chen, Bingwen Shen, Hao Wu, Wen-Syan Li, Kian-Lee Tan
http://www.vldb.org/pvldb/vol7/p1549-wang.pdf
4. PDQ: Proof-driven Query Answering over Web-based Data
Michael Benedikt, Julien Leblay, Efthymia Tsamoura
http://www.vldb.org/pvldb/vol7/p1553-benedikt.pdf
5. Data In, Fact Out: Automated Monitoring of Facts by FactWatcher
Naeemul Hassan, Afroza Sultana, You Wu, Gensheng Zhang, Chengkai Li, Jun Yang, Cong Yu
http://www.vldb.org/pvldb/vol7/p1557-hassan.pdf
6. OceanST: A Distributed Analytic System for Large-Scale Spatiotemporal Mobile Broadband Data
Mingxuan Yuan, Ke Deng, Jia Zeng, Yanhua Li, Bing Ni, Xiuqiang He, Fei Wang, Wenyuan Dai, Qiang Yang
http://www.vldb.org/pvldb/vol7/p1561-yuan.pdf
7. That’s All Folks! LLUNATIC Goes Open Source
Floris Geerts, Giansalvatore Mecca, Paolo Papotti, Donatello Santoro
http://www.vldb.org/pvldb/vol7/p1565-mecca.pdf
8. HDBTracker: Monitoring the Aggregates On Dynamic Hidden Web Databases
Weimo Liu, Saad Bin Suhaim, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das, Ali Jaoua
http://www.vldb.org/pvldb/vol7/p1569-liu.pdf
9. BSMA: A Benchmark for Analytical Queries over Social Media Data
Fan Xia, Ye Li, Chengcheng Yu, Haixin Ma, Weining Qian
http://www.vldb.org/pvldb/vol7/p1573-xia.pdf
10. Graph-based Data Integration and Business Intelligence with BIIIG
Andre Petermann, Martin Junghanns, Robert Muller, Erhard Rahm
http://www.vldb.org/pvldb/vol7/p1577-petermann.pdf
11. SEEDB: Automatically Generating Query Visualizations
Manasi Vartak, Samuel Madden, Aditya Parameswaran, Neoklis Polyzotis
http://www.vldb.org/pvldb/vol7/p1581-vartak.pdf
12. QUEST: An Exploratory Approach to Robust Query Processing
Anshuman Dutt, Sumit Neelam, Jayant R. Haritsa
http://www.vldb.org/pvldb/vol7/p1585-dutt.pdf
13. Redoop Infrastructure for Recurring Big Data Queries
Chuan Lei, Zhongfang Zhuang, Elke A. Rundensteiner, Mohamed Y. Eltabakh
http://www.vldb.org/pvldb/vol7/p1589-lei.pdf
14. PackageBuilder: From Tuples to Packages
Matteo Brucato, Rahul Ramakrishna, Azza Abouzied, Alexandra Meliou
http://www.vldb.org/pvldb/vol7/p1593-brucato.pdf
15. Ontology Assisted Crowd Mining
Yael Amsterdamer, Susan B. Davidson, Tova Milo, Slava Novgorodov, Amit Somech
http://www.vldb.org/pvldb/vol7/p1597-amsterdamer.pdf
16. SOPS: A System for Efficient Processing of Spatial-Keyword Publish/Subscribe
Lisi Chen, Yan Cui, Gao Cong, Xin Cao
http://www.vldb.org/pvldb/vol7/p1601-chen.pdf
17. MLJ: Language-Independent Real-Time Search of Tweets Reported by Media Outlets and Journalists
Masumi Shirakawa, Takahiro Hara, Shojiro Nishio
http://www.vldb.org/pvldb/vol7/p1605-shirakawa.pdf
18. Ocelot/HyPE: Optimized Data Processing on Heterogeneous Hardware
Sebastian Bress, Max Heimel, Michael Saecker, Bastian Kocher, Volker Markl, Gunter Saake
http://www.vldb.org/pvldb/vol7/p1609-bress.pdf
19. MoveMine 2.0: Mining Object Relationships from Movement Data
Fei Wu, Tobias Kin Hou Lei, Zhenhui Li, Jiawei Han
http://www.vldb.org/pvldb/vol7/p1613-wu.pdf
20. A Partitioning Framework for Aggressive Data Skipping
Liwen Sun, Sanjay Krishnan, Reynold S. Xin, Michael J. Franklin
http://www.vldb.org/pvldb/vol7/p1617-sun.pdf
21. Interactive Outlier Exploration in Big Data Streams
Lei Cao, Qingyang Wang, Elke A. Rundensteiner
http://www.vldb.org/pvldb/vol7/p1621-cao.pdf
22. SQL/AA: Executing SQL on an Asymmetric Architecture
Quoc-Cuong To, Benjamin Nguyen, Philippe Pucheral
http://www.vldb.org/pvldb/vol7/p1625-to.pdf
23. gMission: A General Spatial Crowdsourcing Platform
Zhao Chen, Rui Fu, Ziyuan Zhao, Zheng Liu, Leihao Xia, Lei Chen, Peng Cheng, Caleb Chen Cao, Yongxin Tong, Chen Jason Zhang
http://www.vldb.org/pvldb/vol7/p1629-chen.pdf
24. S-Store: A Streaming NewSQL System for Big Velocity Applications
Ugur Cetintemel, Jiang Du, Tim Kraska, Samuel Madden, David Maier, John Meehan, Andrew Pavlo, Michael Stonebraker, Erik Sutherland, Nesime Tatbul, Kristin Tufte, Hao Wang, Stanley Zdonik
http://www.vldb.org/pvldb/vol7/p1633-cetintemel.pdf
25. CLEar: A Real-time Online Observatory for Bursty and Viral Events
Runquan Xie, Feida Zhu, Hui Ma, Wei Xie, Chen Lin
http://www.vldb.org/pvldb/vol7/p1637-xie.pdf
26. AZDBLab: A Laboratory Information System for Large-Scale Empirical DBMS Studies
Young-Kyoon Suh, Richard T. Snodgrass, Rui Zhang
http://www.vldb.org/pvldb/vol7/p1641-suh.pdf
27. Terrain-Toolkit: A Multi-Functional Tool for Terrain Data
Qi Wang, Manohar Kaul, Cheng Long, Raymond Chi-Wing Wong
http://www.vldb.org/pvldb/vol7/p1645-wang.pdf
28. FORWARD: Data-Centric UIs using Declarative Templates that Efficiently Wrap Third-Party JavaScript Components
Yupeng Fu, Kian Win Ong, Yannis Papakonstantinou, Erick Zamora
http://www.vldb.org/pvldb/vol7/p1649-fu.pdf
29. SPIRE: Supporting Parameter-Driven Interactive Rule Mining and Exploration
Xika Lin, Abhishek Mukherji, Elke A. Rundensteiner, Matthew O. Ward
http://www.vldb.org/pvldb/vol7/p1653-lin.pdf
30. An Integrated Development Environment for Faster Feature Engineering
Michael R. Anderson, Michael Cafarella, Yixing Jiang, Guan Wang, Bochun Zhang
http://www.vldb.org/pvldb/vol7/p1657-anderson.pdf
31. Pronto: A Software-Defined Networking based System for Performance Management of Analytical Queries on Distributed Data Stores
Pengcheng Xiong, Hakan Hacigumus
http://www.vldb.org/pvldb/vol7/p1661-xiong.pdf
32. Getting Your Big Data Priorities Straight: A Demonstration of Priority-based QoS using Social-network-driven Stock Recommendation
Rui Zhang, Reshu Jain, Prasenjit Sarkar, Lukas Rupprecht
http://www.vldb.org/pvldb/vol7/p1665-zhang.pdf
33. VERTEXICA: Your Relational Friend for Graph Analytics!
Alekh Jindal, Praynaa Rawlani, Eugene Wu, Samuel Madden, Amol Deshpande, Mike Stonebraker
http://www.vldb.org/pvldb/vol7/p1669-jindal.pdf
34. NScale: Neighborhood-centric Analytics on Large Graphs
Abdul Quamar, Amol Deshpande, Jimmy Lin
http://www.vldb.org/pvldb/vol7/p1673-quamar.pdf
35. DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing
Haoran Li, Li Xiong, Lifan Zhang, Xiaoqian Jiang
http://www.vldb.org/pvldb/vol7/p1677-li.pdf
36. SPOT: Locating Social Media Users Based on Social Network Context
Longbo Kong, Zhi Liu, Yan Huang
http://www.vldb.org/pvldb/vol7/p1681-liu.pdf
37. RASP-QS: Efficient and Confidential Query Services in the Cloud
Zohreh Alavi, Lu Zhou, James Powers, Keke Chen
http://www.vldb.org/pvldb/vol7/p1685-alavi.pdf
38. Thoth: Towards Managing a Multi-System Cluster
Mayuresh Kunjir, Prajakta Kalmegh, Shivnath Babu
http://www.vldb.org/pvldb/vol7/p1689-kunjir.pdf
39. X-LiSA: Cross-lingual Semantic Annotation
Lei Zhang, Achim Rettinger
http://www.vldb.org/pvldb/vol7/p1693-zhang.pdf
40. Combining User Interaction, Speculative Query Execution and Sampling in the DICE System
Prasanth Jayachandran, Karthik Tunga, Niranjan Kamat, Arnab Nandi
http://www.vldb.org/pvldb/vol7/p1697-jayachandran.pdf
41. STMaker – A System to Make Sense of Trajectory Data
Han Su, Kai Zheng, Kai Zeng, Jiamin Huang, Xiaofang Zhou
http://www.vldb.org/pvldb/vol7/p1701-su.pdf
42. Faster Visual Analytics through Pixel-Perfect Aggregation
Uwe Jugel, Zbigniew Jerzak, Gregor Hackenbroich, Volker Markl
http://www.vldb.org/pvldb/vol7/p1705-jugel.pdf
Tutorials and Workshop
1. Systems for Big-Graphs
Arijit Khan, Sameh Elnikety
http://www.vldb.org/pvldb/vol7/p1709-khan.pdf
2. Tutorial: Uncertain Entity Resolution
Avigdor Gal
http://www.vldb.org/pvldb/vol7/p1711-gal.pdf
3. Knowledge Bases in the Age of Big Data Analytics
Fabian M. Suchanek, Gerhard Weikum
http://www.vldb.org/pvldb/vol7/p1713-suchanek.pdf
4. Causality and Explanations in Databases
Alexandra Meliou, Sudeepa Roy, Dan Suciu
http://www.vldb.org/pvldb/vol7/p1715-meliou.pdf
5. Enterprise Search in the Big Data Era: Recent Developments and Open Challenges
Yunyao Li, Ziyang Liu, Huaiyu Zhu
http://www.vldb.org/pvldb/vol7/p1717-li.pdf
6. VLDB 2014 Ph.D. Workshop — An Overview
Yunyao Li, Erich Neuhold
http://www.vldb.org/pvldb/vol7/p1719-phd.pdf
Keynote and Award Talks
1. Datacenters as Computers: Google Engineering & Database Research Perspectives
Shivakumar Venkataraman, Divyakant Agrawal
http://www.vldb.org/pvldb/vol7/p1720-venkataraman-agrawal.pdf
2. The Impact of Columnar In-Memory Databases on Enterprise Systems
Hasso Plattner
http://www.vldb.org/pvldb/vol7/p1722-plattner.pdf
3. Breaking the Chains: On Declarative Data Analysis and Data Independence in the Big Data Era
Volker Markl
http://www.vldb.org/pvldb/vol7/p1730-markl.pdf
4. Engineering High-Performance Database Engines
Thomas Neumann
http://www.vldb.org/pvldb/vol7/p1734-neumann.pdf
Selected Papers from Local Industry
1. Realization of the Low Cost and High Performance MySQL Cloud Database
Wei Cao, Feng Yu, Jiasen Xie
http://www.vldb.org/pvldb/vol7/p1742-alibaba.pdf
2. Fatman: Cost-saving and reliable archival storage based on volunteer resources
An Qin, Dianming Hu, Jun Liu, Wenjun Yang, Dai Tan
http://www.vldb.org/pvldb/vol7/p1748-baidu.pdf
3. Design and Implementation of a Real-Time Interactive Analytics System for Large Spatio-Temporal Data
Shiming Zhang, Yin Yang, Wei Fan, Marianne Winslet
http://www.vldb.org/pvldb/vol7/p1754-huawei.pdf
4. A Personalized Recommendation System for NetEase Dating Site
Chaoyue Dai, Feng Qian, Wei Jiang, Zhoutian Wang, Zenghong Wu
http://www.vldb.org/pvldb/vol7/p1760-netease.pdf
5. GEMINI: An Integrative Healthcare Analytics System
Zheng Jye Ling, Quoc Trung Tran, Ju Fan, Gerald C.H. Koh, Thi Nguyen, Chuen Seng Tan, James W. L. Yip, Meihui Zhang
http://www.vldb.org/pvldb/vol7/p1766-nuhs.pdf
6. Mariana: Tencent Deep Learning Platform and its Applications
Yongqiang Zou, Xing Jin, Yi Li, Zhimao Guo, Eryu Wang, Bin Xiao
http://www.vldb.org/pvldb/vol7/p1772-tencent.pdf
7. yzBigData: Provisioning Customizable Solution for Big Data
Sai Wu, Gang Chen, Ke Chen, Lidan Shou, Hui Cao, He Bai
http://www.vldb.org/pvldb/vol7/p1778-yzbigdata.pdf
Errata
1. Errata for "Building Efficient Query Engines in a High-Level Language" (PVLDB 7(10): 853-864)
Yannis Klonatos, Christoph Koch, Tiark Rompf, Hassan Chafi
http://www.vldb.org/pvldb/vol7/p1784-klonatos.pdf
In addition, the front matter contains a letter from the VLDB 2014 Program Co-Chair, H. V. Jagadish.
http://www.vldb.org/pvldb/vol7/FrontMatterVol7No13.pdf
Li Xiong, Emory University
Cong Yu, Google Research New York
VLDB 2014 Proceedings Chairs