October 16 |
||
18:30-20:00 |
Registration [in Lower Lobby] |
|
October 17 |
||
7:30 |
Registration [in Lower Lobby] |
|
8:30-9:00 |
Opening [in Salle De Bal - Lower Lobby] |
|
9:00-10:00 |
Keynote: Yoshua Bengio - Deep Learning and AI [in Salle De Bal - Lower Lobby] |
|
10:00-10:20 |
Break |
|
10:20-12:20 |
Classification [in Salles Maisonneuve A] |
|
194 research On the Evaluation of Outlier Detection and One-Class Classification Methods, |
||
79 research Active Semi-Supervised Classification based on Multiple Clustering Hierarchies, |
||
110 research Combining Static and Dynamic Features for Multivariate Sequence Classification, Anna Leontjeva, Ilya Kuzovkin, |
||
61 research Correcting Relational Bias to Improve Classification in Sparsely-Labeled Networks, Joshua King, Luke McDowell, |
||
67 research Hyperparameter Optimization Machines, Lars Schmidt-Thieme, Martin Wistuba, Nicolas Schilling, |
||
10:20-12:20 |
Networks [in Salles Maisonneuve B, C] |
|
152 research Temporal Network Change Detection Using Network Centrality, |
||
255 research Harvester: Influence Optimization in Symmetric Interaction Networks, |
||
297 research Pattern Matching Trajectories for Investigative Graph Searches, |
||
240 research A Framework for Description and Analysis of Approximate Triangle Counting Algorithms, Mostafa Haghir Chehreghani, |
||
180 research Limiting the diffusion of information by a selective PageRank-preserving approach, Grigorios Loukides, Robert Gwadera |
||
10:20-12:20 |
Anonymity, Fraud, and Privacy [in Salles Maisonneuve D] |
|
30 research An Exploratory Statistical Cusp Catastrophe Model, |
||
34 research Using Loglinear Model for Discrimination Discovery and Prevention, |
||
145 applications Fraud Detection in Energy Consumption: A Supervised Approach, Bernat Coma-Puig, Josep Carmona, Ricard Gavalda, Santiago Alcoverro, Victor Martin, |
||
270 applications Anomaly Detection in Automobile Control Network Data with Long Short-Term Memory Networks, Adrian Taylor, Sylvain Leblanc and Nathalie Japkowicz |
||
220 applications Anonymizing NYC Taxi Data: Does It Matter? |
||
10:20-12:20 |
SS1: Statistical Learning for Data Science [in Salles Maisonneuve E, F] |
|
39 SS1-SLDS A Distributed Decision Tree Algorithm and Its Implementation on Big Data Platforms |
||
81 SS1-SLDS Analysing the History of Autism Spectrum Disorder using Topic Models Adham Beykikhoshk, Dinh Phung, Ognjen Arandelovic, Svetha Venkatesh, |
||
181 SS1-SLDS Sparse Linear Discriminant Analysis in Structured Covariates Space Sandra Safo and Qi Long |
||
199 SS1-SLDS Informative Priors and Bayesian Computation Shirin Golchi, |
||
230 SS1-SLDS Causal structure learning with reduced partial correlation thresholding Arjun Sondhi and Ali Shojaie |
||
12:20-13:30 |
Lunch |
|
13:30-15:30 |
High-dimensional data [in Salles Maisonneuve A] |
|
253 research Infinite Langevin Mixture Modeling and Feature Selection, |
||
29 research Efficient Identification of Tanimoto Nearest Neighbors, David Anastasiu, George Karypis, |
||
37 research Parallel Least-Squares Policy Iteration, Jun-Kun Wang, Shou-De Lin, |
||
56 research Dilation of Chisini-Jensen-Shannon Divergence, Piyush Sharma and Gary Holness |
||
100 research Projecting "better than randomly": How to reduce the dimensionality of very large datasets in a way that outperforms random projections, |
||
13:30-15:30 |
Social Media and Crowd [in Salles Maisonneuve B, C] |
|
18 research Task Composition in Crowdsourcing, |
||
272 research On the Role of Mentions on Tweet Virality, Soumajit Pramanik, Maximilien Danisch, Qinna Wang, Anand Kumar, Sumanth Bandi, Jean-Loup Guillaume and Bivas Mitra |
||
60 applications Mining Pre-Exposure Prophylaxis Trends in Social Media, |
||
112 research Overlapping Target Event and Story Line Detection of Online Newspaper Articles, |
||
117 research Online Collaborative Prediction of Regional Vote Results, |
||
13:30-15:30 |
SS2: Health Data Science [in Salles Maisonneuve D] |
|
305 SS1-SLDS Nonparametric Adjoint-Based Inference for Stochastic Differential Equations Harish S. Bhat, R. W. M. A. Madushani, |
||
118 SS2-HDS Actitracker: A Smartphone-based Activity Recognition System for Improving Health and Well-Being Gary Weiss, Jeffrey Lockhart, Tony Pulickal, Paul McHugh, Isaac Ronan and Jessica Timko |
||
258 SS2-HDS The Highly Adaptive Lasso Estimator David Benkeser, Mark van der Laan, |
||
285 SS2-HDS Meeting Health Care Research Needs in a Kimball Integrated Data Warehouse |
||
309 SS2-HDS MedCare: Leveraging Medication Similarity for Disease Prediction Dipanwita Dasgupta, Nitesh V. Chawla |
||
13:30-15:30 |
SS3: Environmental and Geo-spatial Data Analytics [in Salles Maisonneuve E, F] |
|
52: Reserve Price Optimization at Scale, |
||
164 SS3-EnGeoData Efficient Large Scale Clustering based on Data Partitioning |
||
200 SS3-EnGeoData Traffic Risk Mining Using Partially Ordered Non-negative Matrix Factorization |
||
219 SS3-EnGeoData On the Use of Ontology as a priori Knowledge into Constrained Clustering |
||
227 SS3-EnGeoData Maritime Pattern Extraction from AIS data using a Genetic Algorithm |
||
15:30-15:50 |
Break |
|
15:50-17:00 |
Industry Keynote: Xin Fu (LinkedIn): Path to 400M Members: LinkedIn’s Data Powered Journey [in Salle De Bal - Lower Lobby] |
|
17:00-18:00 |
Industry invited talk: Abesh Bhattacharjee (InfoSys): Predictive Maintenance of Automobile Parts using distributed data store and classification techniques [in Salle De Bal - Lower Lobby] |
|
18:00-20:00 |
Poster+Reception [in Viger - Lower Lobby] |
|
October 18 |
||
9:00-10:00 |
Keynote: Juliana Freire - Democratizing Urban Data Analysis [in Salle De Bal - Lower Lobby] |
|
10:00-10:20 |
Break |
|
10:20-12:20 |
Temporal Analytics [in Salles Maisonneuve A] |
|
114 research Continuous Monitoring of A/B Tests without Pain: Optional Stopping in Bayesian Testing, Alex Deng, Jiannan Lu, Shouyuan Chen, |
||
304 research Learning Temporal Dependence from Time-Series Data with Latent Variables, Baosen Zhang, Hossein Hosseini, Radha Poovendran, Sreeram Kannan, |
||
243 research Trend Detection based Regret Minimization for Bandit Problems, Paresh Nakhe, Rebecca Reiffenhaeuser, |
||
262 applications A Symbolic Tree Model for Oil and Gas Production Prediction Using Time-Series Production Data, Bingjie Wei, Helen Pinto, Xin Wang, |
||
128 research Resampling Strategies for Imbalanced Time Series, Luis Torgo, Nuno Moniz, Paula Branco |
||
10:20-12:20 |
Scale [in Salles Maisonneuve B, C] |
|
70 research Performance Improvement of MapReduce Process Using Limited Node Block Placement Policy, Sungchul Lee, Juyeon Jo and Yoohwan Kim |
||
149 research Closest Interval Join Using MapReduce, Qiang Zhang, Andy He, Chris Liu and Eric Lo |
||
290 research EM*: An EM algorithm for Big Data, Hasan Kurban, Mark Jenne, Mehmet M. Dalkilic, |
||
36 research Efficient Sampling-based ADMM for Distributed Data, Jun-Kun Wang, Shou-De Lin, |
||
298 research A Parallel Framework for Grid-based Bottom-up Subspace Clustering, Poonam Goyal, Sonal Kumari, Shubham Singh, Vivek Choudhary, Sundar Balasubramaniam and Navneet Goyal |
||
10:20-12:20 |
SS4: Emotion and Sentiment in Intelligent Systems and Big Social Data Analysis [in Salles Maisonneuve D] |
|
105 SS4-SentISData Connecting Opinions to Opinion-Leaders: A case study on Brazilian political protests |
||
129 SS4-SentISData Exploiting a Bootstrapping Approach for Annotating Emotions in Texts Automatically |
||
143 SS4-SentISData An Anatomy of Hate: Identifying Hate Speech in Social Media Brian Carignan, Howard Needham, |
||
251 SS4-SentISData Senpy: A Pragmatic Linked Sentiment Analysis Framework |
||
287 SS4-SentISData Word Segmentation Algorithms and Combined Lexical Resources for Identifying Hashtag Types Credell Simeon, Howard Hamilton, Robert Hilderman, |
||
10:20-12:20 |
Tutorial 1: Continuous Measurement of Quality of Data Streams [in Salles Maisonneuve E, F] |
|
12:20-13:30 |
Lunch |
|
13:30-15:30 |
Search and Mining [in Salles Maisonneuve A] |
|
103 research Impact of Query Sample Selection Bias on Information Retrieval System Ranking, Massimo Melucci, |
||
295 research Mining Research Problems from Scientific Literature, Chanakya Aalla, Vikram Pudi, |
||
80 research Perceived, Projected, and True Investment Expertise: Not All Experts Provide Expert Recommendations, Amit Shavit, Sameena Shah, |
||
189 applications A Multi-granularity Pattern-based Sequence Classification Framework for Educational Data, Mohammad Jaber, Peter Wood, Panagiotis Papapetrou and Ana González-Marcos |
||
17: Testing Interestingness Measures in Practice: A Large-Scale Analysis of Buying Patterns, Martin Kirchgessner, Vincent Leroy, Sihem Amer-Yahia, Sashwat Mishra |
||
13:30-15:30 |
Relational and Structured Data [in Salles Maisonneuve B, C] |
|
98 research Inconsistent Node Flattening for Improving Top-down Hierarchical Classification, Azad Naik, Huzefa Rangwala, |
||
225 research Learning Multifaceted Latent Activities from Heterogeneous Mobile Data, |
||
316 research The synthetic data vault, Neha Patki, Roy Wedge and Kalyan Veeramachaneni |
||
58 research A Decision Tree-based Approach for Categorizing Spatial Database Query Results, |
||
245 research The Semantic Knowledge Graph: A compact, auto-generated model for real-time traversal and ranking of any relationship within a domain, |
||
13:30-15:30 |
SS5: Game Data Science + SS6: Stat and Math Tools for DM [in Salles Maisonneuve D] |
|
69 SS5-GDS Using players' Gameplay Action-Decision Profiles to prescribe training: Reducing training costs with Serious Games Analytics Christian Loh, I-Hung Li, |
||
186 SS5-GDS What did I do Wrong in my MOBA Game?: Mining Patterns Discriminating Deviant Behaviours |
||
212 SS5-GDS On The "Tiny yet Real Happiness" Phenomenon in The Mobile Games Market |
||
139 SS6-SMTDM The Uniqueness and Greedy Method for Quadratic Compressive Sensing Jun Fan, Lingchen Kong, Liqun Wang, Naihua Xiu, |
||
274 SS6-SMTDM Robust Online Time Series Prediction with Recurrent Neural Networks |
||
13:30-14:30 |
Tutorial 1: Continuous Measurement of Quality of Data Streams (continued) [in Salles Maisonneuve E, F] |
|
15:30-15:50 |
Break |
|
15:50-17:30 |
Trends & Controversy + Panel [in Salle De Bal - Lower Lobby] |
|
19:00-21:00 |
Banquet [in Viger - Lower Lobby] |
|
October 19 |
||
8:30-9:00 |
Award ceremony + DSAA17 [in Salle De Bal - Lower Lobby] |
|
9:00-10:00 |
Keynote: David Donoho [in Salle De Bal - Lower Lobby] |
|
10:00-10:20 |
Break |
|
10:20-12:20 |
Predictive Analytics [in Salles Maisonneuve A] |
|
293 research Prediction engineering: Enabling agile predictive analytics, |
||
310 research Trane: A Language to Express Predictive Problems, Benjamin Schreck, Kalyan Veeramachaneni, |
||
250 applications Detecting Inaccurate Predictions of Pediatric Surgical Durations, |
||
188 applications Advanced Analytics for Train Delay Prediction Systems by Including Exogenous Weather Data, |
||
82 applications Waiting to be Sold: Prediction of Time-Dependent House Selling Probability, Mansurul Bhuiyan, Mohammad Al Hasan |
||
10:20-12:20 |
SS7: Data Science for Agricultural Decision Support Systems [in Salles Maisonneuve B, C] |
|
SS7-DS4ADSS Data science and digital agriculture at The Climate Corporation, Steve Sain (invited) |
||
185 SS7-DS4ADSS Disease Detection and Severity Estimation in Cotton Plant from Unconstrained Images |
||
273 SS7-DS4ADSS Digital Knowledge Ecosystem for achieving Sustainable Agriculture Production: A Case Study from Sri Lanka |
||
10:20-12:20 |
Tutorial 2: Model Selection and Error Estimation without the Agonizing Pain [in Salles Maisonneuve D] |
|
10:20-12:20 |
Tutorial 3: Similarity Search on Time Series Data: Past, Present and Future [in Salles Maisonneuve E, F] |
|
12:20-13:30 |
Lunch |
|
13:30-15:30 |
Business Intelligence [in Salles Maisonneuve A] |
|
84 applications BOTS: Behavior-oriented Optimal Time Segmentation for Personalized Rules of Mobile Phone Users, Iqbal Sarker, |
||
48 applications Customer Simulation for Direct Marketing Experiments, |
||
124 applications Online Experimentation Diagnosis and Troubleshooting Beyond AA Validation, |
||
140 applications Role Models: Mining Role Transitions Data in IT Project Management, |
||
92 applications Deconstructing Domain Names to Reveal Latent Topics, Cheryl Flynn, Kenneth Shirley, Wei Wang |
||
13:30-15:30 |
SS8: Big Behavioral Data Analytics [in Salles Maisonneuve B, C] |
|
232: Uncovering the Bitcoin blockchain: an analysis of the full users graph, |
||
146 SS8-BBDA Data-driven Sales Leads Prediction for Everything-as-a-Service in the Cloud |
||
159 SS8-BBDA Churn Prediction in Mobile Social Games: Towards a Complete Assessment Using Survival Ensembles, Africa Perianez, Alain Saas, Anna Guitart and Colin Magne |
||
209 SS8-BBDA Web Behavior Analysis Using Sparse Non-Negative Matrix Factorization |
||
218 SS8-BBDA EBM: Evidence-Based Behavioral Model for Calendar Schedules of Individual Mobile Users |
||
13:30-14:30 |
Tutorial 2: Model Selection and Error Estimation without the Agonizing Pain (continued) [in Salles Maisonneuve D] |
|
13:30-14:30 |
Tutorial 3: Similarity Search on Time Series Data: Past, Present and Future (continued) [in Salles Maisonneuve E, F] |
|
15:30-15:50 |
Break |
|
Excursion |
||
15:30-19:00 |