Proceedings of the 2019 International Conference on Data Science

ICDATA'19 Table of Contents

Editors: Robert Stahlbock, Gary M. Weiss, Mahmoud Abou-Nasr
ISBN: 1-60132-502-9 | Copyright © 2019 CSREA Press | United States of America


Ensemble Learning for Predicting Multiple Sclerosis Disease Course 3-8
Yijun Zhao, Tanuja Chitnis, Tung Doan
Deep Convolutional Autoencoder for Recovering Defocused License Plates and Smudged Fingerprints 9-15
Yijun Zhao, Stephen Lebak
Analysis Framework to Investigate Power-Failure Events and Their Causes 16-20
Vivian Sultan, Brian Hilton
Optimal Route Planning for Probabilistic Landscape Exploration 21-27
Ming Tony Shing, K. Y. Michael Wong
Using Apple Store Dataset to Predict User Rating of Mobile Applications 28-33
Kevin Daimi, Noha Hazzazi
On Data Fusion Methodologies For Spontaneous and Solicited Safety Data Evaluation 34-38
Hal Li, William Wang
Statistical Assessment of Physicochemical Properties of Protein Tertiary Structure 39-45
Deok Hee Nam
A Competing Values Approach to Business Intelligence 46-49
Jonathan Fowler
Patterns of Human Mobility 50-55
Shijun Tang, Sihai Tang
LK-Means Algorithm for Evaluation of the Behavior of University Students in Social Networks Related to Cyberbullying and Netiquetas Topics 56-62
Cota Ortiz Maria de Guadalupe, Ojeda Cota Maria de Guadalupe, Florez Perez Pedro
Which Grid Infrastructure Needs Utilities' Immediate Attention to Reduce the Risk of Power Outages? 63-69
Vivian Sultan
Police Precinct Optimization: Using Distance as an Evaluation Metric 70-73
Alyssa Grubbs, Brent Lodge, Qingguo Wang
Personal Big Data Computing Platform for Deep Learning: Implementation and Performance Benchmark 74-77
Chien-Heng Wu, Yu-Feng Chung, Wen-Yi Chang, Whey-Fone Tsai
Using Tags and Comments to Understand Gender Difference in the Online Evaluation from 78-83
Ting Liu, Yuwei Chen, Logan Brandt
Data Science and Security in Digital Governance Aspects and an Elastic Bus Transportation Scheme 84-90
Movses Musaelian, Md Zakirul Alam Bhuiyan, Gary M. Weiss, Tian Wang, Aliuz Zaman, Thaier Hayajneh


Application of Artificial Neural Network on Speech Signal Features for Parkinson's Disease Classification 93-99
John Wu, Xin Ye, Shaun-inn Wu
Fine-Tuning Naive Bayes for Imbalanced Datasets 100-105
Fahad Alenazi, Khalil El Hindi, Basil AsSadhan
Identifying Classification Algorithms Most Suitable for Imbalanced Data 106-111
Ray Marie Tischio, Gary M. Weiss
A Random Under-sampled Deep Architecture with Medical Event Embedding: Highly Imbalanced Rare Disease Classification with EHR Data 112-118
Yan Hu, Feng Chen, Yong Cai, Yilian Yuan
Analysis of Binary Classification Metrics Using Empirical Distributions of Prediction Scores 119-122
Andrei Chtcheprov, Srinivas Krovvidy, Hernando Vera
Parameter Optimization of RBF Kernel SVM from miniCV 123-129
Li-Chia Yeh, Chung-Chin Lu
A Prediction Model for Backpack Programs 130-133
Derrick N. Black, Liping Liu, Seong-Tae Kim, Lauren Davis


Consistency Constraints for Overlapping Data Clustering 137-143
Jakob Hansen, Jared Culbertson, Peter F. Stiller, Dan P. Guralnik
The Sequence Prediction Model of Latent Variable Conditional Random Fields 144-149
Jimmy Ming-Tai Wu, Jerry Chun-Wei Lin, Yinan Shao, Matin Pirouz
A Neural-Encoded Mention-Hypergraph Model for Mention Recognition 150-156
Jimmy Ming-Tai Wu, Jerry Chun-Wei Lin, Yinan Shao, Matin Pirouz
Twitter Streaming API Data Collection for Infrequent Keywords 157-163
Traian Marius Truta, Parker Kain, Tobel Atnafu, Alina Campan, Joseph Nolan, Alyssa Appelman
Feature Extraction through Deepwalk on Weighted Graph 164-170
Jayesh Soni, Nagarajan Prabakar, Himanshu Upadhyay
Graph based Version for Clustering Texts in Current Affair Domain 171-174
Taeho Jo


Apply Machine Learning to Detect and Diagnose Faults in Multi-Array PV Plants 177-178
Chung-Chian Hsu, Jia-Long Li, Arthur Chang, Yu-Sheng Chen
Prediction of Asset Price using News Corpus Data in a Time Series Analysis 179-180
Arkajyoti Chakraborty
A Computer Model to Study Users' Information Behavior 181-183
Paulo Hideo Ohtoshi, Claudio Gottschalg-Duque
Breast Cancer Dataset Analytics 184-192
Kevin Daimi


An Event Driven Sentiment Detection Method for Correlation Analysis of Bitcoin Market 195-200
Chung-Hong Lee, Hsin-Chang Yang, Yong-Lin Chuang, Shuo-Hsin Huang, Po-Hao Chen, Hong-Jie Dai
Cloud-based Big Data Platforms and Tools for Data Analytics in the Big Data Engineering Curriculum 201-206
Yuri Demchenko, Oleg Chertov
Data Quality Assessment and Problem Severity Assessment for Data Cleaning 207-210
Hong Liu, Jongyeop Kim
Measuring the Impact of Integrated Delivery Networks on Physicians' Prescribing Preference 211-214
Wenzhe Lu, Yong Cai
360 Degree Technology as a Gateway for Immersive Psychotherapy Applications: An Intelligent Patent Mining Analysis 215-218
Usharani Hareesh Govindarajan, Amy J.C. Trappey, Charles V. Trappey
Predictive Modeling of an Unbalanced Binary Outcome in Food Insecurity Data 219-225
Jonathan Fabish, Lauren Davis, Seong-Tae Kim


Administered by
Universal Conference Management Systems & Support (UCMSS)
** San Diego, California, USA **