TY - GEN
T1 - Enabling non-expert users to apply data mining for bridging the big data divide
AU - Espinosa, Roberto
AU - García-Saiz, Diego
AU - Zorrilla, Marta
AU - Zubcoff, Jose Jacobo
AU - Mazón, Jose Norberto
N1 - Publisher Copyright:
© IFIP International Federation for Information Processing 2015.
PY - 2015
Y1 - 2015
N2 - Non-expert users find complex to gain richer insights into the increasingly amount of available heterogeneous data, the so called big data. Advanced data analysis techniques, such as data mining, are difficult to apply due to the fact that (i) a great number of data mining algorithms can be applied to solve the same problem, and (ii) correctly applying data mining techniques always requires dealing with the inherent features of the data source. Therefore, we are attending a novel scenario in which non-experts are unable to take advantage of big data, while data mining experts do: the big data divide. In order to bridge this gap, we propose an approach to offer non-expert miners a tool that just by uploading their data sets, return them the more accurate mining pattern without dealing with algorithms or settings, thanks to the use of a data mining algorithm recommender. We also incorporate a previous task to help non-expert users to specify data mining requirements and a later task in which users are guided in interpreting data mining results. Furthermore, we experimentally test the feasibility of our approach, in particular, the method to build recommenders in an educational context, where instructors of e-learning courses are non-expert data miners who need to discover how their courses are used in order to make informed decisions to improve them.
AB - Non-expert users find complex to gain richer insights into the increasingly amount of available heterogeneous data, the so called big data. Advanced data analysis techniques, such as data mining, are difficult to apply due to the fact that (i) a great number of data mining algorithms can be applied to solve the same problem, and (ii) correctly applying data mining techniques always requires dealing with the inherent features of the data source. Therefore, we are attending a novel scenario in which non-experts are unable to take advantage of big data, while data mining experts do: the big data divide. In order to bridge this gap, we propose an approach to offer non-expert miners a tool that just by uploading their data sets, return them the more accurate mining pattern without dealing with algorithms or settings, thanks to the use of a data mining algorithm recommender. We also incorporate a previous task to help non-expert users to specify data mining requirements and a later task in which users are guided in interpreting data mining results. Furthermore, we experimentally test the feasibility of our approach, in particular, the method to build recommenders in an educational context, where instructors of e-learning courses are non-expert data miners who need to discover how their courses are used in order to make informed decisions to improve them.
KW - Big data
KW - Data mining
KW - Knowledge base
KW - Meta-learning
KW - Model-driven development
KW - Recommender
UR - https://www.scopus.com/pages/publications/84923666947
U2 - 10.1007/978-3-662-46436-6_4
DO - 10.1007/978-3-662-46436-6_4
M3 - Conference contribution
AN - SCOPUS:84923666947
T3 - Lecture Notes in Business Information Processing
SP - 65
EP - 86
BT - Data-Driven Process Discovery and Analysis - 3rd IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2013, Revised Selected Papers
A2 - Accorsi, Rafael
A2 - Cudre-Mauroux, Philippe
A2 - Ceravolo, Paolo
PB - Springer Nature
T2 - 3rd IFIP WG 2.6, 2.12 International Symposium on Data-driven Process Discovery and Analysis, SIMPDA 2013
Y2 - 30 August 2013 through 30 August 2013
ER -