Browse > Article
http://dx.doi.org/10.7236/JIIBC.2019.19.2.151

Bigdata Prediction Support Service for Citizen Data Scientists  

Chang, Jae-Young (Dept. of Computer Engineering, Hansung University)
Publication Information
The Journal of the Institute of Internet, Broadcasting and Communication / v.19, no.2, 2019 , pp. 151-159 More about this Journal
Abstract
As the era of big data, which is the foundation of the fourth industry, has come, most related industries are developing related solutions focusing on the technologies of data storage, statistical analysis and visualization. However, for the diffusion of bigdata technology, it is necessary to develop the prediction analysis technologies using artificial intelligence. But these advanced technologies are only possible by some experts now called data scientists. For big data-related industries to develop, a non-expert, called a citizen data scientist, should be able to easily access the big data analysis process at low cost because they have insight into their own data. In this paper, we propose a system for analyzing bigdata and building business models with the support of easy-to-use analysis system without knowledge of high-level data science. We also define the necessary components and environment for the prediction analysis system and present the overall service plan.
Keywords
Citizen Data Scientist; Bigdata; Prediction Analysis; Data Blending; Feature Engineering;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Jim X. Chen. "The evolution of computing: AlphaGo." Computing in Science & Engineering 18.4 (2016): 4-7.   DOI
2 Gary Miner, John Elder IV, and Thomas Hill. Practical text mining and statistical analysis for non-structured text data applications. Academic Press, 2012.
3 Greg Linden, Brent Smith, and Jeremy York. "Amazon. com recommendations: Item-to-item collaborative filtering." IEEE Internet computing 1 (2003): 76-80. DOI: 10.1109/MIC.2003.1167344   DOI
4 Jorg Rech, "Discovering trends in software engineering with google trend." ACM SIGSOFT Software Engineering Notes 32.2 (2007): 1-2. DOI: 10.1145/1234741.1234765
5 Steve Banker, "The Citizen Data Scientist", Article of Forbes, Available via https://www.forbes.com/sites/stevebanker/2018/01/19/the-citizen-data-scientist
6 https://www.datawatch.com/what-is-data-blending/
7 Robert Hoyt, et al. "IBM Watson analytics: automating visualization, descriptive, and predictive statistics." JMIR public health and surveillance 2.2 (2016).
8 Sam Scott and Stan Matwin. "Feature engineering for text classification." ICML. Vol. 99. 1999.
9 David W. Cearley and Carl Claunch. "The top 10 strategic technology trends for 2013." The Top 10 (2012).
10 Jacky Akoka and Isabelle Comyn-Wattiau. "A Method for Emerging Technology Evaluation. Application to Blockchain and Smart Data Discovery." Conceptual Modeling Perspectives. Springer, Cham, 2017. 247-258. DOI: https://doi.org/10.1007/978-3-319-67271-7_17
11 Roger Barga, et al. Predictive analytics with Microsoft Azure machine learning. Apress, 2015. DOI: 10.1007/978-1-4842-0445-0
12 Amazon. Available via https://aws.amazon.com/aml/
13 Wikipedia. Available via https://en.wikipedia.org/wiki/Data_science
14 Y-lan Boureau and Yann L. Cun. "Sparse feature learning for deep belief networks." Advances in neural information processing systems. 2008.
15 K. Lee and J. Lee, "A Classification of Medical and Advertising Blogs Using Machine Learning", Journal of the Korea Academia-Industrial cooperation Society(JKAIS), Vol. 19, No. 11, 2018.
16 Jie Zhou and Wei Xu. "End-to-end learning of semantic role labeling using recurrent neural networks." Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing . Vol. 1. 2015. DOI: 10.3115/v1/P15-1109   DOI
17 William Vorhies. "Data Scientists Automated and Unemployed by 2025!", Available via https://www.datasciencecentral.com/profile/WilliamVorhies
18 Natalia Miloslavskaya and Alexander Tolstoy. "Big data, fast data and data lake concepts." Procedia Computer Science 88 (2016): 300-305. DOI: https://linkinghub.elsevier.com/retrieve/pii/S1877050916316957   DOI
19 Amanda Casari, Alice Zheng, Feature Engineering for Machine Learning, O'Reilly Media, Inc.
20 J. Chang. "Feature-Based Summarization for a Large Opinion Documents Collection", Journal of the Institute of Internet, Broadcasting and Communication, Vol. 16, No. 1, 2016.
21 K, Ko, D. Whang, S. Park, and K. Moon, "Electrical fire prediction model study using machine learning", The Journal of KIIECT, Vol. 11, No. 6, 2018.