Search | Korea Science

Big data distributed processing system using RHadoop (RHadoop을 이용한 빅데이터 분산처리 시스템)

Shin, Ji Eun;Jung, Byung Ho;Lim, Dong Hoon
- Journal of the Korean Data and Information Science Society / v.26 no.5 / pp.1155-1166 / 2015
It is almost impossible to store or analyze exponentially growing big data with traditional technologies, and Hadoop is a newer technology that makes this possible. Recently, R has been used as an engine for big data analysis based on distributed processing with Hadoop. Using RHadoop, which integrates the R and Hadoop environments, we implemented parallel multiple regression analysis on real and simulated data of various sizes. Experimental results showed that our RHadoop system became faster as the number of data nodes increased. We also compared the performance of our RHadoop system with the lm function and with the biglm package available with bigmemory. The results showed that RHadoop was faster than the other packages owing to parallel processing, with the number of map tasks increasing as the data size increases.
https://doi.org/10.7465/jkdi.2015.26.5.1155
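The abstract above does not say how the regression was parallelized. As a rough illustration only, one common way to fit multiple regression in a map/reduce style is to have each map task compute partial sums of X'X and X'y for its chunk, add them in the reduce step, and solve the normal equations once. The sketch below is plain Python, not R or RHadoop, and every function name in it (`map_chunk`, `reduce_pair`, `solve`, `fit`) is hypothetical, not taken from the paper.

```python
from functools import reduce

def map_chunk(rows):
    """Map step: partial X'X and X'y for one chunk of (x_vector, y) rows."""
    p = len(rows[0][0]) + 1  # +1 for the intercept column
    xtx = [[0.0] * p for _ in range(p)]
    xty = [0.0] * p
    for x, y in rows:
        xi = [1.0] + list(x)
        for a in range(p):
            xty[a] += xi[a] * y
            for b in range(p):
                xtx[a][b] += xi[a] * xi[b]
    return xtx, xty

def reduce_pair(left, right):
    """Reduce step: element-wise sum of two partial results."""
    xtx = [[l + r for l, r in zip(lrow, rrow)] for lrow, rrow in zip(left[0], right[0])]
    xty = [l + r for l, r in zip(left[1], right[1])]
    return xtx, xty

def solve(a, b):
    """Solve a @ beta = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    beta = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * beta[c] for c in range(r + 1, n))
        beta[r] = (m[r][n] - s) / m[r][r]
    return beta

def fit(chunks):
    """Combine per-chunk sums, then solve the normal equations once."""
    xtx, xty = reduce(reduce_pair, map(map_chunk, chunks))
    return solve(xtx, xty)

# Toy data generated from y = 1 + 2*x1 + 3*x2 (no noise), split into two chunks.
data = [((x1, x2), 1 + 2 * x1 + 3 * x2) for x1 in range(4) for x2 in range(3)]
chunks = [data[:6], data[6:]]
beta = fit(chunks)
```

Because the partial sums simply add, the chunks can be processed on separate nodes in any order, which is consistent with the abstract's observation that more map tasks help as the data grows.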

A Case Study on Credit Analysis System in P2P: 8Percent, Lendit, Honest Fund (P2P 플랫폼에서의 대출자 신용분석 사례연구: 8퍼센트, 렌딧, 어니스트 펀드)

Choi, Su Man;Jun, Dong Hwa;Oh, Kyong Joo
- Knowledge Management Research / v.21 no.3 / pp.229-247 / 2020
Amid the remarkable growth of P2P financial platforms in the field of knowledge management, only companies with big data and machine learning technologies are surviving the fierce competition. The ability to analyze borrowers' credit is the most important capability, and platform companies, recognizing it as their most important business asset, are building credit evaluation systems based on artificial intelligence. Nonetheless, online P2P platform providers act only as intermediaries between investors and borrowers, and all risks associated with the investments are borne by investors. For investors, the only way to verify the safety of investment products is to rely on the reputation of P2P companies in newspapers and on websites. Time series information such as the delinquency rate is not enough to evaluate the credit analysis capability of early-stage Korean P2P companies. Using the case analysis method, this study examines the AI-based credit analysis procedures of the three well-known leading companies focusing on the credit lending market, and the kinds of data they use. Through this, we aim to improve the understanding of AI-based credit analysis techniques and to examine their limitations.
https://doi.org/10.15813/kmr.2020.21.3.013

Bio-Sensing Convergence Big Data Computing Architecture (바이오센싱 융합 빅데이터 컴퓨팅 아키텍처)

Ko, Myung-Sook;Lee, Tae-Gyu
- KIPS Transactions on Software and Data Engineering / v.7 no.2 / pp.43-50 / 2018
Biometric information computing, built on bio-information systems that combine bio-signal sensors with bio-information processing, is greatly influencing both computing systems and big data systems. Unlike conventional data formats such as text, images, and videos, biometric information takes a complex form: text-based values give meaning to a bio-signal, important event moments are stored as images, and video-like formats are constructed for data prediction and analysis through time series analysis. Depending on the characteristics of the data required by an individual biometric information application service, such complex data may be requested separately as text, image, or video, or in several formats simultaneously. Because previous bio-information processing systems depend on conventional computing components, computing structures, and data processing methods, they are inefficient in data processing performance, transmission capability, storage efficiency, and system safety. In this study, we propose an improved bio-sensing convergence big data computing architecture for building a platform that effectively supports biometric information processing. The proposed architecture supports efficient data storage and transmission, computing performance, and system stability. It can also lay the foundation for system implementation and service optimization for future biometric information computing.
https://doi.org/10.3745/KTSDE.2018.7.2.43

Performance Evaluation of Medical Big Data Analysis based on RHadoop (RHadoop 기반 보건의료 빅데이터 분석의 성능 평가)

Ryu, Woo-Seok
- The Journal of the Korea institute of electronic communication sciences / v.13 no.1 / pp.207-212 / 2018
As a data analysis tool gaining popularity in the big data era, R is rapidly expanding its user base by providing powerful statistical analysis and data visualization functions. A major advantage of R is its functional extensibility based on open source, but its ability to scale to large data is limited, resulting in performance degradation in large-scale data processing. RHadoop, one of the extension packages that address this limitation, can improve data analysis performance because it supports Hadoop-based distributed processing of programs written in R. In this paper, we assess the validity of RHadoop by evaluating its performance improvement in the analysis of real medical big data. A performance evaluation that analyzes medical history information provided by the National Health Insurance Service with R and with RHadoop shows that an RHadoop cluster composed of 8 data nodes can improve performance by up to 8 times compared with R.
https://doi.org/10.13067/JKIECS.2018.13.1.207

Decision Program for Advertisement Web Posts (광고성 웹 게시글 판단 프로그램)

Bae, Ji-Seon;Oh, Ye-Rim;Kim, Chae-won;Park, Ji-Won;Hong, Jin-Keun;Yoon, Hyung-Ki
- Proceedings of the Korea Information Processing Society Conference / 2021.11a / pp.1334-1336 / 2021
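Often, when searching on a web platform, text that lets the reader judge whether a post is an advertisement appears only at the end of the post. Such text can cloud users' judgment, and the need for improvement has been raised. This paper therefore studies an environment that enables users to judge quickly whether a web post is promotional. We built and published a program that finds advertisement-related phrases contained in a post and presents that information at the top of the page, so that users can judge whether the post is an advertisement.
https://doi.org/10.3745/PKIPS.y2021m11a.1334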
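The approach described in the abstract above (find advertisement-related phrases in a post, then surface that information at the top) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' program: the phrase list `AD_PHRASES`, the function names, and the banner format are all hypothetical, and a real tool would operate on the page's HTML rather than on plain text.

```python
# Hypothetical disclosure phrases; the paper's actual list is not given in the abstract.
AD_PHRASES = ("sponsored", "paid partnership", "received compensation")

def find_ad_phrases(post_text, phrases=AD_PHRASES):
    """Return the disclosure phrases that occur anywhere in the post (case-insensitive)."""
    lowered = post_text.lower()
    return [p for p in phrases if p in lowered]

def annotate(post_text, phrases=AD_PHRASES):
    """Prepend a notice when the post contains at least one disclosure phrase."""
    hits = find_ad_phrases(post_text, phrases)
    if not hits:
        return post_text
    banner = "[Possible advertisement: " + ", ".join(hits) + "]"
    return banner + "\n" + post_text

example = annotate("Great cafe, lovely view.\nThis post was Sponsored by the owner.")
```

Simple substring matching keeps the sketch short; it would miss paraphrased disclosures, so the phrase list is the main design decision in practice.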

Renewable energy trends and relationship structure by SNS big data analysis (SNS 빅데이터 분석을 통한 재생에너지 동향 및 관계구조)

Jong-Min Kim
- Convergence Security Journal / v.22 no.1 / pp.55-60 / 2022
This study analyzes trends and relational structures in the energy sector related to renewable energy. To this end, we focused on big data, including SNS data. Renewable energy hashtags were collected from the Instagram platform and analyzed using word embedding for big data analysis and social network analysis. The results derived from this research are expected to be useful for the development of the renewable energy industry.
https://doi.org/10.33778/kcsa.2022.22.1.055
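The abstract above does not describe its pipeline in detail. As a hedged illustration of one building block that social network analysis of hashtags typically relies on, the sketch below counts how often two hashtags appear in the same post; the resulting pair counts can serve as edge weights in a hashtag network. The function names and sample posts are hypothetical, and this is not the word embedding step mentioned in the abstract.

```python
import re
from collections import Counter
from itertools import combinations

def hashtags(post):
    """Extract the distinct hashtags in a post, lowercased and sorted."""
    return sorted(set(tag.lower() for tag in re.findall(r"#(\w+)", post)))

def cooccurrence(posts):
    """Count, for every pair of hashtags, the number of posts containing both."""
    edges = Counter()
    for post in posts:
        for a, b in combinations(hashtags(post), 2):
            edges[(a, b)] += 1
    return edges

# Made-up sample posts for illustration only.
posts = ["#solar #wind power", "#Solar and #Wind #energy", "#energy only"]
edges = cooccurrence(posts)
```

Sorting the tags before pairing means each pair is stored under a single key regardless of the order in which the tags appear in a post.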

Big Data Refining System for Environmental Sensor of Continuous Manufacturing Process using IIoT Middleware Platform (IIoT 미들웨어 플랫폼을 활용한 연속 제조공정의 환경센서 빅데이터 정제시스템)

Yoon, Yeo-Jin;Kim, Tea-Hyung;Lee, Jun-Hee;Kim, Young-Gon
- The Journal of the Institute of Internet, Broadcasting and Communication / v.18 no.4 / pp.219-226 / 2018
IIoT (Industrial Internet of Things) means that all manufacturing processes are informatized, going beyond conventional process automation. The objective of such a system is to build an information system based on the data collected from sensors installed in each process and to maintain optimal productivity by managing and automating each process in real time. Data collected from the sensors in each process is unstructured, and many studies have been conducted on collecting and processing such unstructured data effectively. In this paper, we propose a system that uses Node-RED as middleware for effective big data collection and processing.
https://doi.org/10.7236/JIIBC.2018.18.4.219

A Study on Big Data Processing Technology Based on Open Source for Expansion of LIMS (실험실정보관리시스템의 확장을 위한 오픈 소스 기반의 빅데이터 처리 기술에 관한 연구)

Kim, Soon-Gohn
- The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.14 no.2 / pp.161-167 / 2021
A Laboratory Information Management System (LIMS) is a centralized database for storing, processing, retrieving, and analyzing laboratory data; the term refers to a computer system specially designed for laboratories performing inspection, analysis, and testing tasks. In particular, a LIMS provides functions that support laboratory operation, and it requires workflow management or data tracking support. In this paper, we collect data from websites and various channels using crawling, one of the automated big data collection technologies, to support laboratory operation. Among the collected test methods and contents, those that the tester can usefully apply are recommended. In addition, we implement a complementary LIMS platform capable of verifying the collection channels by managing feedback.
https://doi.org/10.17661/jkiiect.2021.14.2.161

Real-time Monitoring System for Rotating Machinery with IoT-based Cloud Platform (회전기계류 상태 실시간 진단을 위한 IoT 기반 클라우드 플랫폼 개발)

Jeong, Haedong;Kim, Suhyun;Woo, Sunhee;Kim, Songhyun;Lee, Seungchul
- Transactions of the Korean Society of Mechanical Engineers A / v.41 no.6 / pp.517-524 / 2017
The objective of this research is to improve the efficiency of data collection from many machine components on smart factory floors using IoT (Internet of Things) techniques and a cloud platform, and to make it easy to update outdated diagnostic schemes through online deployment from cloud resources. The short-term analysis is implemented on a micro-controller and includes machine-learning algorithms for inferring snapshot information about the machine components. For long-term analysis, time-series and high-dimensional data are used for root cause analysis by combining a cloud platform with multivariate analysis techniques. The diagnostic results are visualized in a web-based dashboard for unconstrained user access. The implementation is demonstrated to assess its performance in data acquisition and analysis for rotating machinery.
https://doi.org/10.3795/KSME-A.2017.41.6.517

A Study on the Energy Platform to Reduce Carbon Emissions (탄소배출 저감을 위한 에너지 플랫폼 연구)

Beom-seok Cha;Hyung-Jin Moon;Woojin Wi;Gab-Sang Ryu
- Journal of Internet of Things and Convergence / v.10 no.2 / pp.43-50 / 2024
This manuscript proposes an artificial intelligence (AI)-based energy platform system that uses existing energy efficiently rather than creating new energy sources. To this end, it collects data from public data portals and statistics data portals, including energy consumption and greenhouse gas emissions. In addition, it provides strong security and personal information protection functions to overcome the limits of existing energy platforms. The resulting energy platform is intended to improve power supply and user convenience and to contribute to addressing global warming. This paper describes the implementation of the system and directions for future improvement.
https://doi.org/10.20465/KIOTS.2024.10.2.043
