• Title/Summary/Keyword: Big-data Software

Search Result 441, Processing Time 0.031 seconds

Improvement of BigCloneBench Using Tree-Based Convolutional Neural Network (트리 기반 컨볼루션 신경망을 이용한 BigCloneBench 개선)

  • Park, Gunwoo;Hong, Sung-Moon;Kim, Hyunha;Doh, Kyung-Goo
    • Journal of Software Assessment and Valuation
    • /
    • v.15 no.1
    • /
    • pp.43-53
    • /
    • 2019
  • BigCloneBench has recently been used for performance evaluation of code clone detection tool using machine learning. However, since BigCloneBench is not a benchmark that is optimized for machine learning, incorrect learning data can be created. In this paper, we have shown through experiments using machine learning that the set of Type-4 clone methods provided by BigCloneBench can additionally be found. Experimental results using Tree-Based Convolutional Neural Network show that our proposed method is effective in improving BigCloneBench's dataset.

The relation between the five critical crime of criminal law and the private security services (형법범죄 중 5대 범죄와 민간경비 간의 관계)

  • Joo, Il-Yeob;Jo, Gwang-Rae
    • Korean Security Journal
    • /
    • no.8
    • /
    • pp.361-377
    • /
    • 2004
  • This study is to examine the relations between the big five critical crime that consist of homicide, robbery, rape, theft, violence and the private security services. To achieve this objective, this research selected the subject of study, specially, 2002 status of the private security such as the number of companies and employees classified by areas along with the big five crime mentioned above classified by area. The research data is secondary data that is from '2003 Crime Analysis' of the Supreme Public Prosecutors' Office and 'The private Security Related Data' of the National Police Agency. The selected data were analyzed according to the variables by using SPSS 10.0 statistics software program. Each hypothesis was verified around the level of significance ${\alpha}$=.05 by using the statistical techniques, such as Descriptive Statistics, Correlation, Regression, etc. The following was the result of the study, First, the total number of the big five crime affects the number of the companies at significant level. Second, the number of the security companies can be explained by the each total number of the big five crime in the order of theft, robbery, violence, rape and murder. Third, the total number of the big five crime affects the number of the security employees at significant level. Forth the number of the security employees can be explained by the each total number of the big five crime in the order of theft, robbery, violence, rape and murder.

  • PDF

An Introduction and Trend Analysis in Questions of Engineer Big Data Analyst (빅데이터분석 기사 국가기술자격 개요 및 출제 경향 분석)

  • Jang, Hee-Seon;Song, Ji Young
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.01a
    • /
    • pp.393-394
    • /
    • 2022
  • 본 논문에서는 과학기술정보통신부와 통계청에서 주관하고 한국산업인력공단에서 시행(한국데이터산업진흥원 위탁)하는 「빅데이터분석기사」에 대한 필기 및 실기 시험의 내용을 설명하고 지금까지 2회에 걸쳐 시행된 시험에 대한 문제점과 이에 대한 해결방안을 제시하였다. 2021년 처음 시행된 국가기술자격으로써 기존 자격증과의 차별성, 난이도 조정, 수험생들의 각종 민원 발생 등의 문제를 해결하기 위한 체계적인 시스템 마련이 요구되며, 향후 데이터 과학자들에 대한 수요 급증에 대비하기 위해 빅데이터분석 실무 능력을 평가하기 위한 바람직한 제도와 정책이 병행되어야 한다.

  • PDF

Performance Analysis of Real-Time Big Data Search Platform Based on High-Capacity Persistent Memory (대용량 영구 메모리 기반 실시간 빅데이터 검색 플랫폼 성능 분석)

  • Eunseo Lee;Dongchul Park
    • Journal of Platform Technology
    • /
    • v.11 no.4
    • /
    • pp.50-61
    • /
    • 2023
  • The advancement of various big data technologies has had a tremendous impact on many industries. Diverse big data research studies have been conducted to process and analyze massive data quickly. Under these circumstances, new emerging technologies such as high-capacity persistent memory (PMEM) and Compute Express Link (CXL) have lately attracted significant attention. However, little investigation into a big data "search" platform has been made. Moreover, most big data software platforms have been still optimized for traditional DRAM-based computing systems. This paper first evaluates the basic performance of Intel Optane PMEM, and then investigates both indexing and searching performance of Elasticsearch, a widely-known enterprise big data search platform, on the PMEM-based computing system to explore its effectiveness and possibility. Extensive and comprehensive experiments shows that the proposed Optane PMEM-based Elasticsearch achieves indexing and searching performance improvement by an average of 1.45 times and 3.2 times respectively compared to DRAM-based system. Consequently, this paper demonstrates the high I/O, high-capacity, and nonvolatile PMEM-based computing systems are very promising for big data search platforms.

  • PDF

IP-Based Heterogeneous Network Interface Gateway for IoT Big Data Collection (IoT 빅데이터 수집을 위한 IP기반 이기종 네트워크 인터페이스 연동 게이트웨이)

  • Kang, Jiheon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.2
    • /
    • pp.173-178
    • /
    • 2019
  • Recently, the types and amount of data generated, collected, and measured in IoT such as smart home, security, and factory are increasing. The technologies for IoT service include sensor devices to measure desired data, embedded software to control the devices such as signal processing, wireless network protocol to transmit and receive the measured data, and big data and AI-based analysis. In this paper, we focused on developing a gateway for interfacing heterogeneous sensor network protocols that are used in various IoT devices and propose a heterogeneous network interface IoT gateway. We utilized a OpenWrt-based wireless routers and used 6LoWAN stack for IP-based communication via BLE and IEEE 802.15.4 adapters. We developed a software to convert Z-Wave and LoRa packets into IP packet using our Python-based middleware. We expect the IoT gateway to be used as an effective device for collecting IoT big data.

A Study on the Developing of Big Data Services in Public Library (도서관 빅데이터 서비스 모형 개발에 관한 연구: 공공도서관을 중심으로)

  • Pyo, Soon Hee;Kim, Yun Hyung;Kim, Hye Sun;Kim, Wan Jong
    • Journal of the Korean Society for information Management
    • /
    • v.32 no.2
    • /
    • pp.63-86
    • /
    • 2015
  • Big data refers to dataset whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze. And now it is considered to create the new opportunity in every industry. The purpose of this study is to develop of big data services in public library for improved library services. To this end, analysed the type of library big data and needs of stockholders through the various methods such as deep interview, focus group interview, questionnaire. At first step, we defined the 16 big data service models from interview with librarians, and LIS professions. Second step, it was considered necessity, timeliness, possibility of development. We developed the final two services called on 'Decision Support Services for Public Librarians' and 'Book Recommendation Services for Users.'

BIG DATA ANALYSIS ROLE IN ADVANCING THE VARIOUS ACTIVITIES OF DIGITAL LIBRARIES: TAIBAH UNIVERSITY CASE STUDY- SAUDI ARABIA

  • Alotaibi, Saqar Moisan F
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.8
    • /
    • pp.297-307
    • /
    • 2021
  • In the vibrant environment, documentation and managing systems are maintained autonomously through education foundations, book materials and libraries at the same time as information are not voluntarily accessible in a centralized location. At the moment Libraries are providing online resources and services for education activities. Moreover, libraries are applying outlets of social media such as Facebook as well as Instagrams to preview their services and procedures. Librarians with the assistance of promising tools and technology like analytics software are capable to accumulate more online information, analyse them for incorporating worth to their services. Thus Libraries can employ big data to construct enhanced decisions concerning collection developments, updating public spaces and tracking the purpose of library book materials. Big data is being produced due to library digitations and this has forced restrictions to academicians, researchers and policy creator's efforts in enhancing the quality and effectiveness. Accordingly, helping the library clients with research articles and book materials that are in line with the users interest is a big challenge and dispute based on Taibah university in Saudi Arabia. The issues of this domain brings the numerous sources of data from various institutions and sources into single place in real time which can be time consuming. The most important aim is to reduce the time that lapses among the authentic book reading and searching the specific study material.

A Study on Finding Emergency Conditions for Automatic Authentication Applying Big Data Processing and AI Mechanism on Medical Information Platform

  • Ham, Gyu-Sung;Kang, Mingoo;Joo, Su-Chong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.8
    • /
    • pp.2772-2786
    • /
    • 2022
  • We had researched an automatic authentication-supported medical information platform[6]. The proposed automatic authentication consists of user authentication and mobile terminal authentication, and the authentications are performed simultaneously in patients' emergency conditions. In this paper, we studied on finding emergency conditions for the automatic authentication by applying big data processing and AI mechanism on the extended medical information platform with an added edge computing system. We used big data processing, SVM, and 1-Dimension CNN of AI mechanism to find emergency conditions as authentication means considering patients' underlying diseases such as hypertension, diabetes mellitus, and arrhythmia. To quickly determine a patient's emergency conditions, we placed edge computing at the end of the platform. The medical information server derives patients' emergency conditions decision values using big data processing and AI mechanism and transmits the values to an edge node. If the edge node determines the patient emergency conditions, the edge node notifies the emergency conditions to the medical information server. The medical server transmits an emergency message to the patient's charge medical staff. The medical staff performs the automatic authentication using a mobile terminal. After the automatic authentication is completed, the medical staff can access the patient's upper medical information that was not seen in the normal condition.

The Analysis of the GPS Data Processing of the NGII CORS by Bernese and TGO (Bernese와 TGO에 의한 국내 GPS 상시관측소 자료처리 결과 분석)

  • Kim, Ji-Woon;Kwon, Jay-Hyoun;Lee, Ji-Sun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.26 no.6
    • /
    • pp.549-559
    • /
    • 2008
  • This study verified the limitations of commercial GPS data processing software and the applicability on precise positioning through comparing the processing results between Bernese and TGO under various conditions. To achieve the goal, we selected three nationwide station data and two smaller local data to constitute networks. By using Bernese and TGO, those networks are processed through the baseline analysis and the network adjustment. The comparative analysis was carried out, in terms of software, baseline length and network scale, observation duration, and number of fixed points. In the comparison between softwares, the scientific software was excellent in accuracy. It was confirmed that, as GPS-related technology is developed, the performance of the receiver was enhanced. And, in parallel with this, even the functionalities of the commercial software were tremendously enhanced. The difference, however, in result between the scientific and commercial software are still exist even if it is not big. Therefore, this study confirms that the scientific software should be used when the most precise position is necessary to be computed, especially if baseline vectors are big.

Development of an LP integrated environment software under MS-DOS (MS-DOS용 선형계획법 통합환경 소프트웨어의 개발)

  • 설동렬;박찬규;서용원;박순달
    • Korean Management Science Review
    • /
    • v.12 no.1
    • /
    • pp.125-138
    • /
    • 1995
  • This paper is to develop an integrated environment software on MS-DOS for linear programming. For the purpose, First, the linear programming integrated environment software satisfying both the educational purpose and the professional purpose was designed and constructed on MS-DOS. Second, the text editor with big capacity was developed. The arithmetic form analyser was also developed and connected to the test editor so that users can input data in the arithmetic form. As a result, users can learn and perform linear programming in the linear programming integrated environment software.

  • PDF