• Title/Summary/Keyword: data pre-processing

Search Result 801, Processing Time 0.026 seconds

Pre/post-processing Operator Selection for Accurate Program Bug Localization (정확한 프로그램 결함 위치 추적을 위한 전-후처리 방법론)

  • Kim, Dongsun
    • Journal of Broadcast Engineering
    • /
    • v.27 no.2
    • /
    • pp.240-243
    • /
    • 2022
  • Tracking the location of program defects is an essential task for software maintenance and repair. When a bug report is submitted, bug localization is a costly task because of the developer's manual effort. Many researchers have tried to automate the task, but according to the reported results, the performance is still insufficient in practice. Therefore, in this study, we analyzed a large amount of bug report data and the latest research and found that the existing studies used only one preprocessing without considering the characteristics of the bug report. In this paper, to solve the problems mentioned earlier, we propose a pre/post-processing operator selection approach for bug localization.

Design and Implementation of a Sound Classification System for Context-Aware Mobile Computing (상황 인식 모바일 컴퓨팅을 위한 사운드 분류 시스템의 설계 및 구현)

  • Kim, Joo-Hee;Lee, Seok-Jun;Kim, In-Cheol
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.3 no.2
    • /
    • pp.81-86
    • /
    • 2014
  • In this paper, we present an effective sound classification system for recognizing the real-time context of a smartphone user. Our system avoids unnecessary consumption of limited computational resource by filtering both silence and white noise out of input sound data in the pre-processing step. It also improves the classification performance on low energy-level sounds by amplifying them as pre-processing. Moreover, for efficient learning and application of HMM classification models, our system executes the dimension reduction and discretization on the feature vectors through k-means clustering. We collected a large amount of 8 different type sound data from daily life in a university research building and then conducted experiments using them. Through these experiments, our system showed high classification performance.

Developing a Program to Pre-process AIS Data and applying to Vung Tau Waterway in Vietnam - Based on the IWRAP Mk2 program - (AIS 데이터 전처리 프로그램의 개발 및 Vung Tau 해역에의 적용 - IWRAP Mk2 프로그램을 기초로 -)

  • Nguyen, Xuan Thanh;Park, Young-Soo;Park, Jin-Soo;Jeong, Jae-Yong
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.19 no.4
    • /
    • pp.345-351
    • /
    • 2013
  • The IWRAP program (Inland Waterway Risk Assessment Program) is a useful program for risk assessment of a waterway. However, in the basic version, the function which is used to import AIS data is not included. So users have to prepare the data and input to the program manually. And not all waterways have enough statistical data about passing vessels especially in developing countries as Vietnam. This paper studies the development of a program to pre-process AIS data for using the IWRAP Mk2 program basic version. In addition, it provides users basic information about marine traffic in a waterway such as routes layout, number of passages at a gate classified by type, size and time. The developed program, named TOAIS (Total AIS), was successfully used to pre-process AIS data collected in the Vung Tau waterway-Vietnam. As a result, the IWRAP Mk2 program basic version using data pre-processed from TOAIS could effectively assess the risk of collision in this waterway.

Web Navigation Mining by Integrating Web Usage Data and Hyperlink Structures (웹 사용 데이타와 하이퍼링크 구조를 통합한 웹 네비게이션 마이닝)

  • Gu Heummo;Choi Joongmin
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.5
    • /
    • pp.416-427
    • /
    • 2005
  • Web navigation mining is a method of discovering Web navigation patterns by analyzing the Web access log data. However, it is admitted that the log data contains noisy information that leads to the incorrect recognition of user navigation path on the Web's hyperlink structure. As a result, previous Web navigation mining systems that exploited solely the log data have not shown good performance in discovering correct Web navigation patterns efficiently, mainly due to the complex pre-processing procedure. To resolve this problem, this paper proposes a technique of amalgamating the Web's hyperlink structure information with the Web access log data to discover navigation patterns correctly and efficiently. Our implemented Web navigation mining system called SPMiner produces a WebTree from the hyperlink structure of a Web site that is used trl eliminate the possible noises in the Web log data caused by the user's abnormal navigational activities. SPMiner remarkably reduces the pre-processing overhead by using the structure of the Web, and as a result, it could analyze the user's search patterns efficiently.

The Study on the Quality of Pre-Processed Vegetables in School and Institutional Food-Service (단체급식에서 사용되는 전처리 농산물의 품질 특성 분석)

  • Lee, Seung-Joo;Lee, Seung-Mi
    • Korean Journal of Food Science and Technology
    • /
    • v.38 no.5
    • /
    • pp.628-634
    • /
    • 2006
  • This study was performed to investigate the quality of pre-processed vegetables used in school and institutional food-services. Pre-processed food materials (carrot, potato, and cabbage) frequently used in food-service were collected from 14 various processing company sources. The sensory and physico-chemical qualities of the pre-processed food materials were determined using sensory and instrumental analysis. For the physico-chemical analysis of the food materials, pH, total acidity, hardness, Hunter colorimeter value, reducing sugar and vitamin C content were determined. For the sensory quality evaluation, 15 panelist were trained and consensus was reached on the quality standards of the preprocessed materials (carrot, potato, and cabbage). Finally, appearance, color, texture, off-odor/taste, and overall quality were determined. In the physico-chemical analysis, there were no significant differences among samples collected from various processing companies. In sensory quality evaluations, the color quality of pre-processed potato was lower than that of other materials. From the coefficient correlations and partial least squares regression analysis between sensory and instrumental data, pH, total acidity, colorimeter values, and hardness were considered important components in assessing the quality of pre-processed vegetables.

Safety Design and Validation of Mission Equipment Package for Korean Utility Helicopter (KUH 임무탑재시스템의 안전성설계 및 검증)

  • Kim, Yoo-Kyung;Kim, Myung-Chin;Kim, Tae-Hyun;Yim, Jong-Bong
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.38 no.8
    • /
    • pp.813-822
    • /
    • 2010
  • Integrated data processing for display of flight critical data and mission critical data was conducted without additional display instruments using glass cockpit design. Based on a pre-designed flight critical system and a mission critical system, this paper shows an optimal design of subsystem integration. The design satisfies safety requirements of flight control systems(FCS) and requires minimized modification of pre-designed systems. By conducting integration test using System Integration laboratory(SIL), it is confirmed that the introduced design approach meets the safety requirements of the MEP system.

GML Map Visualization on Mobile Devices

  • Song, Eun-Ha;Jeong, Young-Sik
    • Journal of Information Processing Systems
    • /
    • v.6 no.1
    • /
    • pp.33-42
    • /
    • 2010
  • GIS can only be applied to certain areas by storing format. It is subordinate to a system when displaying geographic information data. It is therefore inevitable for GIS to use GML that supports efficient usage of various geographic information data and interoperability for integration and sharing. The paper constructs VisualGML that translates currently-used geographic information such as DXF (Drawing Exchange Format), DWG (DraWinG), or SHP (Shapefile) into GML format for visualization. VisualGML constructs an integrated map pre-process module, which filters geographic information data according to its tag and properties, to provide the flexibility of a mobile device. VisualGML also provides two major GIS services for the user and administrator. It can enable visualizing location search. This is applied with a 3-Layer POI structure for the user. It has trace monitoring visualization through moving information of mobile devices for the administrator.

A Development and Application of Data Visualization EducationProgram for 3rd Grade Students in Elementary School (초등학교 3학년 학생들을 위한 데이터 시각화 교육 프로그램 개발 및 적용)

  • Jiseon Woo;Kapsu Kim
    • Journal of The Korean Association of Information Education
    • /
    • v.26 no.6
    • /
    • pp.481-490
    • /
    • 2022
  • With the development of computing technology, the big data era has arrived, and we live with a lot of data around us. Elementary school students are no exception. Therefore, it is very important to learn to process data from elementary school. Since elementary school students have intuitive thinking, data visualization, which expresses data directly in pictures, is an important learning element. In this study, we study how effective elementary school students can visualize data in their daily lives to improve their information processing capabilities. Adata visualization program was developed by organizing and visualizing data using data visualization tools for the 8th class, which can be done by third graders in elementary school, and then experiencing the process of interaction. As a result of applying the developed program to 186 students in 7 classes, knowledge information processing competency factors were evaluated before and after class. As a result of the pre- and post-test, there was a significant difference in knowledge information processing capabilities. Therefore, the data visualization program developed in this study is effective.

Semantic Correspondence of Database Schema from Heterogeneous Databases using Self-Organizing Map

  • Dumlao, Menchita F.;Oh, Byung-Joo
    • Journal of IKEEE
    • /
    • v.12 no.4
    • /
    • pp.217-224
    • /
    • 2008
  • This paper provides a framework for semantic correspondence of heterogeneous databases using self- organizing map. It solves the problem of overlapping between different databases due to their different schemas. Clustering technique using self-organizing maps (SOM) is tested and evaluated to assess its performance when using different kinds of data. Preprocessing of database is performed prior to clustering using edit distance algorithm, principal component analysis (PCA), and normalization function to identify the features necessary for clustering.

  • PDF

Efficient Data Pre-fetching Scheme for InfiniBand based High Performance Clusters (인피니밴드 기반 고성능 클러스터를 위한 효율적인 데이터 선반입 기법)

  • Kim, Bongjae;Jung, Jinman;Min, Hong;Heo, Junyoung;Jung, Hyedong
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.5
    • /
    • pp.293-298
    • /
    • 2017
  • Recently, much research has been devoted to implementing and provisioning high-performance computing environment using clusters with multiple computers and high-performance networking technologies. In-memory based Key-Value stores, such as Redis or Memcached, are widely used in high performance cluster environments to improve the data processing performance. We can distribute data at different storage nodes, and each computing node can access it at a high speed using these In-memory based Key-Value stores. InfiniBand is a de-facto technology that is widely used to interconnect each node of a cluster. In this paper, we propose a new data pre-fetching scheme for Key-Value store based on high performance clusters to improve the performance. The proposed scheme utilizes the data transfer characteristics of InfiniBand. The results of the simulation show that the proposed scheme can reduce the data transfer time by up to about 28%.