• Title/Summary/Keyword: data pre-processing

Search Result 801, Processing Time 0.027 seconds

A Study on Life Estimate of Insulation Cable for Image Processing of Electrical Tree (전기트리의 영상처리를 이용한 절연케이블의 수명예측에 관한 연구)

  • 정기봉;김형균;김창석;최창주;오무송;김태성
    • Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference
    • /
    • 2001.07a
    • /
    • pp.319-322
    • /
    • 2001
  • The proposed system was composed of pre-processor which was executing binary/high-pass filtering and post-processor which ranged from statistic data to prediction. In post-processor work, step one was filter process of image, step two was image recognition, and step three was destruction degree/time prediction. After these processing, we could predict image of the last destruction timestamp. This research was produced variation value according to growth of tree pattern. This result showed improved correction, when this research was applied image Processing. Pre-processing step of original image had good result binary work after high pass- filter execution. In the case of using partial discharge of the image, our research could predict the last destruction timestamp. By means of experimental data, this Prediction system was acquired ${\pm}$3.2% error range.

  • PDF

Constraint Data Modeling for Spatiotemporal Data Application (시공간 데이터 응용을 위한 제약 데이터 모델링)

  • Jung, Hun Jo;Woo, Sung Koo
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.6 no.4
    • /
    • pp.45-56
    • /
    • 2010
  • This paper suggests constraint data modeling based on constraint data presentation techniques to perform complex spatial database operation naturally. We were able to identify the limitation of extendibility of dimension and non-equal framework via relevant research for former schema of spatial database and query processing. Therefore we described generalized tuple of spatial data and the definition of suggested constraint data modeling. Also we selected MLPQ/PReSTO tool among constraint database prototype and compare standard functionality of ARC/VIEW. Then we design scenario for spatial operation using MLPQ/PReSTO and we suggested application effect after query processing. Based on above explanation, we were able to identify that we can process spatial data naturally and effectively using simple constraint routine on same framework via constraint data modeling.

DESIGN OF COMMON TEST HARNESS SYSTEM FOR SATELLITE GROUND SEGMENT DEVELOPMENT

  • Seo, Seok-Bae;Kim, Su-Jin;Koo, In-Hoi;Ahn, Sang-Il
    • Proceedings of the KSRS Conference
    • /
    • 2007.10a
    • /
    • pp.544-547
    • /
    • 2007
  • Because data processing systems in recent years are more complicated, main function of the data processing is divided as several sub-functions which are implemented and verified in each subsystem of the data processing system. For the verification of data processing system, many interface tests among subsystems are required and also a lot of simulation systems are demanded. This paper proposes CTHS (Common Test Harness System) for satellite ground segment development which has all of functions for interface test of the data processing system in one PC. Main functions of the CTHS software are data interface, system log generation, and system information display. For the interface test of the data processing system, all of actions of the CTHS are executed by a pre-defined operation scenario which is written by purpose of the data processing system test.

  • PDF

Development of the GIS Based Pre- and Post-Processing Tool for the Visual MODFLOW Groundwater Flow Modeling (Visual MODFLOW 지하수 유동 모델링을 위한 GIS 기반 전ㆍ후처리기 개발)

  • Kim, Man-Kyu
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.6 no.2
    • /
    • pp.65-79
    • /
    • 2003
  • In this study GIS based pre- and post-processing tool for the Visual MODFLOW that is specific software to model groundwater flow is developed. This tool not only makes input data scientifically but also manages input and output data in terms of the groundwater flow analysis. In addition it can storage project products systematically into Oracle database. The most characteristic figure of this processing tool is to provide the module, which automatically or semi automatically develops various grid cell sizes using GIS ArcView and also produces DXF files reflecting various boundary conditions in the modeling zone. In particular, eminences of this research are to create 3 dimensional hydrogeological structures with 2 dimensional GIS ArcView and to conduct pre- and post- processing along with same topology and data format of the MODFLOW.

  • PDF

Case Study of Flipped-learning on a Signal Processing Class (신호처리 교과목에 대한 플립러닝 적용사례)

  • Yoo, Jae Ha
    • Journal of Practical Engineering Education
    • /
    • v.9 no.2
    • /
    • pp.125-132
    • /
    • 2017
  • This paper is a study on the application of flipped learning, which is known as a teaching method that provides effective learning, to signal processing subjects. The teaching - learning model used for the class and the implementation examples for three years are described. In-class can be judged to be a relatively successful class, but organization of the video data provided in the pre-class and evaluation of whether or not to study pre-class video have to be improved.

Robust Sentiment Classification of Metaverse Services Using a Pre-trained Language Model with Soft Voting

  • Haein Lee;Hae Sun Jung;Seon Hong Lee;Jang Hyun Kim
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.9
    • /
    • pp.2334-2347
    • /
    • 2023
  • Metaverse services generate text data, data of ubiquitous computing, in real-time to analyze user emotions. Analysis of user emotions is an important task in metaverse services. This study aims to classify user sentiments using deep learning and pre-trained language models based on the transformer structure. Previous studies collected data from a single platform, whereas the current study incorporated the review data as "Metaverse" keyword from the YouTube and Google Play Store platforms for general utilization. As a result, the Bidirectional Encoder Representations from Transformers (BERT) and Robustly optimized BERT approach (RoBERTa) models using the soft voting mechanism achieved a highest accuracy of 88.57%. In addition, the area under the curve (AUC) score of the ensemble model comprising RoBERTa, BERT, and A Lite BERT (ALBERT) was 0.9458. The results demonstrate that the ensemble combined with the RoBERTa model exhibits good performance. Therefore, the RoBERTa model can be applied on platforms that provide metaverse services. The findings contribute to the advancement of natural language processing techniques in metaverse services, which are increasingly important in digital platforms and virtual environments. Overall, this study provides empirical evidence that sentiment analysis using deep learning and pre-trained language models is a promising approach to improving user experiences in metaverse services.

Distributed Software Tools Enabling Efficient RFID Data Pre-Processing Using Agent Mobility (에이전트 이동성을 이용한 효율적인 전자태그 데이터 전처리 가능한 분산 소프트웨어 도구)

  • Ahn, Yong-Sun;Ahn, Jin-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.12 no.4
    • /
    • pp.608-615
    • /
    • 2009
  • As RFID tag prices have rapidly been declining because of the advance of RFID technology, each tag is attached to an individual item, not a packing box only, for managing the item much more precisely. However, some mechanisms are essential to handle a very large amount of tag data quickly because readers and middlewares processing RFID data have limited hardware resources. In this paper, we design and implement a new mobile agent-based distributed software tools to satisfy this requirement efficiently. These tools provide a convenient environment enabling required data to be pre-processed repeatedly in transit by transferring a mobile agent including its specified data collection policy to numerous mobile readers. This behavior can significantly reduce the elapsed time required for processing huge volumes of tag data at the readers and middlewares with their very high recognition rates compared with the existing one to process the data by fixed readers after having arrived at the destination

  • PDF

An Intelligent Framework for Feature Detection and Health Recommendation System of Diseases

  • Mavaluru, Dinesh
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.3
    • /
    • pp.177-184
    • /
    • 2021
  • All over the world, people are affected by many chronic diseases and medical practitioners are working hard to find out the symptoms and remedies for the diseases. Many researchers focus on the feature detection of the disease and trying to get a better health recommendation system. It is necessary to detect the features automatically to provide the most relevant solution for the disease. This research gives the framework of Health Recommendation System (HRS) for identification of relevant and non-redundant features in the dataset for prediction and recommendation of diseases. This system consists of three phases such as Pre-processing, Feature Selection and Performance evaluation. It supports for handling of missing and noisy data using the proposed Imputation of missing data and noise detection based Pre-processing algorithm (IMDNDP). The selection of features from the pre-processed dataset is performed by proposed ensemble-based feature selection using an expert's knowledge (EFS-EK). It is very difficult to detect and monitor the diseases manually and also needs the expertise in the field so that process becomes time consuming. Finally, the prediction and recommendation can be done using Support Vector Machine (SVM) and rule-based approaches.

Sentiment Analysis of COVID-19 Tweets: Impact of Pre-processing Step

  • Ayadi, Rami;Shahin, Osama R.;Ghorbel, Osama;Alanazi, Rayan;Saidi, Anouar
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.3
    • /
    • pp.206-211
    • /
    • 2021
  • Internet users are increasingly invited to express their opinions on various subjects in social networks, e-commerce sites, news sites, forums, etc. Much of this information, which describes feelings, becomes the subject of study in several areas of research such as: "Sensing opinions and analyzing feelings". It is the process of identifying the polarity of the feelings held in the opinions found in the interactions of Internet users on the web and classifying them as positive, negative, or neutral. In this article, we suggest the implementation of a sentiment analysis tool that has the role of detecting the polarity of opinions from people about COVID-19 extracted from social media (tweeter) in the Arabic language and to know the impact of the pre-processing phase on the opinions classification. The results show gaps in this area of research, first of all, the lack of resources when collecting data. Second, Arabic language is more complexes in pre-processing step, especially the dialects in the pre-treatment phase. But ultimately the results obtained are promising.

A Pre-processing Study to Solve the Problem of Rare Class Classification of Network Traffic Data (네트워크 트래픽 데이터의 희소 클래스 분류 문제 해결을 위한 전처리 연구)

  • Ryu, Kyung Joon;Shin, DongIl;Shin, DongKyoo;Park, JeongChan;Kim, JinGoog
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.12
    • /
    • pp.411-418
    • /
    • 2020
  • In the field of information security, IDS(Intrusion Detection System) is normally classified in two different categories: signature-based IDS and anomaly-based IDS. Many studies in anomaly-based IDS have been conducted that analyze network traffic data generated in cyberspace by machine learning algorithms. In this paper, we studied pre-processing methods to overcome performance degradation problems cashed by rare classes. We experimented classification performance of a Machine Learning algorithm by reconstructing data set based on rare classes and semi rare classes. After reconstructing data into three different sets, wrapper and filter feature selection methods are applied continuously. Each data set is regularized by a quantile scaler. Depp neural network model is used for learning and validation. The evaluation results are compared by true positive values and false negative values. We acquired improved classification performances on all of three data sets.