Search | Korea Science

The Effect of Meta-Features of Multiclass Datasets on the Performance of Classification Algorithms (다중 클래스 데이터셋의 메타특징이 판별 알고리즘의 성능에 미치는 영향 연구)

Kim, Jeonghun;Kim, Min Yong;Kwon, Ohbyung
- Journal of Intelligence and Information Systems
- /
- v.26 no.1
- /
- pp.23-45
- /
- 2020
Big data is creating in a wide variety of fields such as medical care, manufacturing, logistics, sales site, SNS, and the dataset characteristics are also diverse. In order to secure the competitiveness of companies, it is necessary to improve decision-making capacity using a classification algorithm. However, most of them do not have sufficient knowledge on what kind of classification algorithm is appropriate for a specific problem area. In other words, determining which classification algorithm is appropriate depending on the characteristics of the dataset was has been a task that required expertise and effort. This is because the relationship between the characteristics of datasets (called meta-features) and the performance of classification algorithms has not been fully understood. Moreover, there has been little research on meta-features reflecting the characteristics of multi-class. Therefore, the purpose of this study is to empirically analyze whether meta-features of multi-class datasets have a significant effect on the performance of classification algorithms. In this study, meta-features of multi-class datasets were identified into two factors, (the data structure and the data complexity,) and seven representative meta-features were selected. Among those, we included the Herfindahl-Hirschman Index (HHI), originally a market concentration measurement index, in the meta-features to replace IR(Imbalanced Ratio). Also, we developed a new index called Reverse ReLU Silhouette Score into the meta-feature set. Among the UCI Machine Learning Repository data, six representative datasets (Balance Scale, PageBlocks, Car Evaluation, User Knowledge-Modeling, Wine Quality(red), Contraceptive Method Choice) were selected. The class of each dataset was classified by using the classification algorithms (KNN, Logistic Regression, Nave Bayes, Random Forest, and SVM) selected in the study. For each dataset, we applied 10-fold cross validation method. 10% to 100% oversampling method is applied for each fold and meta-features of the dataset is measured. The meta-features selected are HHI, Number of Classes, Number of Features, Entropy, Reverse ReLU Silhouette Score, Nonlinearity of Linear Classifier, Hub Score. F1-score was selected as the dependent variable. As a result, the results of this study showed that the six meta-features including Reverse ReLU Silhouette Score and HHI proposed in this study have a significant effect on the classification performance. (1) The meta-features HHI proposed in this study was significant in the classification performance. (2) The number of variables has a significant effect on the classification performance, unlike the number of classes, but it has a positive effect. (3) The number of classes has a negative effect on the performance of classification. (4) Entropy has a significant effect on the performance of classification. (5) The Reverse ReLU Silhouette Score also significantly affects the classification performance at a significant level of 0.01. (6) The nonlinearity of linear classifiers has a significant negative effect on classification performance. In addition, the results of the analysis by the classification algorithms were also consistent. In the regression analysis by classification algorithm, Naïve Bayes algorithm does not have a significant effect on the number of variables unlike other classification algorithms. This study has two theoretical contributions: (1) two new meta-features (HHI, Reverse ReLU Silhouette score) was proved to be significant. (2) The effects of data characteristics on the performance of classification were investigated using meta-features. The practical contribution points (1) can be utilized in the development of classification algorithm recommendation system according to the characteristics of datasets. (2) Many data scientists are often testing by adjusting the parameters of the algorithm to find the optimal algorithm for the situation because the characteristics of the data are different. In this process, excessive waste of resources occurs due to hardware, cost, time, and manpower. This study is expected to be useful for machine learning, data mining researchers, practitioners, and machine learning-based system developers. The composition of this study consists of introduction, related research, research model, experiment, conclusion and discussion.
https://doi.org/10.13088/jiis.2020.26.1.023 인용 PDF KSCI

Design and Implementation of Content-based Video Database using an Integrated Video Indexing Method (통합된 비디오 인덱싱 방법을 이용한 내용기반 비디오 데이타베이스의 설계 및 구현)

Lee, Tae-Dong;Kim, Min-Koo
- Journal of KIISE:Computing Practices and Letters
- /
- v.7 no.6
- /
- pp.661-683
- /
- 2001
There is a rapid increase in the use of digital video information in recent years, it becomes more important to manage video databases efficiently. The development of high speed data network and digital techniques has emerged new multimedia applications such as internet broadcasting, Video On Demand(VOD) combined with video data processing and computer. Video database should be construct for searching fast, efficient video be extract the accurate feature information of video with more massive and more complex characteristics. Video database are essential differences between video databases and traditional databases. These differences lead to interesting new issues in searching of video, data modeling. So, cause us to consider new generation method of database, efficient retrieval method of video. In this paper, We propose the construction and generation method of the video database based on contents which is able to accumulate the meaningful structure of video and the prior production information. And by the proposed the construction and generation method of the video database implemented the video database which can produce the new contents for the internet broadcasting centralized on the video database. For this production, We proposed the video indexing method which integrates the annotation-based retrieval and the content-based retrieval in order to extract and retrieval the feature information of the video data using the relationship between the meaningful structure and the prior production information on the process of the video parsing and extracting the representative key frame. We can improve the performance of the video contents retrieval, because the integrated video indexing method is using the content-based metadata type represented in the low level of video and the annotation-based metadata type impressed in the high level which is difficult to extract the feature information of the video at he same time.
PDF

Origin and Evolution of Leucogranite of NE Yeongnam Massif from Samcheok Area, Korea (삼척지역 북동 영남 육괴에 분포하는 우백질 화강암의 기원 및 진화)

Cheong, Won-Seok;Na, Ki-Chang
- The Journal of the Petrological Society of Korea
- /
- v.17 no.1
- /
- pp.16-35
- /
- 2008
We study metamorphism of metasedimetary rocks and origin and evolution of leucogranite form Samcheok area, northeastern Yeongnam massif, South Korea. Metamorphic rocks in this area are composed of metasedimentary migmatite, biotite granitic gneiss and leucogranite. Metasedimentary rocks, which refer to major element feature of siliclastic sediment, are divided into two metamorphic zones based on mineral assemblages, garnet and sillimanite zones. According to petrogenetic grid of mineral assemblages, metamorhpic P-T conditions are $740{\sim}800^{\circ}C$ at $4.8{\sim}5.8\;kbar$ in the garnet zone and $640-760^{\circ}C$ at 2.5-4.5kbar in sillimanite zone. The leucogranite (Imwon leucogranite) is peraluminous granite which has high alumina index (A/CNK=1.31-1.93) and positive discriminant factor value (DF > 0). Thus, leucogranite is S-type granite generated from metasedimentary rocks. Major and trace element diagram ($R_1-R_2$ diagram and Rb vs. Y+Nb etc.) show collisional environment such as syn-collisional or volcanic arc granite. Because Rb/sr ratio (1.8-22.9) of leucogranites is higher than Sr/Ba ratio (0.21-0.79), leucogranite would be derived from muscovite dehydrate melting in metasedimentary rocks. Leucogranites have lower concentration of LREE and Eu and similar that of HREE relative to metasedimentary rocks. To examine difference of REEs between leucogranites and metasedimentary rocks, we perform modeling using volume percentage of a leucogranite and a metasedimenatry rock from study area and REE data of minerals from rhyolite (Nash and Crecraft, 1985) and melanosome of migmatite (Bea et al., 1994). Resultants of modeling indicate that LREE and HREE are controlled by monazites and garnet, respectively, although zircon is estimated HREE dominant in some leucogranite without garnet. Because there are many inclusions of accessary phases such as monazite and zircon in biotites from metasedimentary rocks. leucogranitic magma was mainly derived from muscovite-breakdown in metasedimenary rocks. Leucogranites can be subdivided into two types in compliance with Eu anomaly of chondrite nomalized REE pattern; the one of negative Eu anomaly is type I and the other is type II. Leucogranites have lower Eu concetnrations than that of metasedimenary rocks and similar that of both type. REE modeling suggest that this difference of Eu value is due to that of components of feldspars in both leucogranite and metasedimentary rock. The tendency of major ($K_2O$ and $Na_2O$) and face elements (Eu, Rb, Sr and Ba) of leucogranites also indicate that source magma of these two types was developed by anatexis experienced strong fractionation of alkali-feldspar. Conclusionally, leucogranites in this area are products of melts which was generated by muscovite-breakdown of metasedimenary rock in environment of continetal collision during high temperature/pressure metamorphism and then was fractionated and crystallized after extraction from source rock.
PDF KSCI

Performance Evaluation of Workstation System within ATM Integrated Service Switching System using Mean Value Analysis Algorithm (MVA 알고리즘을 이용한 ATM 기반 통합 서비스 교환기 내 워크스테이션의 성능 평가)

Jang, Seung-Ju;Kim, Gil-Yong;Lee, Jae-Hum;Park, Ho-Jin
- Journal of KIISE:Computing Practices and Letters
- /
- v.6 no.4
- /
- pp.421-429
- /
- 2000
In present, ATM integrated switching system has been developed to a mixed modules that complexed switching system including maintenance, operation based on B-ISDN/LAN service and plug-in module, , which runs on workstation computer system. Meanwhile, workstation has HMI operation system feature including file system management, time management, graphic processing, TMN agent function. The workstation has communicated with between ATM switching module and clients. This computer system architecture has much burden messages communication among processes or processor. These messages communication consume system resources which are socket, message queue, IO device files, regular files, and so on. Therefore, in this paper we proposed new performance modeling with this system architecture. We will analyze the system bottleneck and improve system performance. In addition, in the future, the system has many additional features should be migrated to workstation system, we need previously to evaluate system bottleneck and redesign it. In performance model, we use queueing network model and the simulation package is used PDQ and C-program.
PDF

Aesthetic Characteristics of Hanae Mori's Apparel (하나에 모리(Hanae Mori) 의상에 나타난 미적 특성)

Choi, Young-Ok
- The Korean Fashion and Textile Research Journal
- /
- v.9 no.6
- /
- pp.613-625
- /
- 2007
Globalizing the Japanese fashion successfully, Hanae Mori's work awoke the western fashion world's nostalgia towards the East. Analyzing the aesthetic characteristics of Hanae Mori's clothes what kinds of aesthetic characteristic that her work had and what kinds of influences that she made in the modern fashion would provide substantial contribution of the world's modern fashion. This study provided forms and remarkable features of Japanese traditional custom, revealed Hanae Mori's life and her philosophies of fashion, and defined Hanae Mori's aesthetic characteristics by analyzing her work from 1970's until the retirement, July 2004. Methods of this study are completed by documentary records of Hanae Mori, research papers and fashion magazines that are published domestically and internationally, and collected materials from internet. The results of analysis are epitomized as below. Hanae Mori was the first Japanese fashion designer who expressed the characteristics of traditional Japanese custom with modernity sprit. In the 60's and 70's, especially in the U.S. and European fashion market, she inspired western fashion designers by her original sprit of art: combining Japanese tradition which showed distinctive color and spirit of nature and the western beauty. Hanae Mori created new dress molding from the Kimono's unstructured feature. Her layered look dressing, oblique adjustment and Obi, and others all enabled Mori to express Japanese image into modern fashion. Additionally, in terms of traditional Japanese image being acknowledged world-widely, she played a major contribution in world fashion by suggesting a new vision and raised several sensations in fashion artistry and modeling. Amongst her various patterns, Hanae Mori had butterfly patterns in most of her works, which was her representative symbol. This spoke for her strong will and senses of duty that wanting to inform beauty of Japanese women who were reflected in modern and graceful butterfly patterns. Flowers were another element that symbolized Mori. Using various flower motifs that bloomed in every different four seasons, she connected two images into her fashion; beauty of the nature and enlightening image of vibrating life. The aesthetic characteristics of Hanae Mori's clothes were defined as five: Japonism, naturalism, feminism, eroticism, and modernism. Japonism which is the spirit of Japanese, Mori used the concept to connect the East and the West. Naturalism represented harmony of the nature and the human. Feminism highlighted Eastern women's beauty. Eroticism emitted feminine attraction. Modernism represented simplicity and sophistication. Such aesthetic character illustrated Mori's original emotion that was based on Japanese spirit and she combined it with values of the East and the West. From the analysis of Mori's aesthetic characteristics, it is clearly recognizable her feministic beauty is emanated by her original emotion and sensibility.
PDF KSCI

Light-Ontology Classification for Efficient Object Detection using a Hierarchical Tree Structure (효과적인 객체 검출을 위한 계층적 트리 구조를 이용한 조명 온톨로지 분류)

Kang, Sung-Kwan;Lee, Jung-Hyun
- Journal of Digital Convergence
- /
- v.10 no.10
- /
- pp.215-220
- /
- 2012
This paper proposes a ontology of tree structure approach for adaptive object recognition in a situation-variant environment. In this paper, we introduce a new concept, ontology of tree structure ontology, for context sensitivity, as we found that many developed systems work in a context-invariant environment. Due to the effects of illumination on a supreme obstinate designing context-sensitive recognition system, we have focused on designing such a context-variant system using ontology of tree structure. Ontology can be defined as an explicit specification of conceptualization of a domain typically captured in an abstract model of how people think about things in the domain. People produce ontologies to understand and explain underlying principles and environmental factors. In this research, we have proposed context ontology, context modeling, context adaptation, and context categorization to design ontology of tree structure based on illumination criteria. After selecting the proper light-ontology domain, we benefit from selecting a set of actions that produces better performance on that domain. We have carried out extensive experiments on these concepts in the area of object recognition in a dynamic changing environment, and we have achieved enormous success, which will enable us to proceed on our basic concepts.
https://doi.org/10.14400/JDPM.2012.10.10.215 인용 PDF

Microtube Light-Emitting Diode Arrays with Metal Cores

Tchoe, Youngbin;Lee, Chul-Ho;Park, Junbeom;Baek, Hyeonjun;Chung, Kunook;Jo, Janghyun;Kim, Miyoung;Yi, Gyu-Chul
- Proceedings of the Korean Vacuum Society Conference
- /
- 2016.02a
- /
- pp.287.1-287.1
- /
- 2016
Three-dimensional (3-D) semiconductor nanoarchitectures, including nano- and micro- rods, pyramids, and disks, are emerging as one of the most promising elements for future optoelectronic devices. Since these 3-D semiconductor nanoarchitectures have many interesting unconventional properties, including the use of large light-emitting surface area and semipolar/nonpolar nano- or micro-facets, numerous studies reported on novel device applications of these 3-D nanoarchitectures. In particular, 3-D nanoarchitecture devices can have noticeably different current spreading characteristics compared with conventional thin film devices, due to their elaborate 3-D geometry. Utilizing this feature in a highly controlled manner, color-tunable light-emitting diodes (LEDs) were demonstrated by controlling the spatial distribution of current density over the multifaceted GaN LEDs. Meanwhile, for the fabrication of high brightness, single color emitting LEDs or laser diodes, uniform and high density of electrical current must be injected into the entire active layers of the nanoarchitecture devices. Here, we report on a new device structure to inject uniform and high density of electrical current through the 3-D semiconductor nanoarchitecture LEDs using metal core inside microtube LEDs. In this work, we report the fabrications and characteristics of metal-cored coaxial $GaN/In_xGa_{1-x}N$ microtube LEDs. For the fabrication of metal-cored microtube LEDs, $GaN/In_xGa_{1-x}N/ZnO$ coaxial microtube LED arrays grown on an n-GaN/c-Al2O3 substrate were lifted-off from the substrate by wet chemical etching of sacrificial ZnO microtubes and $SiO_2$ layer. The chemically lifted-off layer of LEDs were then stamped upside down on another supporting substrates. Subsequently, Ti/Au and indium tin oxide were deposited on the inner shells of microtubes, forming n-type electrodes of the metal-cored LEDs. The device characteristics were investigated measuring electroluminescence and current-voltage characteristic curves and analyzed by computational modeling of current spreading characteristics.
PDF

RBM-based distributed representation of language (RBM을 이용한 언어의 분산 표상화)

You, Heejo;Nam, Kichun;Nam, Hosung
- Korean Journal of Cognitive Science
- /
- v.28 no.2
- /
- pp.111-131
- /
- 2017
The connectionist model is one approach to studying language processing from a computational perspective. And building a representation in the connectionist model study is just as important as making the structure of the model in that it determines the level of learning and performance of the model. The connectionist model has been constructed in two different ways: localist representation and distributed representation. However, the localist representation used in the previous studies had limitations in that the unit of the output layer having a rare target activation value is inactivated, and the past distributed representation has the limitation of difficulty in confirming the result by the opacity of the displayed information. This has been a limitation of the overall connection model study. In this paper, we present a new method to induce distributed representation with local representation using abstraction of information, which is a feature of restricted Boltzmann machine, with respect to the limitation of such representation of the past. As a result, our proposed method effectively solves the problem of conventional representation by using the method of information compression and inverse transformation of distributed representation into local representation.
https://doi.org/10.19066/cogsci.2017.28.2.002 인용 PDF

CompGenX: Component Code Generation System based on GenVoca and XML (CompGenX: GenVoca와 XML 기반의 컴포넌트 코드 생성 시스템)

Choi Seung-Hoon
- Journal of Internet Computing and Services
- /
- v.4 no.3
- /
- pp.57-67
- /
- 2003
Software product lines are to attain the rapid development of qualify applications by concretizing the general components populated in software assets and assembling them according to the predefined architectures. For supporting the construction of the software product lines, this paper proposes a component code generation techniques based on GenVoca architecture and XML/XSLT technologies, In addition, CompGenX(Component Generator using XML), a component code generation system, is proposed on the basis of this techniques. By providing reconfigurability of component at the time of code generation, CompGenX allows the reusers to create the component source code that is appropriate to their purpose, In this system, the process of the component development is divided into two tasks which are the component family construction task and the component reuse task, For the component family construction, CompGenX provides the feature modeling tool for domain analysis and the domain architecture definition tool. Also, it provides the tool for building the component configuration know1edge specification and the code templates, For the component reuse task, it offers the component family search tool. the component customizing tool and the component code generator. Component code generation techniques and system in this paper should be applicable as basic technology to build the component-based software product lines.
PDF

Mathematical Modeling of the Novel Influenza A (H1N1) Virus and Evaluation of the Epidemic Response Strategies in the Republic of Korea (수학적 모델을 이용한 신종인플루엔자 환자 예측 및 대응 전략 평가)

Suh, Min-A;Lee, Jee-Hyun;Chi, Hye-Jin;Kim, Young-Keun;Kang, Dae-Yong;Hur, Nam-Wook;Ha, Kyung-Hwa;Lee, Dong-Han;Kim, Chang-Soo
- Journal of Preventive Medicine and Public Health
- /
- v.43 no.2
- /
- pp.109-116
- /
- 2010
Objectives: The pandemic of novel influenza A (H1N1) virus has required decision-makers to act in the face of the substantial uncertainties. In this study, we evaluated the potential impact of the pandemic response strategies in the Republic of Korea using a mathematical model. Methods: We developed a deterministic model of a pandemic (H1N1) 2009 in a structured population using the demographic data from the Korean population and the epidemiological feature of the pandemic (H1N1) 2009. To estimate the parameter values for the deterministic model, we used the available data from the previous studies on pandemic influenza. The pandemic response strategies of the Republic of Korea for novel influenza A (H1N1) virus such as school closure, mass vaccination (70% of population in 30 days), and a policy for anti-viral drug (treatment or prophylaxis) were applied to the deterministic model. Results: The effect of two-week school closure on the attack rate was low regardless of the timing of the intervention. The earlier vaccination showed the effect of greater delays in reaching the peak of outbreaks. When it was no vaccination, vaccination at initiation of outbreak, vaccination 90 days after the initiation of outbreak and vaccination at the epidemic peak point, the total number of clinical cases for 400 days were 20.8 million, 4.4 million, 4.7 million and 12.6 million, respectively. The pandemic response strategies of the Republic of Korea delayed the peak of outbreaks (about 40 days) and decreased the number of cumulative clinical cases (8 million). Conclusions: Rapid vaccination was the most important factor to control the spread of pandemic influenza, and the response strategies of the Republic of Korea were shown to delay the spread of pandemic influenza in this deterministic model.
https://doi.org/10.3961/jpmph.2010.43.2.109 인용 PDF KSCI

Search Result 639, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)