• Title/Abstract/Keywords: Grouping analysis

Search results: 586 (processing time: 0.028 s)

A Study of Main Contents Extraction from Web News Pages based on XPath Analysis

  • Sun, Bok-Keun
    • 한국컴퓨터정보학회논문지 / Vol. 20, No. 7 / pp. 1-7 / 2015
  • Data on the internet can serve as a source for fields such as information retrieval (IR), data mining, and knowledge information services, but it also contains a lot of unnecessary information. Removing this unnecessary data is a problem that must be solved before studying knowledge-based information services built on web page data. In this paper, we solve the problem by implementing XTractor (XPath Extractor). Since XPath is used to navigate the elements and attributes of an XML document, XTractor carries out its analysis on XPaths. XTractor extracts the main text by parsing the HTML, grouping XPaths, and detecting the XPath that contains the main data. In experiments on a large amount of data, the recognition rate and precision were 97.9% and 93.9%, respectively, and, except for a few cases, it was confirmed that the main text of news pages can be extracted properly.
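The pipeline named in this abstract (HTML parsing, XPath grouping, detecting the XPath that holds the main data) can be illustrated with a minimal sketch. This is not XTractor itself: the abstract does not state how the main XPath is detected, so the "largest total text length" rule, the class name `XPathTextCollector`, and the sample page below are all assumptions made for illustration.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class XPathTextCollector(HTMLParser):
    """Group every text node under the index-free XPath of its parent element."""

    def __init__(self):
        super().__init__()
        self.stack = []                  # currently open tags, root first
        self.groups = defaultdict(list)  # XPath -> list of text fragments

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag; tolerate unclosed children.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text and self.stack and self.stack[-1] not in ("script", "style"):
            self.groups["/" + "/".join(self.stack)].append(text)


def extract_main_text(html):
    """Return (xpath, text) of the group with the most text (assumed heuristic)."""
    parser = XPathTextCollector()
    parser.feed(html)
    parser.close()
    if not parser.groups:
        return "", ""
    best = max(parser.groups, key=lambda p: sum(len(t) for t in parser.groups[p]))
    return best, " ".join(parser.groups[best])


# Hypothetical news page used only to exercise the sketch.
SAMPLE_PAGE = """<html><body>
<div id="nav"><a>Home</a><a>Sports</a></div>
<div id="article"><p>First paragraph of the news story, fairly long.</p>
<p>Second paragraph with more detail about the event.</p></div>
<div id="footer"><span>Copyright</span></div>
</body></html>"""

main_xpath, main_text = extract_main_text(SAMPLE_PAGE)
```

Grouping by index-free path means that sibling paragraphs of an article fall into one group, while navigation links and footers land in other groups; a real extractor would likely need a more careful scoring rule than raw text length.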

${\nabla}^2G$ 연산자의 신호 분석 특성을 이용한 음성 인식 신경 회로망에 관한 연구 (Neural Network for Speech Recognition Using Signal Analysis Characteristics by ${\nabla}^2G$ Operator)

  • 이종혁;정용근;남기곤;윤태훈;김재창;박의열;이양성
    • 전자공학회논문지B / Vol. 29B, No. 10 / pp. 90-99 / 1992
  • In this paper, we propose a neural network model for speech recognition. The model consists of a feature extraction part and a recognition part. An interconnection model based on the ${\nabla}^2G$ operator is used for frequency analysis, and two features, a global feature and a local feature, are extracted from it. The recognition part consists of a global grouping stage and a local grouping stage. When the input pattern was coded by the slope method, the recognition rate for two speakers, A and B, was 100%. In a test with data from 9 speakers, a recognition rate of 91.4% was obtained.

실시간 오차 보정을 위한 열변형 오차 모델의 최적 변수 선택 (Optimal Variable Selection in a Thermal Error Model for Real Time Error Compensation)

  • 황석현;이진현;양승한
    • 한국정밀공학회지 / Vol. 16, No. 3 (Serial No. 96) / pp. 215-221 / 1999
  • The objective of a thermal error compensation system for machine tools is to improve machine accuracy through real-time error compensation. The accuracy of the machine tool depends entirely on the accuracy of the thermal error model, which can be obtained from an appropriate combination of temperature variables. The proposed method for optimal variable selection in the thermal error model is based on correlation grouping and successive regression analysis. Correlation grouping mitigates the collinearity problem, and a judgment function that minimizes the residual mean square is used. The linear model is more robust against measurement noise than an engineering judgment model that includes higher-order terms of the variables. The proposed method is well suited to real-time error compensation because of its reduced computation time, sufficient model accuracy, and robustness.
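The correlation grouping step can be sketched as follows. The abstract does not give the grouping rule, so the greedy "join the first group whose leader correlates above a threshold" rule, the threshold of 0.9, the sensor names, and the readings are all assumptions; the successive regression step and the residual-mean-square judgment function are not shown.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_groups(sensors, threshold=0.9):
    """Greedy grouping: a sensor joins the first group whose leader it tracks."""
    groups = []
    for name, series in sensors.items():
        for group in groups:
            if abs(pearson(series, sensors[group[0]])) >= threshold:
                group.append(name)
                break
        else:
            groups.append([name])  # no similar group found: start a new one
    return groups


def pick_representatives(sensors, error, groups):
    """Keep, per group, the sensor most correlated with the measured error."""
    return [max(g, key=lambda s: abs(pearson(sensors[s], error))) for g in groups]


# Hypothetical temperature readings (deg C) and thermal error (um).
sensors = {
    "T1": [20.0, 21.0, 22.0, 23.0, 24.0],
    "T2": [20.1, 21.2, 21.9, 23.1, 24.0],  # nearly collinear with T1
    "T3": [30.0, 29.0, 31.0, 28.0, 30.0],
}
error = [1.0, 1.5, 2.1, 2.4, 3.0]

groups = correlation_groups(sensors)
representatives = pick_representatives(sensors, error, groups)
```

Keeping one variable per group removes nearly collinear temperature inputs before any regression is fitted, which is the effect the abstract attributes to correlation grouping.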

AHP - 군집분석을 이용한 주요어종의 자원감소 원인 비교분석에 관한 연구 (The Comparative Analysis of the Reasons for Decreases in Marine Fishery Resources Based on AHP & Cluster Analysis)

  • 박철형;이상고
    • 수산경영론집 / Vol. 40, No. 3 / pp. 127-146 / 2009
  • This study estimates the factor weights of the reasons for decreases in marine fishery resources using the Analytic Hierarchy Process (AHP). It then classifies 20 fish species covered by a fishery resource recovery plan into groups according to these factor weights using non-hierarchical cluster analysis. The factors behind the decreases are identified as bio-ecological, technology-system, economic-business, and fishing village-society factors. The two most important factors turn out to be the economic-business and bio-ecological factors, estimated at 31% and 30%, respectively. The technology-system and fishing village-society factors are estimated at 21% and 18%, respectively. K-means cluster analysis is applied to classify the 20 species into 2, 3, and 4 groups, in conjunction with ANOVA to identify statistical differences in the factors. In the 2-group case, the economic-business and bio-ecological factors again play the main role in the grouping. In the 3-group case, a third group appears in which all 4 factors play about the same role. Finally, in the 4-group case, the economic-business and bio-ecological factors turn out to be equally important in the 4th group.
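As a rough illustration of grouping species by their factor weights with K-means, the sketch below clusters four hypothetical weight vectors into two groups. The species labels and weights are invented for the example and are not taken from the study; the deterministic "first k points" initialization is also an assumption chosen to keep the result reproducible.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two weight vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iterations=100):
    """Plain K-means; centroids start at the first k points (assumption)."""
    centroids = [list(p) for p in points[:k]]
    labels = None
    for _ in range(max_iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical AHP weights per species, in the order
# (bio-ecological, technology-system, economic-business, fishing village-society).
weights = {
    "species_a": (0.40, 0.15, 0.30, 0.15),
    "species_b": (0.20, 0.20, 0.45, 0.15),
    "species_c": (0.42, 0.14, 0.28, 0.16),
    "species_d": (0.18, 0.22, 0.44, 0.16),
}

labels, centroids = kmeans(list(weights.values()), k=2)
groups = dict(zip(weights, labels))
```

In this toy data, the species dominated by the bio-ecological weight end up in one cluster and those dominated by the economic-business weight in the other; an ANOVA across clusters, as the abstract describes, would then test whether the factor means differ.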

OFDM시스템의 PAPR 저감을 위한 심벌 그룹핑 SLM 기법의 성능분석 (Performance Analysis of the Symbol Grouping SLM Scheme for PAPR Reduction of OFDM System)

  • 최익녕;손성찬;오창헌
    • 디지털콘텐츠학회 논문지 / Vol. 5, No. 2 / pp. 143-150 / 2004
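  • To effectively reduce the PAPR (peak-to-average power ratio) problem in OFDM systems, this paper proposes and analyzes a new method that, unlike the conventional SLM (selective mapping) scheme, groups symbols and then applies the same scramble code to each group. The conventional SLM scheme randomizes the OFDM symbol entering the IFFT stage with several scramble codes and transmits the candidate with the smallest PAPR. The SLM scheme therefore incurs bandwidth loss, because it must transmit as much side information as there are scramble codes. In contrast, the proposed symbol grouping SLM scheme groups the subcarriers into M groups that use the same scramble code, so the side information can be reduced by the number of grouped OFDM symbols; and if it uses as many scramble codes as the conventional SLM scheme, it can improve PAPR performance beyond that of conventional SLM.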
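The conventional SLM baseline described in this abstract (scramble the OFDM symbol with several codes, take the IFFT of each, keep the candidate with the lowest PAPR) can be sketched as below. This covers only the baseline, not the proposed symbol grouping variant, whose details the abstract does not fully specify. The 16 subcarriers, the ±1 scramble codes, the fixed random seed, and the all-ones data symbol are assumptions for illustration.

```python
import cmath
import math
import random


def idft(freq):
    """Naive inverse DFT, adequate for a handful of subcarriers."""
    n = len(freq)
    return [
        sum(freq[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def papr_db(time_signal):
    """Peak-to-average power ratio of a time-domain signal, in dB."""
    powers = [abs(s) ** 2 for s in time_signal]
    return 10 * math.log10(max(powers) / (sum(powers) / len(powers)))


def slm(symbols, codes):
    """Return (index, PAPR in dB) of the scramble code with the lowest PAPR."""
    candidates = []
    for index, code in enumerate(codes):
        scrambled = [s * c for s, c in zip(symbols, code)]
        candidates.append((papr_db(idft(scrambled)), index))
    best_papr, best_index = min(candidates)
    return best_index, best_papr


N = 16                    # number of subcarriers (assumption)
symbols = [1 + 0j] * N    # all-ones symbol: worst-case PAPR of 10*log10(N) dB
rng = random.Random(0)    # fixed seed so the example is reproducible
# Code 0 is the identity; the others are hypothetical random +/-1 sequences.
codes = [[1] * N] + [[rng.choice((-1, 1)) for _ in range(N)] for _ in range(7)]

baseline_papr = papr_db(idft(symbols))
best_index, best_papr = slm(symbols, codes)
```

The receiver needs `best_index` to undo the scrambling, which is the side information whose overhead the abstract discusses.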

Relationships between MMPI Scales under Defensive Attitude and Safety and Health Indices

  • Kim, Jong Hwan;Jeong, Byung Yong;Park, Myoung Hwan
    • 대한인간공학회지 / Vol. 35, No. 6 / pp. 611-619 / 2016
  • Objective: This study aims to analyze the relationships between personality factors measured by Minnesota Multiphasic Personality Inventory (MMPI) scales and the indices of safety and health in the shipbuilding industry. Background: Many studies have reported significant relationships between some MMPI subscales and traffic and industrial accidents. Method: This study analyzes 230 male shipyard workers, using their MMPI scores gathered during the recruitment process and their safety and health indices from the performance record over their working period. The ${\chi}^2$ test and one-way ANOVA are used to test the statistical significance of the personality factors. The conventional grouping rule for MMPI scales and other grouping criteria that take into account the tendency to give positive answers on the MMPI test during recruitment are used for the analysis. Results: The Hypomania (Ma) and Psychopathic Deviate (Pd) scales of the MMPI are the main factors related to the safety- and health-related indices for most grouping rules. The Depression (D), Psychasthenia (Pt), Hypochondriasis (Hs), Schizophrenia (Sc), and Masculinity and Femininity (Mf) scales are also related to the safety and health indices. Conclusion and Application: The results can be used to understand the psychological factors in human behavior and safety, and can help professional personnel take the necessary steps to improve safety on the job and to teach safe work methods effectively.

하천수질 오염요소 분석을 근거로 금강수계의 우선정비 대상하천 선정을 위한 집단화 기법적용 (Application of Grouping Method to select Priority Restoration Streams in Geumgang Watershed based on Analysis of Pollution Factors)

  • 이상호;황정재
    • 상하수도학회지 / Vol. 27, No. 5 / pp. 661-669 / 2013
  • River water quality has improved greatly over the past several decades owing to the government's extraordinary expansion of wastewater treatment capacity. This research aims to select priority restoration streams based on chronological data for tributaries in the Geumgang watershed, the main stream area of Chungcheongnam-do province. BOD, phosphorus, and the percentage of sewered population for 15 branch streams were compared using grouping methods. The group D streams under category I, which exceed 3.0 mg/L for BOD and 0.1 mg/L for phosphorus, were the Seoksung, Ganggyung, and Bangchuk streams. The group D streams under category II, which exceed 3.0 mg/L for BOD and have less than the average sewered population of 63.5%, were the Ganggyung, Gilsan, Bangchuk, and Seoksung streams. The streams finally selected from the chronological data, which exceeded the water quality standard and fell below the average percentage of sewered population, were the Seoksung, Ganggyung, and Bangchuk streams. Pollution was more serious in the downstream reaches than in the upstream reaches. River water quality in these watersheds needs to be improved, especially by extending sewer systems as well as wastewater treatment facilities.

Scalable Search based on Fuzzy Clustering for Interest-based P2P Networks

  • Mateo, Romeo Mark A.;Lee, Jae-Wan
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 5, No. 1 / pp. 157-176 / 2011
  • An interest-based P2P network constructs peer connections based on similarities for efficient resource search. A clustering technique that uses peer similarities as data is an effective way to group the most relevant peers. However, the separation of groups produced by clustering lowers the scalability of a P2P network. Moreover, the interest-based approach is concerned only with user-level grouping and does not consider topology awareness of the physical network. This paper proposes an efficient scalable search for interest-based P2P systems. A scalable multi-ring (SMR) based on fuzzy clustering handles the grouping of relevant peers, and the proposed scalable search uses the SMR to scale peer queries. In forming the multi-ring, a minimized route function is used to determine the shortest route for connecting peers on the physical network. Performance evaluation showed that the SMR achieved accurate peer grouping and improved the connectivity rate of the P2P network. The proposed scalable search was also efficient in finding more replicated files throughout the peer network than traditional P2P approaches.
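The abstract does not say which fuzzy clustering algorithm underlies the SMR. Assuming the standard fuzzy c-means membership rule, the sketch below shows how a peer can belong to several interest groups with graded membership rather than to exactly one. The fuzzifier m = 2, the two-dimensional interest vectors, and the cluster centers are hypothetical.

```python
import math


def fuzzy_memberships(point, centers, m=2.0):
    """Fuzzy c-means membership of one point in each cluster (sums to 1)."""
    distances = [math.dist(point, center) for center in centers]
    # A point that sits exactly on a center belongs fully to that cluster.
    if 0.0 in distances:
        return [1.0 if d == 0.0 else 0.0 for d in distances]
    exponent = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_i / d_k) ** exponent for d_k in distances)
        for d_i in distances
    ]


# Hypothetical interest-space centers of two peer groups and one peer.
centers = [(0.0, 0.0), (4.0, 0.0)]
peer = (1.0, 0.0)

memberships = fuzzy_memberships(peer, centers)
```

With distances of 1 and 3 to the two centers and m = 2, the peer is weighted toward the nearer group but keeps a nonzero membership in the other, which is what lets a query reach peers across otherwise separate clusters.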

취약점 별 아티팩트 사례 분석을 통한 아티팩트 그룹핑 연구 : 어도비 플래시 플레이어 취약점을 이용하여 (A Study on Artifact Grouping by Analyzing Artifact Case by Vulnerability : Using Adobe Flash Player Vulnerabilities)

  • 송병관;김선광;권은진;진승택;김종혁;김형철;김민수
    • 융합보안논문지 / Vol. 19, No. 1 / pp. 87-95 / 2019
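  • Damage is increasing as increasingly sophisticated cyber attacks cause many security incidents. Many institutions and companies invest most of their resources solely in infrastructure for incident detection, and are therefore weak in initial response. The first priority in the initial response to an incident is identifying the entry path of the attack, and many current cyber attacks target software vulnerabilities. Therefore, analyzing Windows system artifacts with respect to software vulnerabilities and classifying the analyzed data can support a rapid initial response. This paper thus classifies the artifacts left behind when an attack enters through each piece of software and presents an artifact grouping that can be used in incident analysis.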