• Title/Summary/Keyword: Automatic Classification System

Search Result 417, Processing Time 0.028 seconds

AutoCor: A Query Based Automatic Acquisition of Corpora of Closely-related Languages

  • Dimalen, Davis Muhajereen D.;Roxas, Rachel Edita O.
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2007.11a
    • /
    • pp.146-154
    • /
    • 2007
  • AutoCor is a method for the automatic acquisition and classification of corpora of documents in closely-related languages. It is an extension and enhancement of CorpusBuilder, a system that automatically builds specific minority language corpora from a closed corpus, since some Tagalog documents retrieved by CorpusBuilder are actually documents in other closely-related Philippine languages. AutoCor used the query generation method odds ratio, and introduced the concept of common word pruning to differentiate between documents of closely-related Philippine languages and Tagalog. The performance of the system using with and without pruning are compared, and common word pruning was found to improve the precision of the system.

  • PDF

Implementation of DTW-kNN-based Decision Support System for Discriminating Emerging Technologies (DTW-kNN 기반의 유망 기술 식별을 위한 의사결정 지원 시스템 구현 방안)

  • Jeong, Do-Heon;Park, Ju-Yeon
    • Journal of Industrial Convergence
    • /
    • v.20 no.8
    • /
    • pp.77-84
    • /
    • 2022
  • This study aims to present a method for implementing a decision support system that can be used for selecting emerging technologies by applying a machine learning-based automatic classification technique. To conduct the research, the architecture of the entire system was built and detailed research steps were conducted. First, emerging technology candidate items were selected and trend data was automatically generated using a big data system. After defining the conceptual model and pattern classification structure of technological development, an efficient machine learning method was presented through an automatic classification experiment. Finally, the analysis results of the system were interpreted and methods for utilization were derived. In a DTW-kNN-based classification experiment that combines the Dynamic Time Warping(DTW) method and the k-Nearest Neighbors(kNN) classification model proposed in this study, the identification performance was up to 87.7%, and particularly in the 'eventual' section where the trend highly fluctuates, the maximum performance difference was 39.4% points compared to the Euclidean Distance(ED) algorithm. In addition, through the analysis results presented by the system, it was confirmed that this decision support system can be effectively utilized in the process of automatically classifying and filtering by type with a large amount of trend data.

Automatic Payload Signature Update System for Classification of Recent Network Applications (최신 네트워크 응용 분류를 위한 자동화 페이로드 시그니쳐 업데이트 시스템)

  • Shim, Kyu-Seok;Goo, Young-Hoon;Lee, Sung-Ho;Sija, Baraka D.;Kim, Myung-Sup
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.1
    • /
    • pp.98-107
    • /
    • 2017
  • In these days, the increase of applications that highly use network resources has revealed the limitations of the current research phase from the traffic classification for network management. Various researches have been conducted to solutions for such limitations. The representative study is automatic finding of the common pattern of traffic. However, since the study of automatic signature generation is a semi-automatic system, users should collect the traffic. Therefore, these limitations cause problems in the traffic collection step leading to untrusted accuracy of the signature verification process because it does not contain any of the generated signature. In this paper, we propose an automated traffic collection, signature management, signature generation and signature verification process to overcome the limitations of the automatic signature update system. By applying the proposed method in the campus network, actual traffic signatures maintained the completeness with no false-positive.

A Study on the Automatic Pulse Classification Method for Non-cooperative Bi-static Sonar System (비협동 양상태 소나 시스템을 위한 펄스식별 자동화 기법 연구)

  • Kim, Geun Hwan;Yoon, Kyung Sik;Kim, Seong il;Jeong, Eui Cheol;Lee, Kyun Kyung
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.21 no.2
    • /
    • pp.158-165
    • /
    • 2018
  • Recently there is a great interest in the bi-static sonar. However, since the transmitter and the receiver operate on different platforms, it may be necessary to operate the system in a non-cooperative mode. In this situation, the detection and localization performance are limited. Therefore, it is necessary to classify the received pulse from the transmitter to overcome the performance limitation. In this paper, we proposed a robust automatic pulse classification method that can be applied to real systems. The proposed method eliminates the effects of noise and multipath propagation through post-processing and improves the pulse classification performance. We also verified the proposed method through the sea experimental data.

Semi-Automatic Management of Classification Scheme with Interoperability (상호운용적 분류체계 관리를 위한 반자동 분류체계 관리방안)

  • Lee, Won-Goo;Shin, Sung-Ho;Kim, Kwang-Young;Jeon, Do-Heon;Yoon, Hwa-Mook;Sung, Won-Kyung;Lee, Min-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.12
    • /
    • pp.466-474
    • /
    • 2011
  • Under the knowledge-based economy in 21C, the convergence and complexity in science and technology are being more active. Therefore, we have science and technology are classified properly, make not easy to construct the system to new next generation area. Thus we suggest the systematic solution method to flexibly extend classification scheme in order for content management and service organizations. In this way, we expect that the difficult of classification scheme management is minimized and the expense of it is spared.

Follicular Unit Classification Method Using Angle Variation of Boundary Vector for Automatic Hair Implant System

  • Kim, Hwi Gang;Bae, Tae Wuk;Kim, Kyu Hyung;Lee, Hyung Soo;Lee, Soo In
    • ETRI Journal
    • /
    • v.38 no.1
    • /
    • pp.195-205
    • /
    • 2016
  • This paper presents a novel follicular unit (FU) classification method based on an angle variation of a boundary vector according to the number of hairs in several FU images. The recently developed robotic FU harvest system, ARTAS, classifies through digital imaging the FU type based on the number of hairs with defects in the contour and outline profile of the FU of interest. However, this method has a drawback in that the FU classification is inaccurate because it causes unintended defects in the outline profile of the FU. To overcome this drawback, the proposed method classifies the FU's type by the number of variation points that are calculated using an angle variation a boundary vector. The experimental results show that the proposed method is robust and accurate for various FU shapes, compared to the contour-outline profile FU classification method of the ARTAS system.

User Interface Application for Cancer Classification using Histopathology Images

  • Naeem, Tayyaba;Qamar, Shamweel;Park, Peom
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.17 no.2
    • /
    • pp.91-97
    • /
    • 2021
  • User interface for cancer classification system is a software application with clinician's friendly tools and functions to diagnose cancer from pathology images. Pathology evolved from manual diagnosis to computer-aided diagnosis with the help of Artificial Intelligence tools and algorithms. In this paper, we explained each block of the project life cycle for the implementation of automated breast cancer classification software using AI and machine learning algorithms to classify normal and invasive breast histology images. The system was designed to help the pathologists in an automatic and efficient diagnosis of breast cancer. To design the classification model, Hematoxylin and Eosin (H&E) stained breast histology images were obtained from the ICIAR Breast Cancer challenge. These images are stain normalized to minimize the error that can occur during model training due to pathological stains. The normalized dataset was fed into the ResNet-34 for the classification of normal and invasive breast cancer images. ResNet-34 gave 94% accuracy, 93% F Score, 95% of model Recall, and 91% precision.

Readability Enhancement of English Speech Recognition Output Using Automatic Capitalisation Classification (자동 대소문자 식별을 이용한 영어 음성인식 결과의 가독성 향상)

  • Kim, Ji-Hwan
    • MALSORI
    • /
    • no.61
    • /
    • pp.101-111
    • /
    • 2007
  • A modified speech recogniser have been proposed for automatic capitalisation generation to improve the readability of English speech recognition output. In this modified speech recogniser, every word in its vocabulary is duplicated: once in a de-caplitalised form and again in the capitalised forms. In addition its language model is re-trained on mixed case texts. In order to evaluate the performance of the proposed system, experiments of automatic capitalisation generation were performed for 3 hours of Broadcast News(BN) test data using the modified HTK BN transcription system. The proposed system produced an F-measure of 0.7317 for automatic capitalisation generation with an SER of 48.55, a precision of 0.7736 and a recall of 0.6942.

  • PDF

Development of System Model for Integrated Information Management of Construction Material (건설자재 통합정보 관리를 위한 시스템 모델 구현)

  • Han, Choong-Han;Ju, Ki-Bum
    • The KIPS Transactions:PartD
    • /
    • v.16D no.3
    • /
    • pp.433-440
    • /
    • 2009
  • As information technology of constructional area develops recently, web-based on-line system is rapidly increasing to provide information on diverse constructional materials so as to enhance productivity of constructional business and to reduce cost. Since the constructional materials information provided by these systems, i.e., quality, specification, etc are not standardized, however, the staffs on the constructional site suffer considerable difficulties in using materials information when acquiring information on specific materials, e.g., using diverse information systems or repeating similar jobs. Thus, this research typified information items of constructional materials on the basis of GDAS and designed multi system model to control integrated information on constructional materials. This system can efficiently control and utilize materials information by supporting automatic classification of constructional materials to which OmniClass Part-22 and UNSPSC are applied, conditional complex retrieval of materials information, real-time automatic embodiment of electronic catalog and retrieving/controlling RFID.

Recognition of Raised Characters for Automatic Classification of Rubber Tires (고무타이어 자동분류를 위한 돌출문자 인식)

  • 함영국;강민석;정홍규;박래홍;박귀태
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.4
    • /
    • pp.77-87
    • /
    • 1994
  • This paper presents recognition of raised alphanumeric markings on rubber tires for their automatic classification. Raised alphanumeric markings on rubber tires have different characteristics as compared to those of printed characters. In the preprocessing step, we first determine the rotation angle using the Hough transform and align markings, then separate each character using vertical and horizontal projections. In the recognition step, we use several features such as width of a character, cross point, partial projection, and distance feature to recognize characters hierarchically. The computer simulation result shows that the proposed system can be successfully applied to the industrial automation of rubber tires classification.

  • PDF