• Title/Summary/Keyword: Extraction system

Search Result 3,450, Processing Time 0.027 seconds

Automatic Extraction of Hangul Stroke Element Using Faster R-CNN for Font Similarity (글꼴 유사도 판단을 위한 Faster R-CNN 기반 한글 글꼴 획 요소 자동 추출)

  • Jeon, Ja-Yeon;Park, Dong-Yeon;Lim, Seo-Young;Ji, Yeong-Seo;Lim, Soon-Bum
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.8
    • /
    • pp.953-964
    • /
    • 2020
  • Ever since media contents took over the world, the importance of typography has increased, and the influence of fonts has be n recognized. Nevertheless, the current Hangul font system is very poor and is provided passively, so it is practically impossible to understand and utilize all the shape characteristics of more than six thousand Hangul fonts. In this paper, the characteristics of Hangul font shapes were selected based on the Hangul structure of similar fonts. The stroke element detection training was performed by fine tuning Faster R-CNN Inception v2, one of the deep learning object detection models. We also propose a system that automatically extracts the stroke element characteristics from characters by introducing an automatic extraction algorithm. In comparison to the previous research which showed poor accuracy while using SVM(Support Vector Machine) and Sliding Window Algorithm, the proposed system in this paper has shown the result of 10 % accuracy to properly detect and extract stroke elements from various fonts. In conclusion, if the stroke element characteristics based on the Hangul structural information extracted through the system are used for similar classification, problems such as copyright will be solved in an era when typography's competitiveness becomes stronger, and an automated process will be provided to users for more convenience.

KONG-DB: Korean Novel Geo-name DB & Search and Visualization System Using Dictionary from the Web (KONG-DB: 웹 상의 어휘 사전을 활용한 한국 소설 지명 DB, 검색 및 시각화 시스템)

  • Park, Sung Hee
    • Journal of the Korean Society for information Management
    • /
    • v.33 no.3
    • /
    • pp.321-343
    • /
    • 2016
  • This study aimed to design a semi-automatic web-based pilot system 1) to build a Korean novel geo-name, 2) to update the database using automatic geo-name extraction for a scalable database, and 3) to retrieve/visualize the usage of an old geo-name on the map. In particular, the problem of extracting novel geo-names, which are currently obsolete, is difficult to solve because obtaining a corpus used for training dataset is burden. To build a corpus for training data, an admin tool, HTML crawler and parser in Python, crawled geo-names and usages from a vocabulary dictionary for Korean New Novel enough to train a named entity tagger for extracting even novel geo-names not shown up in a training corpus. By means of a training corpus and an automatic extraction tool, the geo-name database was made scalable. In addition, the system can visualize the geo-name on the map. The work of study also designed, implemented the prototype and empirically verified the validity of the pilot system. Lastly, items to be improved have also been addressed.

A Study on the Semiautomatic Construction of Domain-Specific Relation Extraction Datasets from Biomedical Abstracts - Mainly Focusing on a Genic Interaction Dataset in Alzheimer's Disease Domain - (바이오 분야 학술 문헌에서의 분야별 관계 추출 데이터셋 반자동 구축에 관한 연구 - 알츠하이머병 유관 유전자 간 상호 작용 중심으로 -)

  • Choi, Sung-Pil;Yoo, Suk-Jong;Cho, Hyun-Yang
    • Journal of Korean Library and Information Science Society
    • /
    • v.47 no.4
    • /
    • pp.289-307
    • /
    • 2016
  • This paper introduces a software system and process model for constructing domain-specific relation extraction datasets semi-automatically. The system uses a set of terms such as genes, proteins diseases and so forth as inputs and then by exploiting massive biological interaction database, generates a set of term pairs which are utilized as queries for retrieving sentences containing the pairs from scientific databases. To assess the usefulness of the proposed system, this paper applies it into constructing a genic interaction dataset related to Alzheimer's disease domain, which extracts 3,510 interaction-related sentences by using 140 gene names in the area. In conclusion, the resulting outputs of the case study performed in this paper indicate the fact that the system and process could highly boost the efficiency of the dataset construction in various subfields of biomedical research.

Optimization of Acquisition Time of Beta-Gamma Coincidence Counting System for Radioxenon Measurement (방사성제논 탐지를 위한 베타-감마 동시 계측시스템의 측정시간 최적화)

  • Byun, Jong-In;Park, Hong-Mo;Choi, Hee-Yeoul;Song, Myeong-Han;Yun, Ju-Yong
    • Journal of Radiation Protection and Research
    • /
    • v.40 no.3
    • /
    • pp.181-186
    • /
    • 2015
  • Measurement of xenon radioisotopes from nuclear fission is a key element for monitoring underground nuclear weapon tests. $^{131m}Xe$, $^{133}Xe$, $^{133}mXe$ and $^{135}Xe$ in the air can be detected via low background systems such as a beta-gamma coincidence counting system. Radioxenon monitoring is performed through air sampling, xenon extraction, measurement and spectrum analysis. The minimum detectable concentration of $^{135}Xe$ can be significantly variable depending on the sampling time, extraction time and data acquisition time due to its short half-life. In order to optimize the acquisition time with respect to certain experimental parameters such as sampling and xenon extraction, theoretical approach and experiment using SAUNA system were performed to determine the time to minimize the minimum detectable concentration, which the results were discussed.

Lane Violation Detection System Using Feature Tracking (특징점 추적을 이용한 끼어들기 위반차량 검지 시스템)

  • Lee, Hee-Sin;Lee, Joon-Whoan
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.8 no.2
    • /
    • pp.36-44
    • /
    • 2009
  • In this paper, we suggest a system of detecting a vehicle with lane violation, which can detect the vehicle with lane violation, by using the feature point tracking. The whole algorithm in the suggested system of detecting a vehicle with lane violation is composed of three stages such as feature extraction, register and tracking in feature for the tracking-targeted vehicle, and detecting a vehicle with lane violation. In the stage of feature extraction, the feature is extracted from the inputted image by sing the feature-extraction algorithm available for the real-time processing. The extracted features are again selected the racking-targeted feature. The registered feature is tracked by using NCC(normalized cross correlation). Finally, whether or not lane violation is finally detected by using information on the tracked features. As a result of experimenting the suggested system by using the acquired image in the section with a ban on intervention, the excellent performance was shown with 99.09% for positive recognition ratio and 0.9% for error ratio. The fast processing speed could be obtained in 34.48 frames per second available for real-time processing.

  • PDF

Automatic Keyword Extraction System for Korean Documents Information Retrieval (국내(國內) 문헌정보(文獻情報) 검색(檢索)을 위한 키워드 자동추출(自動抽出) 시스템 개발(開發))

  • Yae, Yong-Hee
    • Journal of Information Management
    • /
    • v.23 no.1
    • /
    • pp.39-62
    • /
    • 1992
  • In this paper about 60 auxiliary words and 320 stopwords are selected from analysis of sample data, four types of stop word are classified left, right and - auxiliary word truncation & normal. And a keyword extraction system is suggested which undertakes efficient truncation of auxiliary word from words, conversion of Chinese word to Korean and exclusion of stopword. The selected keyeords in this system show 92.2% of accordance ratio compared with manually selected keywords by expert. And then compound words consist of $4{\sim}6$ character generate twice of additional new words and 58.8% words of those are useful as keyword.

  • PDF

Three-phase Four-wire Series Active Power Filter Control Strategy for The Compensation of Harmonics and Reactive Power Based-on Direct Compensating Voltage Extraction Method (직접 보상전압 추출기법을 이용하여 고조파전류와 무효전력을 보상하는 3상 4선식 직렬 형 능동전력필터의 제어법)

  • 김진선;김영석
    • The Transactions of the Korean Institute of Power Electronics
    • /
    • v.9 no.3
    • /
    • pp.213-221
    • /
    • 2004
  • In recent years, since more and more diode rectifiers with smoothing dc capacitor are used in electronic equipments, household appliances and ac drives, harmonics generated by these loads have become a major issue. In addition, 3-phase 4-wire system is widely employed in distributing electric energy to several office building and manufacturing plants. This systems show excessive currents in the neutral. These neutral currents are fundamentally third harmonic, and their presence is tied to wiring failure, elevating of neutral potentials, transformer overheating, etc. In response to the concerns, this paper proposes a series active power filter scheme based on direct compensating voltage extraction method and the advantage of this control algorithm is direct extraction of compensation voltage reference without multiplying gain. Therefore, the calculation of the compensation voltage reference will becom much simpler than other control algorithm. To verify the effectiveness of the proposed algorithm, a prototype active power filter is built and some experiments are carried out.

A Study on the Extraction of Horizontal Alignment and Cross-Section of Roads using Mobile Laser Scanning Data (모바일 레이저 스캐닝 데이터를 이용한 도로선형 및 횡단면 추출에 관한 연구)

  • Kim, Se-Geun;Lee, Hyun-Yong;Joo, Young-Eun
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.9 no.3
    • /
    • pp.207-218
    • /
    • 2006
  • The extraction of horizontal alignment and cross-section of roads is very important task in road safety diagnosis. Existing road safety diagnosis methods by investigators need much time and expense but don't provide various data. Therefor, we need road shape classification automatically and extraction method of horizontal alignment and cross-section of roads through digital photogrammetry system using GPS-VAN with laser scanner. In this paper, we propose a method of mobile laser scanning data acquisition, processing and developing extraction methods of horizontal alignment and cross-section of roads using mobile laser scanning data by GPS-VAN.

  • PDF

Face Detection System Based on Candidate Extraction through Segmentation of Skin Area and Partial Face Classifier (피부색 영역의 분할을 통한 후보 검출과 부분 얼굴 분류기에 기반을 둔 얼굴 검출 시스템)

  • Kim, Sung-Hoon;Lee, Hyon-Soo
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.47 no.2
    • /
    • pp.11-20
    • /
    • 2010
  • In this paper we propose a face detection system which consists of a method of face candidate extraction using skin color and a method of face verification using the feature of facial structure. Firstly, the proposed extraction method of face candidate uses the image segmentation and merging algorithm in the regions of skin color and the neighboring regions of skin color. These two algorithms make it possible to select the face candidates from the variety of faces in the image with complicated backgrounds. Secondly, by using the partial face classifier, the proposed face validation method verifies the feature of face structure and then classifies face and non-face. This classifier uses face images only in the learning process and does not consider non-face images in order to use less number of training images. In the experimental, the proposed method of face candidate extraction can find more 9.55% faces on average as face candidates than other methods. Also in the experiment of face and non-face classification, the proposed face validation method obtains the face classification rate on the average 4.97% higher than other face/non-face classifiers when the non-face classification rate is about 99%.

Rotation and Translation Invariant Feature Extraction Using Angular Projection in Frequency Domain (주파수 영역에서 각도 투영법을 이용한 회전 및 천이 불변 특징 추출)

  • Lee, Bum-Shik;Kim, Mun-Churl
    • Journal of the HCI Society of Korea
    • /
    • v.1 no.2
    • /
    • pp.27-33
    • /
    • 2006
  • This paper presents a new approach to translation and rotation invariant feature extraction for image texture retrieval. For the rotation invariant feature extraction, we invent angular projection along angular frequency in Polar coordinate system. The translation and rotation invariant feature vector for representing texture images is constructed by the averaged magnitude and the standard deviations of the magnitude of the Fourier transform spectrum obtained by the proposed angular projection. In order to easily implement the angular projection, the Radon transform is employed to obtain the Fourier transform spectrum of images in the Polar coordinate system. Then, angular projection is applied to extract the feature vector. We present our experimental results to show the robustness against the image rotation and the discriminatory capability for different texture images using MPEG-7 data set. Our Experiment result shows that the proposed rotation and translation invariant feature vector is effective in retrieval performance for the texture images with homogeneity, isotropy and local directionality.

  • PDF