• Title/Summary/Keyword: categorization system

Search Result 278, Processing Time 0.026 seconds

A Study on the Product Categorization Model for Efficient Search in On-line Chartering

  • Choi, Hyung-Rim;Park, Nam-kyu;Park, Young-Jae;Park, Yong-Sung;Kang, Si-Hyeob
    • Journal of Navigation and Port Research
    • /
    • v.27 no.3
    • /
    • pp.307-313
    • /
    • 2003
  • Off-line ship chartering is done nearly through the brokers. Because of the international scale of chartering market, brokers spend too much times and costs on searching the most appropriate product which the consumers want. In this research, we propose the on-line Charter Product Categorization Model to search the products efficiently in the Cyber Chartering System. This Model will make concerned parties of the ship chartering to get unified product information efficiently, and the select the most appropriate product. In this research, we classified the ship chartering products into categories of cargo, ship type, and sea routes, and defined mutual relation of each products, and we verified that this classification is necessary to search the products through the product searching experiment.

A Robust Pattern-based Feature Extraction Method for Sentiment Categorization of Korean Customer Reviews (강건한 한국어 상품평의 감정 분류를 위한 패턴 기반 자질 추출 방법)

  • Shin, Jun-Soo;Kim, Hark-Soo
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.12
    • /
    • pp.946-950
    • /
    • 2010
  • Many sentiment categorization systems based on machine learning methods use morphological analyzers in order to extract linguistic features from sentences. However, the morphological analyzers do not generally perform well in a customer review domain because online customer reviews include many spacing errors and spelling errors. These low performances of the underlying systems lead to performance decreases of the sentiment categorization systems. To resolve this problem, we propose a feature extraction method based on simple longest matching of Eojeol (a Korean spacing unit) and phoneme patterns. The two kinds of patterns are automatically constructed from a large amount of POS (part-of-speech) tagged corpus. Eojeol patterns consist of Eojeols including content words such as nouns and verbs. Phoneme patterns consist of leading consonant and vowel pairs of predicate words such as verbs and adjectives because spelling errors seldom occur in leading consonants and vowels. To evaluate the proposed method, we implemented a sentiment categorization system using a SVM (Support Vector Machine) as a machine learner. In the experiment with Korean customer reviews, the sentiment categorization system using the proposed method outperformed that using a morphological analyzer as a feature extractor.

A Study on the Classification of Inventory Items (재고품목의 분류에 관한 연구)

  • 이상도;이강우
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.13 no.21
    • /
    • pp.61-72
    • /
    • 1990
  • In the past, a fundamental technique in inventory control system was to rank the items of a inventory from a distribution by value to draw an ABC curve. Unfortunately the use of ABC categorization technique has created problems and introduced some gross errors into the inventory control system. First, the author considers the possible applications of the technique in the real world and then attempts to discuss ABC fallacies through the literatures. Finally, an inventory categorization algorithm by the principal component analysis is derived and a numerical example is also presented to illustrate the algorithm.

  • PDF

Development of Categorization System for Efficient Calculation of Damage Cost according to Strong Wind (강풍 피해에 따른 피해비용의 효율적인 산정을 위한 분류체계 개발)

  • Song, Chang Young;Lee, Jong Hoon
    • Journal of the Korean Society of Safety
    • /
    • v.31 no.2
    • /
    • pp.127-132
    • /
    • 2016
  • In this study, the plan to construct a disaster information categorization system that can be objectively and efficiently performed was suggested in order to perform disaster management task systematically. Recently, the damage of natural disasters is gradually growing larger and faster, increasing the economic loss. Especially, as for the domestic storm damage, the damage from strong wind was found to be greater than the damage from torrential rain. Also, strong wind was found to be inflicting a great damage on human life, property and agricultural crops, so the necessity to study damage restoration from strong wind is increasing. Nevertheless, the damage items categorized in the domestic disaster year book are often comprehensive or unclear in criteria, and thus fail to reflect items or matters due to actual disaster damage. It is difficult to aggregate damage accurately such that it does not correspond to the national compensation scope or the damage amount is calculated according to subjective judgment of the investigator in charge. As such, if the disaster information management is inadequate by not applying accurate categorization criteria from damage amount calculation, there can be an issue with fairness when paying the damage support aid. Therefore, this study suggested a categorization plan for objective and efficient execution of disaster information management task in order to resolve such issues. It is expected that quick and efficient execution would be possible in disaster information management and task procedure domestically by constructing systematic categorization system related to disaster information.

Improving the Performance of a Fast Text Classifier with Document-side Feature Selection (문서측 자질선정을 이용한 고속 문서분류기의 성능향상에 관한 연구)

  • Lee, Jae-Yun
    • Journal of Information Management
    • /
    • v.36 no.4
    • /
    • pp.51-69
    • /
    • 2005
  • High-speed classification method becomes an important research issue in text categorization systems. A fast text categorization technique, named feature value voting, is introduced recently on the text categorization problems. But the classification accuracy of this technique is not good as its classification speed. We present a novel approach for feature selection, named document-side feature selection, and apply it to feature value voting method. In this approach, there is no feature selection process in learning phase; but realtime feature selection is executed in classification phase. Our results show that feature value voting with document-side feature selection can allow fast and accurate text classification system, which seems to be competitive in classification performance with Support Vector Machines, the state-of-the-art text categorization algorithms.

A Research on Enhancement of Text Categorization Performance by using Okapi BM25 Word Weight Method (Okapi BM25 단어 가중치법 적용을 통한 문서 범주화의 성능 향상)

  • Lee, Yong-Hun;Lee, Sang-Bum
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.12
    • /
    • pp.5089-5096
    • /
    • 2010
  • Text categorization is one of important features in information searching system which classifies documents according to some criteria. The general method of categorization performs the classification of the target documents by eliciting important index words and providing the weight on them. Therefore, the effectiveness of algorithm is so important since performance and correctness of text categorization totally depends on such algorithm. In this paper, an enhanced method for text categorization by improving word weighting technique is introduced. A method called Okapi BM25 has been proved its effectiveness from some information retrieval engines. We applied Okapi BM25 and showed its good performance in the categorization. Various other words weights methods are compared: TF-IDF, TF-ICF and TF-ISF. The target documents used for this experiment is Reuter-21578, and SVM and KNN algorithms are used. Finally, modified Okapi BM25 shows the most excellent performance.

A Study for Domain Categorization and Estimation of Complexity for Reliability Improvement of Domain Analysis (도메인 분석의 신뢰성 향상을 위한 도메인 분류와 복잡도 측정에 관한 연구)

  • Lee, Eun-Ser
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.1
    • /
    • pp.1-6
    • /
    • 2016
  • Domain analysis is an important component for reliability of development project. Domain analysis error have an effect in the whole system. As a result, the system reliability will be deteriorated. Therefore, we need a methodology to analyze domain characteristic for a reliable analysis in the domain analysis phase. In this paper, we propose a methodology for domain categorization and estimation of complexity for reliability improvement of domain analysis.

Improvement of A Concept-Based Text Categorization System(TAXON) Using Weight Determination Heuristic (가중치 부여 휴리스틱을 이용한 개념 기반 문서분류기 TAXON의 개선)

  • 강원석;강현규;김영섬
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10c
    • /
    • pp.153-155
    • /
    • 1998
  • 본 논문에서는 개념을 기반으로 문서의 분류를 하는 확률벡터 모델의 분류기TAXON(Concept-based Text Categorization System)의 개선을 도모한다. TAXON은 한국어 문장을 분석하여 명사를 추출하고 명사의 개념을 시소러스 도구를 통해 획득한 후 이를 벡터화하여 주제와 입력 문서와의 관계성을 검사하는 문서 분류기이다. 본 논문은 문서 분류기 TAXON의 성능을 향상시키기 위하여 확률벡터 계산에 가중치 부여 휴리스틱을 도입한다. 그리고 시소러스 도구를 확장하여 문서 분류의 질을 높인다.

  • PDF

Making Anyatha (Upper Lander) and Auktha (Lower Lander): Crossing the Introduction of the Colonial Boundary System to British Burma (Myanmar)

  • Oo, Myo
    • SUVANNABHUMI
    • /
    • v.13 no.2
    • /
    • pp.135-164
    • /
    • 2021
  • In Myanmar studies, despite research on the categorization of ethnic nationalities are fairly much, research on the categorization of Myanmar people (ethnic Myanmar) is rarely exposed. People settled down in Central Myanmar had been categorized by regionalism into two groups as Anyatha (Upper Lander) and Auktha (Lower lander). It can be determined that the regionalism of Myanmar people existed and still exists. Previous scholarship in the colonial history of Myanmar has primarily referred to the documents recorded by the colonial officers and historical texts composed by the British authorities and scholars. The Catalogue of the Hluttaw Records is one of the rarest documents recorded in the Myanmar language on the affairs in the borderline drawn by the British after the Second Anglo-Myanmar War (1852-1853). Scrutinizing the Catalogue of the Hluttaw Records, it has been found that the text sheds light on the division of Central Myanmar into two regions in colonial Burma, later known as Lower Myanmar and Myanmar kingdom. These areas were known as Upper Myanmar between 1853 and 1885, and the categorization of the Myanmar king's subject, known as Anyatha (Upper Lander) and British colony citizen later known as Auktha (Lower Lander). This article traces back the relation of introducing the colonial boundary system and the division of Central Myanmar into two regions that allowed the emergence of regionalism among Myanmar people.

A study on The Product Categorization Model based efficient search in on-line chartering (온라인 용선거래에서 상품분류체계 기반의 검색 효율성에 관한 연구)

  • 최형림;박남규;박영재;박용성;강시협
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2003.05a
    • /
    • pp.265-272
    • /
    • 2003
  • Off-line ship chartering is done nearly through(by) the brokers. Because of the international scale of chartering marker, brokers spend too much times and costs on searching the most appropriate product which the consumers want. In this research, we propose the on-line Charter Product Categorization Model to search the products efficiently in the Cyber Chartering System. This Model will make concerned parties of the ship chartering to get efficient and unific product information. and to select the most appropriate product. In this research, we classified the ship chartering products into categories of cargo, ship type, and sea routes, and formed the definition of mutual relation of each products. Moreover we verified that this classification is necessary to search the products by the product searching experiment.

  • PDF