• 제목/요약/키워드: Semantic Classification

검색결과 329건 처리시간 0.025초

시맨틱 검색 : 서베이 (Semantic Search : A Survey)

  • 박진수;김남원;최민정;김철;최영석
    • 지능정보연구
    • /
    • 제17권4호
    • /
    • pp.19-36
    • /
    • 2011
  • 시맨틱 웹(Semantic Web)의 비전에 대한 공표가 이루어진 이래로 이와 관련한 많은 연구가 진행되어 왔다. 그러나 지금까지의 연구가 성공적이었다는 판단을 하기에는 아직 이르다. 본 논문은 시맨틱 관련 연구분야의 두 가지 문제점을 진단한다. 첫째는 '시맨틱 검색'이라는 개념의 합의된 정의가 없다는 것이고, 둘째는 장래의 유관 연구를 바라볼 수 있는 종합적이고 체계적인 시각이 부족하다는 것이다. 이러한 진단 아래, 본 논문은 시맨틱 검색의 개념을 '사용자의 입력에 따라 온톨로지와 같은 시맨틱 기술을 이용하여 원하는 정보를 얻는 행위'로 정의한다. 또한 시맨틱 검색에 대한 이해를 돕기 위해 시맨틱 검색 엔진 분류 프레임워크를 제안하였다. 본 연구에서 제안하는 프레임워크는 (쿼리) 입력문의 처리, 타겟 소스, 검색 방법론, 검색결과의 서열화, 출력 결과물의 데이터 종류, 이렇게 다섯 가지 부분으로 나뉜다. 마지막으로 본 논문은 제시한 프레임워크를 응용하여 기존의 연구결과물을 분석하고 앞으로의 연구 방향을 논하는 것으로 끝을 맺는다.

가중치 기반 PLSA를 이용한 문서 평가 분석 (Reputation Analysis of Document Using Probabilistic Latent Semantic Analysis Based on Weighting Distinctions)

  • 조시원;이동욱
    • 전기학회논문지
    • /
    • 제58권3호
    • /
    • pp.632-638
    • /
    • 2009
  • Probabilistic Latent Semantic Analysis has many applications in information retrieval and filtering, natural language processing, machine learning from text, and in related areas. In this paper, we propose an algorithm using weighted Probabilistic Latent Semantic Analysis Model to find the contextual phrases and opinions from documents. The traditional keyword search is unable to find the semantic relations of phrases, Overcoming these obstacles requires the development of techniques for automatically classifying semantic relations of phrases. Through experiments, we show that the proposed algorithm works well to discover semantic relations of phrases and presents the semantic relations of phrases to the vector-space model. The proposed algorithm is able to perform a variety of analyses, including such as document classification, online reputation, and collaborative recommendation.

비위론(脾胃論)에 기재된 용어 분류체계에 관한 연구 (A Study of Classification in the Terms of "Biwiron(脾胃論)")

  • 정두영;이병욱;엄동명;김은하
    • 대한한의학원전학회지
    • /
    • 제22권1호
    • /
    • pp.191-205
    • /
    • 2009
  • Objective : Up to the present, theories of medical books is too difficult to understand thoroughly. However, these study methods have some problems in dealing with lots of meaning because the comprehension of theories are dependent upon one's memory. Especially, comparison distinct medical books are more difficult matter. So, we have attempted to solve a problem. Method : We have researched medical terms in the "Piweilun" according to below the procedure. (1) Making a terms list: We have selected constituent of sentence. And we have made term list on the basis of concept of term. (2) Making a synonym list: We have collected identical conception and made a synonym list. So, using an synonym tables of DB, it is possible to search for the non-standard terms of medical theory. (3) Making a classification system: Using UMLS(Unified Medical Language System), MeSH(Medical Subject Headings), IST(International Standard Terminology) ect., we have made a classification system of oriental medicine terms in the "Piwelun". Analysis of relation between terms. Result : In the "Piweilun", there are more than 1,790s concepts. Parts of those are belonged to UMLS-Semantic Type, the other parts of those are not belonged to UMLS-Semantic Type. And those include predicate more than UMLS-Semantic Relations.

  • PDF

A Remote Sensing Scene Classification Model Based on EfficientNetV2L Deep Neural Networks

  • Aljabri, Atif A.;Alshanqiti, Abdullah;Alkhodre, Ahmad B.;Alzahem, Ayyub;Hagag, Ahmed
    • International Journal of Computer Science & Network Security
    • /
    • 제22권10호
    • /
    • pp.406-412
    • /
    • 2022
  • Scene classification of very high-resolution (VHR) imagery can attribute semantics to land cover in a variety of domains. Real-world application requirements have not been addressed by conventional techniques for remote sensing image classification. Recent research has demonstrated that deep convolutional neural networks (CNNs) are effective at extracting features due to their strong feature extraction capabilities. In order to improve classification performance, these approaches rely primarily on semantic information. Since the abstract and global semantic information makes it difficult for the network to correctly classify scene images with similar structures and high interclass similarity, it achieves a low classification accuracy. We propose a VHR remote sensing image classification model that uses extracts the global feature from the original VHR image using an EfficientNet-V2L CNN pre-trained to detect similar classes. The image is then classified using a multilayer perceptron (MLP). This method was evaluated using two benchmark remote sensing datasets: the 21-class UC Merced, and the 38-class PatternNet. As compared to other state-of-the-art models, the proposed model significantly improves performance.

Word2Vec를 이용한 토픽모델링의 확장 및 분석사례 (Expansion of Topic Modeling with Word2Vec and Case Analysis)

  • 윤상훈;김근형
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제30권1호
    • /
    • pp.45-64
    • /
    • 2021
  • Purpose The traditional topic modeling technique makes it difficult to distinguish the semantic of topics because the key words assigned to each topic would be also assigned to other topics. This problem could become severe when the number of online reviews are small. In this paper, the extended model of topic modeling technique that can be used for analyzing a small amount of online reviews is proposed. Design/methodology/approach The extended model of being proposed in this paper is a form that combines the traditional topic modeling technique and the Word2Vec technique. The extended model only allocates main words to the extracted topics, but also generates discriminatory words between topics. In particular, Word2vec technique is applied in the process of extracting related words semantically for each discriminatory word. In the extended model, main words and discriminatory words with similar words semantically are used in the process of semantic classification and naming of extracted topics, so that the semantic classification and naming of topics can be more clearly performed. For case study, online reviews related with Udo in Tripadvisor web site were analyzed by applying the traditional topic modeling and the proposed extension model. In the process of semantic classification and naming of the extracted topics, the traditional topic modeling technique and the extended model were compared. Findings Since the extended model is a concept that utilizes additional information in the existing topic modeling information, it can be confirmed that it is more effective than the existing topic modeling in semantic division between topics and the process of assigning topic names.

딥러닝 기반의 영상분할을 이용한 토지피복분류 (Land Cover Classification Using Sematic Image Segmentation with Deep Learning)

  • 이성혁;김진수
    • 대한원격탐사학회지
    • /
    • 제35권2호
    • /
    • pp.279-288
    • /
    • 2019
  • 본 연구에서는 항공정사영상을 이용하여 SegNet 기반의 의미분할을 수행하고, 토지피복분류에서의 그 성능을 평가하였다. 의미분할을 위한 분류 항목을 4가지(시가화건조지역, 농지, 산림, 수역)로 선정하였고, 항공정사영상과 세분류 토지피복도를 이용하여 총 2,000개의 데이터셋을 8:2 비율로 훈련(1,600개) 및 검증(400개)로 구분하여 구축하였다. 구축된 데이터셋은 훈련과 검증으로 나누어 학습하였고, 모델 학습 시 정확도에 영향을 미치는 하이퍼파라미터의 변화에 따른 검증 정확도를 평가하였다. SegNet 모델 검증 결과 반복횟수 100,000회, batch size 5에서 가장 높은 성능을 보였다. 이상과 같이 훈련된 SegNet 모델을 이용하여 테스트 데이터셋 200개에 대한 의미분할을 수행한 결과, 항목별 정확도는 농지(87.89%), 산림(87.18%), 수역(83.66%), 시가화건조지역(82.67%), 전체 분류정확도는 85.48%로 나타났다. 이 결과는 기존의 항공영상을 활용한 토지피복분류연구보다 향상된 정확도를 나타냈으며, 딥러닝 기반 의미분할 기법의 적용 가능성이 충분하다고 판단된다. 향후 다양한 채널의 자료와 지수의 활용과 함께 분류 정확도 향상에 크게 기여할 수 있을 것으로 기대된다.

An Ontology Driven Mapping Algorithm between Heterogeneous Product Classification Taxonomies

  • 김우주;최남혁;최태우
    • 한국지능정보시스템학회:학술대회논문집
    • /
    • 한국지능정보시스템학회 2005년도 공동추계학술대회
    • /
    • pp.295-303
    • /
    • 2005
  • Semantic Web and its related technologies have been opening the era of information sharing via Web. In the meantime, there are several huddles to overcome toward the new era and one of the major huddles is information integration issue unless we build and use a single unified but huge ontology which address everything in the world. Particularly in e-business area, information integration problem must be a great concern in search and comparison of products from various internet shopping sites and e-marketplaces. To overcome such an information integration problem, we propose an ontology driven mapping algorithm between heterogeneous product classification and description frameworks. We also perform comparative evaluation of the proposed mapping algorithm against a well-known ontology mapping tool, PROMPT.

  • PDF

Bag of Visual Words Method based on PLSA and Chi-Square Model for Object Category

  • Zhao, Yongwei;Peng, Tianqiang;Li, Bicheng;Ke, Shengcai
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제9권7호
    • /
    • pp.2633-2648
    • /
    • 2015
  • The problem of visual words' synonymy and ambiguity always exist in the conventional bag of visual words (BoVW) model based object category methods. Besides, the noisy visual words, so-called "visual stop-words" will degrade the semantic resolution of visual dictionary. In view of this, a novel bag of visual words method based on PLSA and chi-square model for object category is proposed. Firstly, Probabilistic Latent Semantic Analysis (PLSA) is used to analyze the semantic co-occurrence probability of visual words, infer the latent semantic topics in images, and get the latent topic distributions induced by the words. Secondly, the KL divergence is adopt to measure the semantic distance between visual words, which can get semantically related homoionym. Then, adaptive soft-assignment strategy is combined to realize the soft mapping between SIFT features and some homoionym. Finally, the chi-square model is introduced to eliminate the "visual stop-words" and reconstruct the visual vocabulary histograms. Moreover, SVM (Support Vector Machine) is applied to accomplish object classification. Experimental results indicated that the synonymy and ambiguity problems of visual words can be overcome effectively. The distinguish ability of visual semantic resolution as well as the object classification performance are substantially boosted compared with the traditional methods.

Deep Hashing for Semi-supervised Content Based Image Retrieval

  • Bashir, Muhammad Khawar;Saleem, Yasir
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권8호
    • /
    • pp.3790-3803
    • /
    • 2018
  • Content-based image retrieval is an approach used to query images based on their semantics. Semantic based retrieval has its application in all fields including medicine, space, computing etc. Semantically generated binary hash codes can improve content-based image retrieval. These semantic labels / binary hash codes can be generated from unlabeled data using convolutional autoencoders. Proposed approach uses semi-supervised deep hashing with semantic learning and binary code generation by minimizing the objective function. Convolutional autoencoders are basis to extract semantic features due to its property of image generation from low level semantic representations. These representations of images are more effective than simple feature extraction and can preserve better semantic information. Proposed activation and loss functions helped to minimize classification error and produce better hash codes. Most widely used datasets have been used for verification of this approach that outperforms the existing methods.

규칙 기반 추론을 이용한 이기종 시스템간의 제품 정보 상호운용에 관한 연구 (A Study on the Product Information Interoperability between Heterogeneous Systems using Rule-based Reasoning)

  • 이상석;양태호;이덕희;오석찬;노상도
    • 산업공학
    • /
    • 제24권3호
    • /
    • pp.248-257
    • /
    • 2011
  • The amount of Meta-data to be managed increases with development of information technology. However, when trying to integrate and share product information of heterogeneous systems within or between companies, sharing of information is impossible if product information classification systems are different. Due to the situation mentioned above, engineers judge the product information classification system and maps corresponding Meta-data for document-based sharing. Judging exponentially increasing amount of data by engineers and sharing product information using documents create great amount of time delay and errors in data handling. Therefore, construction of a system for integrated management and interoperability between product information based on semantic information similar to engineer's judgment is required. This paper proposes a methodology and necessity of a system for interoperability of product information based on semantic web, and also designs a system to integrate heterogeneous systems with different product information using rule based reasoning. This paper also suggests a system base for interoperability and integration of product information between heterogeneous systems by integrating the product information classification system semantically.