• Title/Summary/Keyword: content-based retrieval (내용-기반 검색)

A method for metadata extraction from a collection of records using Named Entity Recognition in Natural Language Processing (자연어 처리의 개체명 인식을 통한 기록집합체의 메타데이터 추출 방안)

  • Chiho Song
    • Journal of Korean Society of Archives and Records Management / v.24 no.2 / pp.65-88 / 2024
  • This pilot study explores a method of extracting metadata values and descriptions from records using named entity recognition (NER), a technique in natural language processing (NLP), a subfield of artificial intelligence. The study focuses on handwritten records from the Guro Industrial Complex, produced during the 1960s and 1970s, comprising approximately 1,200 pages and 80,000 words. After preprocessing the records, including digitization, the study used a publicly available language API based on Google's Bidirectional Encoder Representations from Transformers (BERT) language model to recognize named entities in the text. As a result, 173 names of people and 314 names of organizations and institutions were extracted from the Guro Industrial Complex's past records. These extracted entities are expected to serve as direct search terms for accessing the contents of the records. The study also identifies challenges that arose when applying the theoretical methodology of NLP to real-world records consisting of semistructured text, and presents potential solutions and implications to consider when addressing them.

A Korean Document Sentiment Classification System based on Semantic Properties of Sentiment Words (감정 단어의 의미적 특성을 반영한 한국어 문서 감정분류 시스템)

  • Hwang, Jae-Won;Ko, Young-Joong
    • Journal of KIISE:Software and Applications / v.37 no.4 / pp.317-322 / 2010
  • This paper proposes a way to improve the performance of a Korean document sentiment classification system using the semantic properties of sentiment words. A sentiment word is a word that carries sentiment, and sentiment features are defined by a set of sentiment words, which are an important lexical resource for sentiment classification. A sentiment feature has a different sentiment intensity in the general field than in a specific domain. In the general field, the sentiment intensity can be estimated using snippets from a search engine, while in a specific domain, training data can be used for the estimation. The estimated sentiment intensity of a sentiment feature is called its semantic orientation, and it is used to estimate the sentiment intensity of the sentences in a text document. After estimating the sentiment intensity of the sentences, we apply it to the weights of the sentiment features. We evaluate our system with a support vector machine in three cases: general, domain-specific, and combined general/domain-specific semantic orientation. The experimental results show improved performance in all cases; in particular, with general/domain-specific semantic orientation, the proposed method performs 3.1% better than a baseline system indexed by content words only.

A Design and Implementation of RSS Data Collecting Engine based on Web 2.0 (웹 2.0 기반 RSS 데이터 수집 엔진의 설계 및 구현)

  • Kang, Pil-Gu;Kim, Jae-Hwan;Lee, Sang-Jun;Chae, Jin-Seok
    • Journal of Korea Multimedia Society / v.10 no.11 / pp.1496-1506 / 2007
  • The web service environment has changed a great deal due to the progress of internet technology and the active participation of users. Established web services were static and passive, but recent web services are becoming dynamic and active. Web 2.0 reflects this change well, and its primary feature is the active participation of users. As the volume of generated information grows, it is increasingly necessary to share information quickly and correctly. The technologies that meet this need in Web 2.0 are web syndication and tagging. Web syndication produces feeds so that other sites or users can receive the content of a web site. Tagging captures the core of a piece of information, and many internet users share information rapidly through tag search. In this paper, we propose an efficient technique to improve Web 2.0 technologies such as web syndication and tagging by using a data collection engine. The data collection engine stores in a database the information that a user's web site uses, and it accesses the user's web site to collect updated data. The experimental results show that our approach makes search up to 3.14 times faster than the existing method and reduces the size of data by up to 66% for building associated tags.

An Item Pool System for Leveled Assessment (수준별 평가를 위한 문제은행 시스템)

  • Hong, Jong-Gee;Jun, Woo-Chun
    • Journal of The Korean Association of Information Education / v.6 no.3 / pp.298-307 / 2002
  • Recent advances in Web technology have been changing our lives in various ways and have brought new paradigms of education. The Web gives teachers many opportunities to implement a wide range of new teaching and learning practices that supplement traditional classroom teaching and learning. In particular, the Web enables so-called Web-based instruction (WBI) systems as a teaching aid. A WBI system can now incorporate multimedia information with various communication and collaboration tools. For a WBI system to be successful, various kinds of support are necessary, and one of them is assessment. In this work, an item pool system for leveled assessment is designed and implemented. The proposed system has the following characteristics. First, the item pool is classified into three categories: subject, semester, and chapter. This categorization makes lookup easier and faster. Second, any teacher can use the item pool system and enter their questions into the item pool. Third, the proposed system reflects the various levels of students in each course, so students can select their exams based on their progress and background. Finally, it makes the difficulty of each item more objective through repeated tests and refinements.

Nexus based Quality Inspection Support Model for Defect Prevention of Architectural Finishing Works (하자예방정보 넥서스 기반 건축마감공사 품질점검 지원 모델)

  • Lee, Hye-Rin;Cho, Dong-Hyun;Park, Sang-Hun;Koo, Kyo-Jin
    • Korean Journal of Construction Engineering and Management / v.18 no.5 / pp.59-67 / 2017
  • Various finishing processes are concentrated at the completion stage of construction. This places a burden on the on-site manager and forces reliance on experience-based quality control, causing construction quality to vary with the individual competence of the supervisor or worker. In addition, information related to quality control is often scattered across various types of documents such as specifications and drawings, and checkpoints are frequently omitted. A tool is needed that systematically stores information related to defect prevention, links it in a mutually referential state, and provides it effectively to practitioners before or during inspection work. This paper proposes a quality inspection support model that systematically stores the necessary information on an activity or room basis for quality checks of apartment house finishing work. A defect prevention information base and an information nexus are established by linking specifications, design standards, checklists, regulations, defect cases, and drawings to the finishing processes and the rooms. Based on this, information registration and search interfaces are presented. By proposing a framework that links various defect prevention information with a focus on finishing activities and rooms, the model can help secure at least a certain level of construction quality.

Building a Satellite Image Based Blog System Using PPGIS (People Participatory GIS) (국민참여형 위성영상 블로그 시스템 구축)

  • Lee, Ki-Hwan;Lee, Dong-Cheon;Park, Seok-Ho;Kim, Il;Shin, Sang-Hee
    • Korean Journal of Remote Sensing / v.23 no.2 / pp.125-130 / 2007
  • This paper introduces a satellite image based blog system built by JeonNam local province. The main goals of this system are: (1) overcoming the static nature of traditional Web-GIS, (2) providing a geoUCC generating platform by combining multimedia technology and GIS in a single web environment, (3) building a two-way Web-GIS through user participation, and (4) creating a new channel of communication between government and citizens. The resulting system enables users to create their own UCC (User Created Contents) on high-resolution satellite images and to share it with other systems using Web 2.0 technology.

Design and Implement an Internet-Based Courseware (인터넷 기반의 코스웨어의 설계 및 구현)

  • Lee, Geon-Jin
    • Journal of The Korean Association of Information Education / v.1 no.1 / pp.82-91 / 1997
  • The purpose of this thesis is to design and implement an efficient Internet-based courseware that facilitates problem solving learning. The courseware was developed to provide important foundations for learning in an open-education environment using the WWW. The target level is elementary students. To this end, the definition of problem solving, its processes, and the advantages and pitfalls of computer-based problem solving learning were examined, along with the advantages of using the WWW as an educational tool. The theme of the implemented courseware was selected from SATIS, which is relevant to problem solving learning. The courseware has three main parts: a learning activity module, a teaching activity module, and a learning tool module. The learning activity module controls the courseware flow and was implemented in accordance with the problem-based learning process. It can proceed either sequentially or by random access by setting the linker. The advantage of random access is that it may facilitate learning, because each student can regulate their learning process according to their own experience. The teaching activity module provides teachers with useful information for helping students learn, and it can also be used as an assessment tool for student achievement. The learning tool module consists of a conversational note, e-mail address, help, and search tool. It is linked with the learning activity module and the teaching activity module so that teachers and students can actively participate in the teaching-learning process.

An Exploratory Investigation on Visual Cues for Emotional Indexing of Image (이미지 감정색인을 위한 시각적 요인 분석에 관한 탐색적 연구)

  • Chung, SunYoung;Chung, EunKyung
    • Journal of the Korean Society for Library and Information Science / v.48 no.1 / pp.53-73 / 2014
  • Given the recent growth of emotion-based computing environments, it is necessary to focus on emotional access to and use of multimedia resources, including images. This study aims to identify the visual cues for emotion in images. To this end, the study selected five basic emotions (love, happiness, sadness, fear, and anger) and interviewed twenty participants to elicit the visual cues for these emotions. A total of 620 visual cues mentioned by participants were collected from the interview results and coded according to five categories and 18 sub-categories of visual cues. The findings showed that facial expressions, actions/behaviors, and syntactic features were significant in perceiving a specific emotion in an image. Each emotion showed distinctive characteristics in its visual cues. Love showed a higher relation with visual cues such as actions and behaviors, and happiness was substantially related to facial expressions. Sadness was perceived primarily through actions and behaviors, and fear was perceived considerably through facial expressions. Anger was highly related to syntactic features such as lines, shapes, and sizes. The findings imply that emotional indexing could be effective when content-based features are considered in combination with concept-based features.

Design and Implementation of Dynamic Form-based Editor for Writing Electronic Books (전자책 저작을 위한 동적 폼 기반 편집기의 설계 및 구현)

  • Koo, Eun-Young;Choy, Yoon-Chul
    • Journal of KIISE:Computing Practices and Letters / v.8 no.5 / pp.540-550 / 2002
  • An electronic book (eBook) is a publication that stores and processes the contents of a book using digital mechanisms, and it has advantages such as ease of storage and search and portability. To promote eBooks, studies on related techniques are required, yet the development of a dedicated editor suited to eBook structure is still inadequate. In this paper, we design and implement an eBook editor that provides a form-based interface for genre-based eBook structure so that writing is easier for users. Because eBooks have a genre-based structure owing to the characteristics of literature, it is necessary to provide forms for each genre. Whereas writing an eBook with an existing XML editor requires the user to study XML grammar, the proposed system avoids this problem by providing a form-based interface. Additionally, because eBooks may be structured according to the intention of users, we provide the flexibility to add dynamic forms to the default forms, making eBook writing more effective. By providing a form-based interface according to genre and a dynamic structure according to the intention of users, the system makes eBooks easier to write.

Semantic Fuzzy Implication Operator for Semantic Implication Relationship of Knowledge Descriptions in Question Answering System (질의 응답 시스템에서 지식 설명의 의미적 포함 관계를 고려한 의미적 퍼지 함의 연산자)

  • Ahn, Chan-Min;Lee, Ju-Hong;Choi, Bum-Ghi;Park, Sun
    • The Journal of the Korea Contents Association / v.11 no.3 / pp.73-83 / 2011
  • A question answering system shows answers entered by other users in response to a user's question. Despite much research aimed at raising the satisfaction level of answers to user questions, there is an essential limitation. Therefore, as an auxiliary function, question answering systems recommend other questions that are highly likely to satisfy the user's intention. A method using the fuzzy relational product operator was proposed for recommending questions that largely include the contents of the user's question. The fuzzy relational product operator is built on the Kleene-Dienes operator, which measures the degree of implication by content between two questions. However, the Kleene-Dienes operator is not well suited to finding a question-answer pair that semantically includes a user question, because it was not designed to measure the degree of semantic inclusion between two documents. We present a novel fuzzy implication operator designed to find question-answer pairs by considering the implication relation. The new operator calculates the degree to which one question semantically implies another. Experimental results show that the probability that users are satisfied with the search results increases when the proposed operator is used for recommendation in a question answering system.
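
  • For reference, the Kleene-Dienes operator named in this abstract has the standard form I(a, b) = max(1 - a, b) for membership degrees a and b in [0, 1]. The following is a minimal Python sketch, not the authors' implementation: the function names, the representation of questions as term-weight vectors, and the mean aggregation over terms are all illustrative assumptions, chosen only to show how a mean-based fuzzy relational product could score one question against another.

```python
from typing import Sequence


def kleene_dienes(a: float, b: float) -> float:
    """Kleene-Dienes fuzzy implication: I(a, b) = max(1 - a, b)."""
    return max(1.0 - a, b)


def implication_degree(antecedent: Sequence[float],
                       consequent: Sequence[float]) -> float:
    """Mean Kleene-Dienes implication over aligned term weights.

    Both vectors are hypothetical term-weight vectors in [0, 1] over the
    same vocabulary. Averaging over terms is an assumed aggregation.
    """
    if not antecedent or len(antecedent) != len(consequent):
        raise ValueError("vectors must be non-empty and of equal length")
    total = sum(kleene_dienes(a, b) for a, b in zip(antecedent, consequent))
    return total / len(antecedent)


if __name__ == "__main__":
    # Hypothetical weights for three terms in two questions.
    user_question = [0.9, 0.8, 0.0]
    candidate = [0.9, 0.7, 0.4]
    print(implication_degree(user_question, candidate))
```

    In this sketch, any term with a low weight in the antecedent contributes a value near 1 regardless of the consequent, so two nearly empty vectors score close to 1. That is a property of the operator itself and may be related to the limitation the abstract describes.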