• Title/Summary/Keyword: question topic classification (질문 주제 분류)

Similar Question Search System for Q&A board of The National Institute of the Korean Language using Topic Classification (주제 분류를 활용한 국립국어원 질의응답 게시판 유사 질문 검색 시스템)

  • Mun, Jung-Min;Song, Yeong-Ho;Jin, Ji-Hwan;Lee, Hyun-Seob;Lee, Hyun-Ah
    • Annual Conference on Human and Language Technology / 2014.10a / pp.201-205 / 2014
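  • The online Ganada service of the National Institute of the Korean Language offers accurate answers to a wide range of questions about Korean. If questions similar to a newly posted question could be found automatically, the asker could get an answer quickly and the service administrators would be relieved of part of the burden of writing answers by hand. In this paper, we analyze the characteristics of the Q&A board of the National Institute of the Korean Language, classify question topics into six classes, and propose a system that retrieves similar questions by combining topic classification information with vector similarity and sequence similarity. In the evaluation, using the proposed topic classification information improved the accuracy of retrieving the correct answer at rank 1. In the final experiment, the MRR was 0.62, and the probabilities that the correct answer was retrieved at rank 1 and within the top 5 were 54.2% and 78.2%, respectively.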
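The combination of topic information, vector similarity, and sequence similarity described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the weights, the binary topic-match score, the bag-of-words cosine, and the use of `difflib.SequenceMatcher` as the sequence similarity are all assumptions made for the example.

```python
from collections import Counter
from difflib import SequenceMatcher
from math import sqrt


def cosine_similarity(a_tokens, b_tokens):
    """Cosine similarity between bag-of-words count vectors."""
    a, b = Counter(a_tokens), Counter(b_tokens)
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def sequence_similarity(a_tokens, b_tokens):
    """Order-sensitive similarity over token sequences (assumed stand-in)."""
    return SequenceMatcher(None, a_tokens, b_tokens).ratio()


def combined_score(query, candidate, weights=(0.2, 0.5, 0.3)):
    """Weighted sum of topic match, vector similarity, and sequence similarity.

    The weights are hypothetical; the abstract does not state how the
    three signals are combined.
    """
    w_topic, w_vec, w_seq = weights
    topic = 1.0 if query["topic"] == candidate["topic"] else 0.0
    return (w_topic * topic
            + w_vec * cosine_similarity(query["tokens"], candidate["tokens"])
            + w_seq * sequence_similarity(query["tokens"], candidate["tokens"]))


def rank_similar(query, candidates):
    """Return candidates sorted from most to least similar to the query."""
    return sorted(candidates, key=lambda c: combined_score(query, c), reverse=True)
```

Each question is represented here as a dictionary with a `topic` label and a `tokens` list, which is also an assumption; a real system would need a Korean morphological analyzer to produce the tokens.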

Similar Question Search System for online Q&A for the Korean Language Based on Topic Classification (온라인가나다를 위한 주제 분류 기반 유사 질문 검색 시스템)

  • Mun, Jung-Min;Song, Yeong-Ho;Jin, Ji-Hwan;Lee, Hyun-Seob;Lee, Hyun Ah
    • Korean Journal of Cognitive Science / v.26 no.3 / pp.263-278 / 2015
  • The online Q&A service of the National Institute of the Korean Language provides expert answers to questions about the Korean language; as on other Q&A boards, many similar questions are posted repeatedly. If a system automatically finds questions similar to a user's question, it can immediately offer the user recommended answers and spare experts from answering similar questions repeatedly. In this paper, we define 5 classes of frequently asked questions based on their topics and propose classifying questions into those classes. Our system searches for similar questions by combining topic similarity, vector similarity, and sequence similarity. Experiments show that topic classification improves search accuracy. In the experiments, the Mean Reciprocal Rank (MRR) of our system is 0.756, precision for the first result is 68.31%, and precision for the top five results is 87.32%.
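For readers unfamiliar with the reported metrics, the sketch below shows one common way to compute MRR and a top-k hit rate. It assumes that "precision for the first result" and "precision for top five results" mean the fraction of queries with at least one correct answer in the top 1 or top 5; the abstract does not define them further. The function names and the toy data are hypothetical.

```python
def reciprocal_rank(ranked_ids, relevant_ids):
    """1 / rank of the first relevant result, or 0.0 if none is retrieved."""
    for rank, item in enumerate(ranked_ids, start=1):
        if item in relevant_ids:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs):
    """Average reciprocal rank over (ranked_ids, relevant_ids) pairs."""
    return sum(reciprocal_rank(ranked, rel) for ranked, rel in runs) / len(runs)


def hit_rate_at_k(runs, k):
    """Fraction of queries with at least one relevant result in the top k."""
    hits = sum(1 for ranked, rel in runs if any(i in rel for i in ranked[:k]))
    return hits / len(runs)


# Toy data: three queries, each with a ranked result list and a gold set.
runs = [
    (["q3", "q1"], {"q1"}),   # correct answer at rank 2
    (["q7"], {"q7"}),         # correct answer at rank 1
    (["q2", "q4"], {"q9"}),   # correct answer not retrieved
]
mrr = mean_reciprocal_rank(runs)
top1 = hit_rate_at_k(runs, 1)
top5 = hit_rate_at_k(runs, 5)
```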

A Topic Classification System Based on Clue Expressions for Person-Related Questions and Passages (단서표현 기반의 인물관련 질의-응답문 문장 주제 분류 시스템)

  • Lee, Gyoung Ho;Lee, Kong Joo
    • KIPS Transactions on Software and Data Engineering / v.4 no.12 / pp.577-584 / 2015
  • In general, a Q&A system retrieves passages by matching the terms of a question in order to find an answer. However, it is difficult for a Q&A system to find the correct answer, because too many passages are retrieved and term matching alone is not enough to rank them by their relevance to the question. To alleviate this problem, we introduce a topic for each sentence and use it for ranking in a Q&A system. We define a set of person-related topic classes and clue expressions that can indicate the topic of a sentence. The topic classification system proposed in this paper determines the target topic of an input sentence using clue expressions, which are manually collected from a corpus. We explain the architecture of the topic classification system and evaluate the performance of its components.
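The idea of assigning a topic to a sentence from manually collected clue expressions might look like the following in its simplest form. The topic classes, the clue expressions, and the count-based scoring are all hypothetical and chosen only for illustration; the abstract does not list the paper's actual classes or describe its scoring.

```python
# Hypothetical person-related topic classes and clue expressions.
CLUES = {
    "birth": ["was born", "birthplace", "date of birth"],
    "education": ["graduated from", "studied at", "degree"],
    "career": ["worked as", "was appointed", "served as"],
}


def classify_topic(sentence, clues=CLUES):
    """Return the topic whose clue expressions occur most often, or None."""
    text = sentence.lower()
    scores = {
        topic: sum(text.count(expr) for expr in exprs)
        for topic, exprs in clues.items()
    }
    best = max(scores, key=scores.get)
    # No clue expression matched: leave the sentence unclassified.
    return best if scores[best] > 0 else None
```

A topic assigned this way could then serve as one ranking signal, for example by preferring passages whose topic matches the topic of the question.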

Deep learning-based Answer Type Classifier Considering Topicality in Korean Question Answering (한국어 질의 응답에서의 화제성을 고려한 딥러닝 기반 정답 유형 분류기)

  • Cho, Seung Woo;Choi, DongHyun;Kim, EungGyun
    • Annual Conference on Human and Language Technology / 2019.10a / pp.103-108 / 2019
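  • This paper describes a method for binary classification of the expected answer type of an input question in Korean question answering as either short-answer or descriptive. To reflect the topicality of a question's topic word, which cannot be determined by ordinary named entity recognition, search engine queries are analyzed by frequency. Along with the analyzed topic word information, attribute expressions that can constrain the range of the answer and 5W1H information are used as input features. In experiments comparing against an existing neural network classification model, the model with the additional features showed classification performance improved by about 4%.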

A Study on Functions and Present Situation of Subject Specialists for Information Services in Korean College and University Libraries (한국의 대학도서관 정보서비스에 있어서 주제전문사서의 현황과 기능에 관한 조사연구)

  • Han, Sang-Wan
    • Journal of the Korean Society for Information Management / v.3 no.2 / pp.42-74 / 1986
  • The objective of the study is to find a theoretical and practical answer to the question, "What is the most effective and highest-quality method of information service for college and university libraries in Korea?" Under the maximum service, or total service, theory of information services, such service requires subject specialists who are highly qualified in their subjects. This research used a questionnaire survey of reference/information librarians working in college and university libraries; 159 librarians returned the questionnaires. Analysis of the questionnaires yielded the following major results: 1. Only 7.6% could be called subject specialists in Korean college and university libraries. 2. A subject specialist system is necessary to enhance information services in college and university libraries. 3. The major functions of subject specialists are information services in given subject fields; preparation of bibliographies, guides, reading lists, indexes, and abstracts; distribution of information and current awareness services; well-balanced collection development; liaison between academic departments, students, and faculty members; formal and informal lectures on the use of the library and its resources; and cataloging and classification. 4. The best library and information education system is graduate-level study offering the M.L.S. or M.S. in library and information science with an emphasis on subject background. 5. Developing a subject specialist system in college and university libraries in Korea will help establish faculty status for academic librarians.

Recent Trends in Research Methods in Library and Information Science : Content Analysis of the Journal Articles (내용분석법에 의한 문헌정보학 학술지 연구논문 분석)

  • Lee, Myeong-Hee
    • Journal of the Korean Society for Library and Information Science / v.36 no.3 / pp.287-310 / 2002
  • This study used content analysis to examine the research methods in Library and Information Science journals from 1997 to 2001. The analyses measured research subjects, research methods, data collection methods, data analysis methods, hypotheses, theories, and research funds. Brief consideration is given to possible future methodological trends: web-based research methods, qualitative research, cooperation between academic societies and librarians, and research fundraising.

Characteristics of Middle School Students' Open-Inquiry Report and Their Perceptions of Conducting Inquiry (중학생의 자유 탐구 보고서에 나타난 특징과 탐구 수행에 대한 학생들의 인식)

  • Park, Mi-Hyun;Cha, Jeong-Ho;Kim, In-Whan
    • Journal of the Korean Chemical Society / v.56 no.3 / pp.371-377 / 2012
  • In this study, the open inquiry reports of 165 eighth graders in Daegu were analyzed in terms of content area, types of inquiry hypothesis, and types of inquiry variables. Before summer vacation, students learned about the inquiry process and explored their own inquiry topics for two class hours. During summer vacation, students performed open inquiry, including problem selection, designing and performing an experiment, data collection, data analysis, and report writing. After the vacation, students submitted their reports and answered an additional survey about the source of their inquiry idea, the definition of a hypothesis, and the most difficult step of the inquiry process. Chemistry was the most common content area of the reports, followed by biology and life science. Of the 165 reports, 130 included inquiry hypotheses, most of which were predictive hypotheses. In many reports, dependent and independent variables could not be identified because they were ambiguous. However, the inquiry variables described in the experimental design, which were mostly categorical variables, were clearer than those described in the inquiry subject and inquiry hypothesis. The most difficult step of the inquiry process for students was generating an idea for open inquiry.

Planning of Oral History of Korean Astronomy (한국천문학 구술사연구 기획론)

  • Choi, Youngsil;Kim, Sang Hyuk;Mihn, Byeong-Hee;Seo, Yoon Kyung;Ahn, Young Sook;Yang, Hong-Jin;Choi, Go-Eun
    • The Bulletin of The Korean Astronomical Society / v.44 no.2 / pp.66.2-66.2 / 2019
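  • Oral history recording is the research activity best suited to passing vivid historical experience on to later generations when documenting the research history of a particular subject. The need to record the oral history of the development of Korean astronomy is especially urgent because most of the senior figures in Korean astronomy and space science are of advanced age. The Center for Historical Astronomy at the Korea Astronomy and Space Science Institute therefore plans to launch a full-scale oral history research project, building on the institute's own earlier work on establishing a classification scheme for historical materials and on its occasional experience with oral history recording. This study sets out a plan for the procedural method of the oral history project on the development of Korean astronomy. It is presented in three parts: (1) establishing an oral history roadmap, (2) the process of producing oral records, and (3) managing and using the outputs. First, in establishing the roadmap, astronomical oral records are characterized through historical research and subject classification, taking the mid-1950s, the formative period of modern Korean astronomy, as the starting point. On this basis, interviewees are selected and interview questionnaires are drawn up around an analytical framework that intersects the broader history with individual life histories. In this process, materials held by the interviewees are identified so that potential historical materials can be collected in advance. Second, in the production process proper, oral outputs are produced from the information collected in the previous stage. Oral history videos are produced and labeled on the basis of the necessary forms, such as interview logs, detailed transcripts, summaries, and consent-for-use forms. After the facts in these outputs are verified, production ends with completion of the final outputs and the remaining administrative processing. Finally, for managing and using the outputs, a system is established for using them as base data for a historical materials collection strategy and in various knowledge and information content. Furthermore, the results of this research project will be used to design an oral history archive system for building an oral history database and systematizing services. Because this plan concerns the specific subject of Korean astronomy, its overall method borrows the approach of archival science, but the historical research and the characterization of the records must be accompanied by a deep understanding of the research history of Korean astronomy. The outputs will therefore be valuable and rich only when they are closely connected to the histories of the academic societies, educational institutions, research institutes, and incorporated associations that make up the broad network of Korean astronomy. We hope that this study serves as a starting point for building a shared understanding, across government and academic research, of the oral history project on the development of Korean astronomy.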

Publication Trends in Smoking-Related Research for Children and Adolescents: An Analysis of Korean Academic Journals (아동과 청소년의 흡연 관련 연구 동향 분석: 학술지 게재 논문을 중심으로)

  • Son, Hyun-Dong
    • Journal of the Korea Convergence Society / v.10 no.2 / pp.269-276 / 2019
  • The purpose of this study was to investigate publication trends in smoking-related research on children and adolescents published in Korean academic journals. Three hundred fifty papers published until 2018 were analyzed with a focus on publication year, research participants, research themes, and research methods. Smoking-related research on children and adolescents increased sharply from 1995 to 2000, and the trend continued. The main research participants were general children and adolescents, and the most frequently studied themes were, in order, 'Associated Factors,' 'Intervention,' 'Prevalence,' 'Prevention,' 'Characteristics,' 'Law and Policies,' 'Scales,' and 'Review and Theories.' The most frequently used research method was the quantitative method. Moreover, the most common data gathering method was the questionnaire, and the number of papers using panel data was gradually increasing. It was suggested that future studies explore a broader range of themes, and a balanced research approach using both qualitative and mixed methods was also recommended.

Classification of Public Perceptions toward Smog Risks on Twitter Using Topic Modeling (Topic Modeling을 이용한 Twitter상에서 스모그 리스크에 관한 대중 인식 분류 연구)

  • Kim, Yun-Ki
    • Journal of Cadastre & Land InformatiX / v.47 no.1 / pp.53-79 / 2017
  • The main purpose of this study was to detect and classify public perceptions of smog disasters on Twitter using topic modeling. To achieve this objective and to identify gaps in the literature, this research carried out a literature review on public opinion about smog disasters and on topic modeling. The review indicated that there are large gaps in the related literature. The author formed five research questions to fill those gaps. To answer them, the study then performed data extraction, word cloud analysis on the cleaned data, construction of a network of terms, correlation analysis, hierarchical cluster analysis, topic modeling with LDA, and stream graph analysis. The results revealed large differences between New York and London in the most frequent terms, the shapes of the term networks, the types of correlation, and the patterns of change in smog-related topics. The author therefore found positive answers to four of the five research questions and a partially positive answer to Research Question 4. Finally, on the basis of the results, the author suggested policy implications and recommendations for future study.