• 제목/요약/키워드: research question

검색결과 2,060건 처리시간 0.026초

Korean TableQA: Structured data question answering based on span prediction style with S3-NET

  • Park, Cheoneum;Kim, Myungji;Park, Soyoon;Lim, Seungyoung;Lee, Jooyoul;Lee, Changki
    • ETRI Journal
    • /
    • 제42권6호
    • /
    • pp.899-911
    • /
    • 2020
  • The data in tables are accurate and rich in information, which facilitates the performance of information extraction and question answering (QA) tasks. TableQA, which is based on tables, solves problems by understanding the table structure and searching for answers to questions. In this paper, we introduce both novice and intermediate Korean TableQA tasks that involve deducing the answer to a question from structured tabular data and using it to build a question answering pair. To solve Korean TableQA tasks, we use S3-NET, which has shown a good performance in machine reading comprehension (MRC), and propose a method of converting structured tabular data into a record format suitable for MRC. Our experimental results show that the proposed method outperforms a baseline in both the novice task (exact match (EM) 96.48% and F1 97.06%) and intermediate task (EM 99.30% and F1 99.55%).

A Natural Language Question Answering System-an Application for e-learning

  • Gupta, Akash;Rajaraman, Prof. V.
    • 한국지능정보시스템학회:학술대회논문집
    • /
    • 한국지능정보시스템학회 2001년도 The Pacific Aisan Confrence On Intelligent Systems 2001
    • /
    • pp.285-291
    • /
    • 2001
  • This paper describes a natural language question answering system that can be used by students in getting as solution to their queries. Unlike AI question answering system that focus on the generation of new answers, the present system retrieves existing ones from question-answer files. Unlike information retrieval approaches that rely on a purely lexical metric of similarity between query and document, it uses a semantic knowledge base (WordNet) to improve its ability to match question. Paper describes the design and the current implementation of the system as an intelligent tutoring system. Main drawback of the existing tutoring systems is that the computer poses a question to the students and guides them in reaching the solution to the problem. In the present approach, a student asks any question related to the topic and gets a suitable reply. Based on his query, he can either get a direct answer to his question or a set of questions (to a maximum of 3 or 4) which bear the greatest resemblance to the user input. We further analyze-application fields for such kind of a system and discuss the scope for future research in this area.

  • PDF

Towards a small language model powered chain-of-reasoning for open-domain question answering

  • Jihyeon Roh;Minho Kim;Kyoungman Bae
    • ETRI Journal
    • /
    • 제46권1호
    • /
    • pp.11-21
    • /
    • 2024
  • We focus on open-domain question-answering tasks that involve a chain-of-reasoning, which are primarily implemented using large language models. With an emphasis on cost-effectiveness, we designed EffiChainQA, an architecture centered on the use of small language models. We employed a retrieval-based language model to address the limitations of large language models, such as the hallucination issue and the lack of updated knowledge. To enhance reasoning capabilities, we introduced a question decomposer that leverages a generative language model and serves as a key component in the chain-of-reasoning process. To generate training data for our question decomposer, we leveraged ChatGPT, which is known for its data augmentation ability. Comprehensive experiments were conducted using the HotpotQA dataset. Our method outperformed several established approaches, including the Chain-of-Thoughts approach, which is based on large language models. Moreover, our results are on par with those of state-of-the-art Retrieve-then-Read methods that utilize large language models.

초등과학 학습내용과 관련된 학생의 사전질문 분석 (The Analysis of Students' Pre-inquire related to Elementary Science Curriculum Contents)

  • 강헌태;노석구
    • 한국초등과학교육학회지:초등과학교육
    • /
    • 제36권4호
    • /
    • pp.331-345
    • /
    • 2017
  • The purpose of this study is to collect and analyze the student's pre-inquire and to obtain information on how to use the teaching-learning process. The specific research problem is to confirm the level of the student's pre-inquire, to identify the characteristics of each type, and to check what pre-inquire can be used in the teaching-learning process. The research was conducted on 149 children in the $3^{rd}$ and $4^{th}$ grade of elementary school, and collected a total of 2,034 inquires. As a result of analyzing three times, the students' pre-inquires accounted for 90% of Level 2 and Level 3, which are the inquires that give meaningful answers in the teaching-learning process. These results show that the pre-inquires presented before the students take up the new lesson are not low-level inquires and they can present meaningful inquires that can be used for teaching-learning. Next, as a result of analyzing the student's inquire by type, the factual question was the largest with 50%, followed by comprehension question, procedural question, application question, and prediction question. The factual and procedural questions showed that they could be used as learning activities during the teaching-learning process. Comprehension questions included in the wonderment question can be used as a learning question. And the application question is a question that can be applied to deepening activities, and the prediction question can be used in the inquiry and experiment process of learning activities.

탐구과제에 대한 사전지식이 초등과학 영재의 관찰방법과 의문에 미치는 영향 (Effect that Prior Knowledge about Research Subject Gets Primary Grade Science Brilliant Intellect's Observation Method and Question)

  • 임재근
    • 과학교육연구지
    • /
    • 제34권1호
    • /
    • pp.105-112
    • /
    • 2010
  • 본 연구의 목적은 탐구 과제에 대한 사전지식이 초등 과학 영재의 관찰방법과 의문에 미치는 영향을 알아보기 위한 것이다. 이를 위해 '거미와 거미줄' 이라는 과제를 해결하는 과정에서의 학생들의 관찰방법과 의문이 사전지식에 의해 어떻게 영향을 받는지 연구하였다. 사전지식이 상대적으로 높은 집단의 학생들은 사전지식이 낮은 집단에 비해서 관찰의 방법과 양이 높게 나타났다. 그러나 사전지식이 양이 높은 집단이나 낮은 집단 모두 관찰 방법에서는 오감을 이용한 단순 관찰을 주로 하였다. 이는 사전지식의 양이 관찰의 양에는 영향을 미치나 관찰 방법에는 영향을 주지 못하는 것으로 볼 수 있다. 사전지식이 높은 집단에서는 보다 높은 수준의 관찰이 나타났다. 이는 관찰의 수준이 높아지기 위해서는 사전에 충분한 교육이 필요함을 의미한다. 따라서 교사들이 관찰을 이용한 수업을 수행할 경우에 사전에 관련된 지식에 대하여 충분한 수업이 이루어져야 보다 효율적인 관찰이 일어난다고 볼 수 있다.

  • PDF

Enhancing Performance with a Learnable Strategy for Multiple Question Answering Modules

  • Oh, Hyo-Jung;Myaeng, Sung-Hyon;Jang, Myung-Gil
    • ETRI Journal
    • /
    • 제31권4호
    • /
    • pp.419-428
    • /
    • 2009
  • A question answering (QA) system can be built using multiple QA modules that can individually serve as a QA system in and of themselves. This paper proposes a learnable, strategy-driven QA model that aims at enhancing both efficiency and effectiveness. A strategy is learned using a learning-based classification algorithm that determines the sequence of QA modules to be invoked and decides when to stop invoking additional modules. The learned strategy invokes the most suitable QA module for a given question and attempts to verify the answer by consulting other modules until the level of confidence reaches a threshold. In our experiments, our strategy learning approach obtained improvement over a simple routing approach by 10.5% in effectiveness and 27.2% in efficiency.

Research on the Movie Reviews Regarded as Unsuccessful in Box Office Outcomes in Korea: Based on Big Data Posted on Naver Movie Portal

  • Jeon, Ho-Seong
    • 아태비즈니스연구
    • /
    • 제12권3호
    • /
    • pp.51-69
    • /
    • 2021
  • Purpose - Based on literature studies of movie reviews and movie ratings, this study raised two research questions on the contents of online word of mouth and the number of movie screens as mediator variables. Research question 1 wanted to figure out which topics of word groups had a positive or negative impact on movie ratings. Research question 2 tried to identify the role of the number of movie screens between movie ratings and box office outcomes. Design/methodology/approach - Through R program, this study collected about 82,000 movie reviews and movie ratings posted on Naver's movie website to examine the role of online word of mouths and movie screen counts in 10 movies that were considered commercially unsuccessful with fewer than 2 million viewers despite securing about 1,000 movie screens. To confirm research question 1, topic modeling, a text mining technique, was conducted on movie reviews. In addition, this study linked the movie ratings posted on Naver with information of KOBIS by date, to identify the research question 2. Findings - Through topic modeling, 5 topics were identified. Topics found in this study were largely organized into two groups, the content of the movie (topic 1, 2, 3) and the evaluation of the movie (topics 4, 5). When analyzing the relationship between movie reviews and movie ratings with 5 mediators identified in topic modeling to probe research question 1, the topic word groups related to topic 2, 3 and 5 appeared having a negative effect on the netizen's movie ratings. In addition, by connecting two secondary data by date, analysis for research question 2 was implemented. The outcomes showed that the causal relationship between movie ratings and audience numbers was mediated by the number of movie screens. Research implications or Originality - The results suggested that the information presented in text format was harder to quantify than the information provided in scores, but if content information could be digitalized through text mining techniques, it could become variable and be analyzed to identify causality with other variables. The outcomes in research question 2 showed that movie ratings had a direct impact on the number of viewers, but also had indirect effects through changes in the number of movie screens. An interesting point is that the direct effect of movie ratings on the number of viewers is found in most American films released in Korea.

Restricting Answer Candidates Based on Taxonomic Relatedness of Integrated Lexical Knowledge Base in Question Answering

  • Heo, Jeong;Lee, Hyung-Jik;Wang, Ji-Hyun;Bae, Yong-Jin;Kim, Hyun-Ki;Ock, Cheol-Young
    • ETRI Journal
    • /
    • 제39권2호
    • /
    • pp.191-201
    • /
    • 2017
  • This paper proposes an approach using taxonomic relatedness for answer-type recognition and type coercion in a question-answering system. We introduce a question analysis method for a lexical answer type (LAT) and semantic answer type (SAT) and describe the construction of a taxonomy linking them. We also analyze the effectiveness of type coercion based on the taxonomic relatedness of both ATs. Compared with the rule-based approach of IBM's Watson, our LAT detector, which combines rule-based and machine-learning approaches, achieves an 11.04% recall improvement without a sharp decline in precision. Our SAT classifier with a relatedness-based validation method achieves a precision of 73.55%. For type coercion using the taxonomic relatedness between both ATs and answer candidates, we construct an answer-type taxonomy that has a semantic relationship between the two ATs. In this paper, we introduce how to link heterogeneous lexical knowledge bases. We propose three strategies for type coercion based on the relatedness between the two ATs and answer candidates in this taxonomy. Finally, we demonstrate that this combination of individual type coercion creates a synergistic effect.

Investigating Factors Affecting Automated Question Triage for Social Reference: A Study of Adopting Decision Factors from Digital Reference

  • 박종도
    • 한국문헌정보학회지
    • /
    • 제49권1호
    • /
    • pp.483-511
    • /
    • 2015
  • The efficiency and quality of the social reference sites are being challenged because a large quantity of the questions have not been answered or satisfied for quite a long time. Main goal of this study is to investigate important factors that affect the performance of question triage to relevant answerers in the context of social reference. To achieve the goal, expert finding techniques were used to construct an automated question triage approach to resolve this problem. Furthermore, important factors affecting triage decisions in digital reference were first examined, and extended them to the social reference setting by investigating important factors affecting the performance of automated question triage in the social reference setting. The study was conducted using question-answer pairs collected from Ask Metafilter. For the evaluation, logistic regression analyses were conducted to examine which factors would significantly affect the performance of predicting relevant answerers to questions. The results of the current study have important implications for research and practice in automated question triage for social reference. Furthermore, the results will offer insights into designing user-participatory digital reference systems.

질의응답시스템에서 정답 특징에 관한 실험적 분석 (Experimental Analysis of Correct Answer Characteristics in Question Answering Systems)

  • 한경수
    • 디지털콘텐츠학회 논문지
    • /
    • 제19권5호
    • /
    • pp.927-933
    • /
    • 2018
  • 자연어 질문에 대해 답변을 찾아 제공하는 질의응답시스템의 오류에 가장 큰 영향을 미치는 요소 중 하나가 질문으로 정답을 포함하고 있을 만한 문서나 단락을 검색하는 단계이다. 검색의 성능 향상을 위해서는 정답 포함 문서 및 단락의 특징을 잘 이해해야 한다. 본 논문은 질문, 정답 포함 문서, 정답 미포함 문서로 구성된 말뭉치를 사용하여 정답 문서에는 질문 단어가 얼마나 많이 출현하는지, 출현 위치는 어떻게 분포하는지, 질문과 정답 문서의 주제는 얼마나 유사한지 등을 실험적으로 분석한다. 이를 통해 질의응답시스템을 위한 기존의 검색 연구 결과들에 대한 원인을 설명하고 효과적인 검색 단계의 필요 요소에 관해 논의한다.