Search | Korea Science

KFREB: Korean Fictional Retrieval-based Evaluation Benchmark for Generative Large Language Models (KFREB: 생성형 한국어 대규모 언어 모델의 검색 기반 생성 평가 데이터셋)

Jungseob Lee;Junyoung Son;Taemin Lee;Chanjun Park;Myunghoon Kang;Jeongbae Park;Heuiseok Lim
- Annual Conference on Human and Language Technology / 2023.10a / pp.9-13 / 2023
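This paper proposes KFREB (Korean Fictional Retrieval Evaluation Benchmark), a new Korean benchmark for evaluating the retrieval-based answer generation ability of large language models. KFREB evaluates this ability on fictional information that models have not seen during pretraining. It thereby addresses the problem that answers reflecting facts seen in pretraining cannot properly measure a model's capability in a real retrieval-based answering system. Considering real service cases of retrieval-based large language models, KFREB provides evaluation cases for long documents, a gold document containing two answers, one gold document accompanied by similar distractor documents with or without keywords, and coreference multi-hop reasoning that requires cross-referencing between documents. These cases are expected to offer insight into selecting an appropriate large language model and using it in real services.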

A Study on Smart Knowledge Sharing System with Friends (지인 기반의 스마트 지식공유 시스템에 관한 연구)

Yoon, Won-Beom;Park, Kinam;Lim, Heui-Seok
- Journal of Digital Convergence / v.11 no.2 / pp.279-285 / 2013
The development of information networks and computer technology has opened up a sea of information and knowledge, and the recent popularization of smart devices has made it easy to obtain the desired information and knowledge. In this paper, a knowledge-sharing system that uses information and social networks on smart devices is proposed. The proposed system combines Internet information search for user queries, accumulated knowledge, and social network responses from acquaintances. User satisfaction was evaluated to analyze the efficacy of the proposed system. According to the experiment, the proposed knowledge-sharing system yields significantly higher user satisfaction than general information search engines.
https://doi.org/10.14400/JDPM.2013.11.2.279

A Study on Personal Experience Knowledge Evaluation Model for Knowledge Service (지식서비스를 위한 개인경험지식 분석 평가 모델 연구)

Kim, Yu-Doo;Joo, In-Hak;Park, Yun-Kyung;Moon, Il-Young;Kwon, Oh-Young
- Journal of the Korea Institute of Information and Communication Engineering / v.17 no.8 / pp.1865-1872 / 2013
Social network services are growing rapidly with the spread of smart devices, and data is increasing exponentially as many people use web services. Research on providing customized knowledge from such big data is therefore needed. In this paper, we collected data from 40 people over one month in order to implement a knowledge service using big data. From these data, we inferred location and movement type and evaluated the accuracy of the inference. On this basis, we studied a personal experience knowledge evaluation model for knowledge services.
https://doi.org/10.6109/jkiice.2013.17.8.1865

KMSCR: A system for managing knowledge assets of an IT consulting firm (IT 컨설팅 회사의 지적 자산 관리를 위한 지식관리시스템)

김수연;황현석;서의호
- Proceedings of the Korea Intelligent Information System Society Conference / 2001.06a / pp.233-239 / 2001
Most companies now recognize the importance of managing intellectual assets in order to share and reuse the knowledge and know-how needed for their work. In IT consulting firms in particular, a highly knowledge-intensive business, managing intellectual assets matters more than in any other kind of company. For a consulting firm, sharing verified intellectual assets and searching them intelligently and quickly are key factors directly tied to the quality of the consulting service and to customer satisfaction, so most consulting firms put considerable effort into managing their knowledge assets. The purpose of this paper is to raise the productivity and quality of consulting services by centrally managing the various forms of intellectual assets held by an IT consulting firm, so that consultants carrying out projects dispersed across customer sites can share them. To this end, we take an inventory of all intellectual assets managed in a consulting firm, model them, and propose a system architecture that makes them easy to store and search. The proposed architecture was implemented as a system on NT using Index Server (KMSCR: A Knowledge Management System for managing Consulting Resources). When a consultant enters a search term, KMSCR returns deliverables in various formats (.doc, .ppt, .xls, .rtf, .txt, .html, and so on) in order of relevance, helping consulting resources to be reused effectively. Search covers not only pre-registered keywords but also the text within documents, enabling more effective and efficient retrieval of consulting resources.

Search Space Reduction Model for Keyword Query Transformation on Semantic Search (시맨틱 검색에서 키워드 질의 변환을 위한 탐색 공간 축소 모델)

Yeom, Jeong-Nam;Cho, Joon-Myun;Yoo, Jeong-Ju
- Proceedings of the Korea Information Processing Society Conference / 2013.11a / pp.1390-1393 / 2013
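When an information retrieval service is offered on a device with a limited interface, precision matters more than recall. In restricted domains where data can be structured easily and precision is important, semantic search technology can provide a powerful retrieval service. There is still much room for improvement, however, because many interpretations can arise when a user's keyword query is transformed into a system query. This paper proposes a new model that improves interpretation accuracy and scalability at the same time. By restricting the structure of the space and the interpretation of its elements, the proposed model progressively reduces the size of the intermediate search space while preserving the user's search intent as far as possible. We present an experimental evaluation against other state-of-the-art techniques using a large knowledge base built from real data.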
https://doi.org/10.3745/PKIPS.y2013m11a.1390

Answer Recommendation for Knowledge Search using Term Frequency (어휘 빈도를 활용한 지식 검색에서의 답변 추천 시스템)

Lee, Ho-Chang;Tak, Hyun-Ki;Lee, Hyun-Ah
- Proceedings of the Korean Information Science Society Conference / 2012.06b / pp.315-317 / 2012
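Knowledge search services such as 지식iN (Knowledge iN) suffer from low reliability caused by wrong answers and from large numbers of duplicate answers. If, for the query 'the largest country in the world', a system could analyze the many answers related to the query and recommend the answer 'Russia' instead of listing every related question and answer, the usefulness and reliability of knowledge search would improve greatly. This paper classifies question-answer pairs into four types (word, text, table, and list) and presents an answer recommendation method for the word type. The questions retrieved for a query are clustered, and for the answers to those questions a score is computed for each word by combining TF, IDF, and inter-word distance information in various ways. The highest-scoring word in each cluster is regarded as the most important word of that cluster and is presented as the recommended answer. In a system evaluation on Naver 지식iN with 100 word-type questions, the top-ranked recommendation was correct for 68% of the questions, and the correct answer appeared within the top five for 89%.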
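
To make the scoring step in the abstract above concrete, here is a minimal sketch that ranks the words in one cluster of answers by TF multiplied by IDF. It is an illustration only, not the authors' method: the function name `recommend_answer`, the whitespace tokenization, the smoothed IDF formula, and the use of a separate background collection are all assumptions, and the inter-word distance feature mentioned in the abstract is left out.

```python
import math
from collections import Counter

def recommend_answer(answers, background):
    """Rank candidate words by TF * IDF (hypothetical scoring, see lead-in).

    answers:    list of answer strings belonging to one question cluster
    background: list of documents used only to estimate document frequency
    """
    # TF: how often each word occurs across the answers in this cluster.
    tf = Counter(word for answer in answers for word in answer.split())

    # DF: in how many background documents each word occurs.
    df = Counter()
    for doc in background:
        df.update(set(doc.split()))

    n_docs = len(background)
    scores = {}
    for word, freq in tf.items():
        # Smoothed IDF (an assumption) so that unseen words do not divide by zero.
        idf = math.log((1 + n_docs) / (1 + df[word])) + 1.0
        scores[word] = freq * idf

    # Highest-scoring word first; it would be the recommended answer.
    return sorted(scores, key=scores.get, reverse=True)
```

Called with a cluster of answers and a small background collection, the first element of the returned list plays the role of the recommended answer for that cluster.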

QualityRank : Measuring Authority of Answer in Q&A Community using Social Network Analysis (QualityRank : 소셜 네트워크 분석을 통한 Q&A 커뮤니티에서 답변의 신뢰 수준 측정)

Kim, Deok-Ju;Park, Gun-Woo;Lee, Sang-Hoon
- Journal of KIISE:Databases / v.37 no.6 / pp.343-350 / 2010
In a Knowledge Search Service (KSS) built on a Q&A community, users can obtain the answers they want by posting questions. However, finding credible documents among the enormous number available is becoming harder, because many anonymous users participate in answering questions regardless of their credibility. Previous work on KSS evaluated document quality using non-textual information (e.g., recommendation count, click count) and textual information (e.g., answer length, attached data, conjunction count), and used the evaluation results to enhance search performance. However, non-textual information is difficult to collect in sufficient quantity from users in the early stage of a Q&A thread, and textual information is limited as a quality measure because it judges by partial factors such as answer length and conjunction count. In this paper, we propose the QualityRank algorithm to address these problems. The algorithm ranks relevant and credible answers by considering textual and non-textual information together with user centrality based on Social Network Analysis (SNA). Experimental validation confirms that our algorithm improves ranking performance over using textual and non-textual information alone.

User Reputation Evaluation Using Co-occurrence Feature and Collective Intelligence (동시출현 자질과 집단 지성을 이용한 지식검색 문서 사용자 명성 평가)

Lee, Hyun-Woo;Han, Yo-Sub;Kim, LaeHyun;Cha, Jeung-Won
- Annual Conference on Human and Language Technology / 2008.10a / pp.79-84 / 2008
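In knowledge search services built on collective intelligence from the participation of many users, there is growing demand to find the desired answer quickly. Previous studies have shown that non-textual information such as view count, recommendation count, and answer count is a good feature for evaluating answers, and other studies have judged answer quality using several kinds of word dictionaries from which reliability can be estimated. However, non-textual information such as view, recommendation, and answer counts is easy for users to manipulate and must be monitored continuously, and the words used to estimate reliability must be continually supplemented. To address these problems, this paper proposes a method that evaluates user reputation by analyzing user activity in collective intelligence, using question-answer similarity computed from co-occurrence features. If user reputation can be computed, the reliability of answers with few views and recommendations can also be estimated fairly accurately. To this end, we compute user reputation with a modified PageRank algorithm. Experiments on Naver 지식iN documents showed results that can complement the existing answer selection rate.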
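
The abstract above does not say how PageRank was modified, so the following is only a sketch of standard weighted PageRank applied to a graph of users, to show the general shape of a reputation computation. The function name `user_reputation`, the edge direction (asker to answerer), and the use of question-answer similarity as the edge weight are assumptions made for illustration.

```python
def user_reputation(edges, damping=0.85, iterations=50):
    """Weighted PageRank over a user graph (illustrative, not the paper's variant).

    edges: dict mapping (asker, answerer) -> non-negative weight, for example
           a question-answer similarity score (assumed input format).
    """
    users = sorted({user for pair in edges for user in pair})
    n = len(users)

    # Total outgoing weight per user, used to normalize each edge.
    out_weight = {user: 0.0 for user in users}
    for (src, _dst), weight in edges.items():
        out_weight[src] += weight

    rank = {user: 1.0 / n for user in users}
    for _ in range(iterations):
        # Users with no outgoing weight spread their rank uniformly.
        dangling = sum(rank[u] for u in users if out_weight[u] == 0.0)
        new_rank = {u: (1.0 - damping) / n + damping * dangling / n for u in users}
        for (src, dst), weight in edges.items():
            if out_weight[src] > 0.0:
                new_rank[dst] += damping * rank[src] * weight / out_weight[src]
        rank = new_rank
    return rank
```

In this sketch a user whose answers attract heavily weighted edges from many askers ends up with a higher score, which could then be used to weight answers that have few views or recommendations.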

Neural Network based Multi-Agent Web Information Retrieval System (신경망 기반 멀티 에이전트 웹 정보 검색 시스템)

Choe, Yong-Seok;Yu, Seok-In
- Journal of KIISE:Software and Applications / v.26 no.5 / pp.665-673 / 1999
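This paper proposes a neural network based multi-agent system for web information retrieval. In the proposed system, each agent uses a neural network mechanism to learn its environment from the user's relevance feedback, finds resources that provide the information the user wants, and retrieves web information efficiently. We first present a neural network based web information retrieval agent and analyze the problems that arise when a single-agent approach is used. On this basis, we define a multi-agent web information retrieval system, describe a training procedure for acquiring retrieval knowledge from users, and explain cooperative information retrieval. Finally, we analyze the performance of the proposed system formally and compare it experimentally with existing search services.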

Research and Development of Document Recognition System for Utilizing Image Data (이미지데이터 활용을 위한 문서인식시스템 연구 및 개발)

Kwag, Hee-Kue
- The KIPS Transactions:PartB / v.17B no.2 / pp.125-138 / 2010
The purpose of this research is to enhance the document recognition system that is essential for developing a full-text retrieval system over the document image data stored in the digital library of a public institution. The main tasks of this research are: 1) analyzing the document image data and developing image preprocessing and document structure analysis technology for it, and 2) building a specialized knowledge base consisting of document layout and properties, character models, and a word dictionary. In addition, with a management tool developed for this knowledge base, the document recognition system can handle various types of document image data. We have developed a prototype document recognition system that combines the specialized knowledge base with the document structure analysis library and is adapted to the document image data housed in the National Archives of Korea. Based on these results, we plan to build a test bed and evaluate the performance of the document recognition system in order to maximize the utility of the full-text retrieval system.
https://doi.org/10.3745/KIPSTB.2010.17B.2.125
