Search | Korea Science

High-Quality Multimodal Dataset Construction Methodology for ChatGPT-Based Korean Vision-Language Pre-training (ChatGPT 기반 한국어 Vision-Language Pre-training을 위한 고품질 멀티모달 데이터셋 구축 방법론)

Jin Seong;Seung-heon Han;Jong-hun Shin;Soo-jong Lim;Oh-woog Kwon
- Annual Conference on Human and Language Technology
- /
- 2023.10a
- /
- pp.603-608
- /
- 2023
본 연구는 한국어 Vision-Language Pre-training 모델 학습을 위한 대규모 시각-언어 멀티모달 데이터셋 구축에 대한 필요성을 연구한다. 현재, 한국어 시각-언어 멀티모달 데이터셋은 부족하며, 양질의 데이터 획득이 어려운 상황이다. 따라서, 본 연구에서는 기계 번역을 활용하여 외국어(영문) 시각-언어 데이터를 한국어로 번역하고 이를 기반으로 생성형 AI를 활용한 데이터셋 구축 방법론을 제안한다. 우리는 다양한 캡션 생성 방법 중, ChatGPT를 활용하여 자연스럽고 고품질의 한국어 캡션을 자동으로 생성하기 위한 새로운 방법을 제안한다. 이를 통해 기존의 기계 번역 방법보다 더 나은 캡션 품질을 보장할 수 있으며, 여러가지 번역 결과를 앙상블하여 멀티모달 데이터셋을 효과적으로 구축하는데 활용한다. 뿐만 아니라, 본 연구에서는 의미론적 유사도 기반 평가 방식인 캡션 투영 일치도(Caption Projection Consistency) 소개하고, 다양한 번역 시스템 간의 영-한 캡션 투영 성능을 비교하며 이를 평가하는 기준을 제시한다. 최종적으로, 본 연구는 ChatGPT를 이용한 한국어 멀티모달 이미지-텍스트 멀티모달 데이터셋 구축을 위한 새로운 방법론을 제시하며, 대표적인 기계 번역기들보다 우수한 영한 캡션 투영 성능을 증명한다. 이를 통해, 우리의 연구는 부족한 High-Quality 한국어 데이터 셋을 자동으로 대량 구축할 수 있는 방향을 보여주며, 이 방법을 통해 딥러닝 기반 한국어 Vision-Language Pre-training 모델의 성능 향상에 기여할 것으로 기대한다.
PDF

Generating Sponsored Blog Texts through Fine-Tuning of Korean LLMs (한국어 언어모델 파인튜닝을 통한 협찬 블로그 텍스트 생성)

Bo Kyeong Kim;Jae Yeon Byun;Kyung-Ae Cha
- Journal of Korea Society of Industrial Information Systems
- /
- v.29 no.3
- /
- pp.1-12
- /
- 2024
In this paper, we fine-tuned KoAlpaca, a large-scale Korean language model, and implemented a blog text generation system utilizing it. Blogs on social media platforms are widely used as a marketing tool for businesses. We constructed training data of positive reviews through emotion analysis and refinement of collected sponsored blog texts and applied QLoRA for the lightweight training of KoAlpaca. QLoRA is a fine-tuning approach that significantly reduces the memory usage required for training, with experiments in an environment with a parameter size of 12.8B showing up to a 58.8% decrease in memory usage compared to LoRA. To evaluate the generative performance of the fine-tuned model, texts generated from 100 inputs not included in the training data produced on average more than twice the number of words compared to the pre-trained model, with texts of positive sentiment also appearing more than twice as often. In a survey conducted for qualitative evaluation of generative performance, responses indicated that the fine-tuned model's generated outputs were more relevant to the given topics on average 77.5% of the time. This demonstrates that the positive review generation language model for sponsored content in this paper can enhance the efficiency of time management for content creation and ensure consistent marketing effects. However, to reduce the generation of content that deviates from the category of positive reviews due to elements of the pre-trained model, we plan to proceed with fine-tuning using the augmentation of training data.
https://doi.org/10.9723/jksiis.2024.29.3.001 인용 PDF

Prompt Engineering Technique for efficient use of ChatGPT (ChatGPT 를 효율적으로 사용하기 위한 Prompt Engineering 기법 )

Gyeong-Won Jang;Seong-Soo Han
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.05a
- /
- pp.595-597
- /
- 2023
대규모 언어 모델에 기반한 AI 챗봇인 ChatGPT 의 사용과 출력 품질을 개선하는 데 있어 Prompt Engineering 의 중요성이 확대되고 있다. Prompt Engineering 은 AI 모델에서 보다 정확하고 관련성 높은 응답을 생성하기 위해 프롬프트의 요소를 선택하고 구성하는 작업을 포함한다. 본 논문에서는 ChatGPT 에서 정보나 답변을 효과적으로 추출하는 데 사용할 수 있는 다양한 Prompt Engineering 기법을 소개하고 이러한 기법이 실제 시나리오에 어떻게 적용될 수 있는지에 대한 예를 제공한다.
https://doi.org/10.3745/PKIPS.y2023m05a.595 인용 PDF

Infrastructure Proposal for the Safe Implementation of Private LLMs in SME: sLLM and Cloud Based Approach (중소기업에서의 안전한 프라이빗 LLM 도입을 위한 인프라 제안: sLLM 과 클라우드 기반으로)

Jiwon Hong;Eunseon Ryu;Jiyoon Baek;Seoyeong Kim;Jungjoo Oh
- Proceedings of the Korea Information Processing Society Conference
- /
- 2024.05a
- /
- pp.350-351
- /
- 2024
최근 몇 년 간 대규모 언어 모델의 발전과 보급이 비즈니스 운영 통합을 가속화하고 있다. 그러나 내부 데이터 유출과 같은 문제로 많은 기업들이 보다 안전한 프라이빗 LLM 을 도입하려는 움직임을 보이고 있다. 대기업과 공공기관은 높은 비용을 부담하여 온프레미스 솔루션을 선택할 수 있으나, 중소 기업 혹은 개인에게는 예산과 기술적인 한계가 존재하기 때문에 별도의 인프라가 요구된다. 이에, 본 연구는 클라우드 서비스와 네트워크 망분리를 사용하여 중소기업이 내부 데이터를 안전하게 관리하며 LLM 을 도입할 수 있는 방안을 제시하며, RAG 모델을 통한 기술적 향상 가능성 또한 제시한다.
PDF

A Study on the Data Literacy Education in the Library of the Chat GPT, Generative AI Era (ChatGPT, 생성형 AI 시대 도서관의 데이터 리터러시 교육에 대한 연구)

Jeong-Mee Lee
- Journal of the Korean Society for Library and Information Science
- /
- v.57 no.3
- /
- pp.303-323
- /
- 2023
The purpose of this study is to introduce this language model in the era of generative AI such as ChatGPT, and to provide direction for data literacy education components in libraries using it. To this end, the following three research questions are proposed. First, the technical features of ChatGPT-like language models are examined, and then, it is argued that data literacy education is necessary for the proper and accurate use of information by users using a service platform based on generative AI technology. Finally, for library data literacy education in the ChatGPT era, it is proposed a data literacy education scheme including seven components such as data understanding, data generation, data collection, data verification, data management, data use and sharing, and data ethics. In conclusion, since generative AI technologies such as ChatGPT are expected to have a significant impact on users' information utilization, libraries should think about the advantages, disadvantages, and problems of these technologies first, and use them as a basis for further improving library information services.
https://doi.org/10.4275/KSLIS.2023.57.3.303 인용 PDF

Can ChatGPT Pass the National Korean Occupational Therapy Licensure Examination? (ChatGPT는 한국작업치료사면허시험에 합격할 수 있을까?)

Hong, Junhwa;Kim, Nayeon;Min, Hyemin;Yang, Hamin;Lee, Sihyun;Choi, Seojin;Park, Jin-Hyuck
- Therapeutic Science for Rehabilitation
- /
- v.13 no.1
- /
- pp.65-74
- /
- 2024
Objective : This study assessed ChatGPT, an artificial intelligence system based on a large language model, for its ability to pass the National Korean Occupational Therapy Licensure Examination (NKOTLE). Methods : Using NKOTLE questions from 2018 to 2022, provided by the Korea Health and Medical Personnel Examination Institute, this study employed English prompts to determine the accuracy of ChatGPT in providing correct answers. Two researchers independently conducted the entire process, and the average accuracy of both researchers was used to determine whether ChatGPT passed over the 5-year period. The degree of agreement between ChatGPT answers of the two researchers was assessed. Results : ChatGPT passed the 2020 examination but failed to pass the other 4 years' examination. Specifically, its accuracy in questions related to medical regulations ranged from 25% to 57%, whereas its accuracy in other questions exceeded 60%. ChatGPT exhibited a strong agreement between researchers, except for medical regulation questions, and this agreement was significantly correlated with accuracy. Conclusion : There are still limitations to the application of ChatGPT to answer questions influenced by language or culture. Future studies should explore its potential as an educational tool for students majoring in occupational therapy through optimized prompts and continuous learning from the data.
https://doi.org/10.22683/tsnr.2024.13.1.065 인용 PDF

A Study on Prompt Engineering Techniques based on chatGPT (ChatGPT를 기반으로 한 프롬프트 엔지니어링 기법 연구)

Myung-Suk Lee
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2023.07a
- /
- pp.715-718
- /
- 2023
본 연구는 ChatGPT 모델의 특성과 장점을 활용하여 프롬프트 엔지니어링 기법을 연구하고자 하였다. 프롬프트는 엔지니어가 원하는 결과를 잘 얻을 수 있도록 하는 것이 목표이기 때문에 ChatGPT와 프롬프트 엔지니어링의 상호작용과 효과적인 프롬프트 엔지니어링 기법을 개발할 필요가 있다. 연구 방법으로는 ChatGPT에 대한 학습자 사전 설문조사에서 학습자를 분석하였고, 이를 반영하여 프로그래밍 문제를 제시하고 해결하는 과정을 거치면서 다양한 ChatGPT 사용에 대한 분석과 학습자 분석이 이루어졌다. 그 결과 비전공자가 듣고 있는 프로그래밍 수업에서 ChatGPT를 활용하여 얻은 통찰력으로 프롬프트에 필요한 가이드 라인을 마련하였다. 본 연구를 기반으로 향후 비전공자를 위한 파이썬 프로그래밍 수업에서 ChatGPT를 활용한 수업모델을 제시하고 학습자의 피드백 또는 적응형 학습에 활용할 수 있는 방법을 모색할 것이다.
PDF

Building Sentiment-Annotated Datasets for Training a FbSA model based on the SSP methodology (반자동 언어데이터 증강 방식에 기반한 FbSA 모델 학습을 위한 감성주석 데이터셋 FeSAD 구축)

Yoon, Jeong-Woo;Hwang, Chang-Hoe;Choi, Su-Won;Nam, Jee-Sun
- Annual Conference on Human and Language Technology
- /
- 2021.10a
- /
- pp.66-71
- /
- 2021
본 연구는 한국어 자질 기반 감성분석(Feature-based Sentiment Analysis: FbSA)을 위한 대규모의 학습데이터 구축에 있어 반자동 언어데이터 증강 기법(SSP: Semi-automatic Symbolic Propagation)에 입각한 자질-감성 주석 데이터셋 FeSAD(Feature-Sentiment-Annotated Dataset)의 개발 과정과 성능 평가를 소개하는 것을 목표로 한다. FeSAD는 언어자원을 활용한 SSP 1단계 주석 이후, 작업자의 주석이 2단계에서 이루어지는 2-STEP 주석 과정을 통해 구축된다. SSP 주석을 위한 언어자원에는 부분 문법 그래프(Local Grammar Graph: LGG) 스키마와 한국어 기계가독형 전자사전 DECO(Dictionnaire Electronique du COréen)가 활용되며, 본 연구에서는 7개의 도메인(코스메틱, IT제품, 패션/의류, 푸드/배달음식, 가구/인테리어, 핀테크앱, KPOP)에 대해, 오피니언 트리플이 주석된 FeSAD 데이터셋을 구축하는 프로세싱을 소개하였다. 코스메틱(COS)과 푸드/배달음식(FOO) 두 도메인에 대해, 언어자원을 활용한 1단계 SSP 주석 성능을 평가한 결과, 각각 F1-score 0.93과 0.90의 성능을 보였으며, 이를 통해 FbSA용 학습데이터 주석을 위한 작업자의 작업이 기존 작업의 10% 이하의 비중으로 감소함으로써, 학습데이터 구축을 위한 프로세싱의 소요시간과 품질이 획기적으로 개선될 수 있음을 확인하였다.
PDF

Dynamic Web Design with AI-Driven Templates: Proposing Personalized Solutions for Enhanced User Engagement (AI 기반 템플릿을 사용한 동적 웹 설계: 사용자 참여도 향상을 위한 맞춤형 솔루션 제안)

Jong-In Choi;Do-Yoon Kim;Tae-Woong Shim;Kwang-Young Park
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.05a
- /
- pp.565-566
- /
- 2023
본 논문은 AI 기술을 활용하여 웹 디자인 및 개발 과정에서 생산성과 비용 측면에서 발생하는 문제점을 해결하기 위한 솔루션을 제안하고 있다. ChatGPT와 같은 대규모 언어 모델을 기반으로 사용자가 원하는 웹사이트 유형과 데이터를 입력하면, AI가 적절한 웹 템플릿을 제시하고, 해당 템플릿을 기반으로 동적인 웹사이트를 자동으로 제작하는 방식으로 작동한다. 이를 통해 개발자는 웹사이트 디자인 및 레이아웃 개발에 소요되는 시간과 비용을 절감할 수 있다. 이러한 방식은 웹사이트를 더 사용자 친화적이고 참여도 높은 것으로 만들어, 고객 유치와 이타적 상호작용을 유도하며, 회사나 조직의 성공에 기여할 수 있다.
https://doi.org/10.3745/PKIPS.y2023m05a.565 인용 PDF

Large-scale Language-image Model-based Bag-of-Objects Extraction for Visual Place Recognition (영상 기반 위치 인식을 위한 대규모 언어-이미지 모델 기반의 Bag-of-Objects 표현)

Seung Won Jung;Byungjae Park
- Journal of Sensor Science and Technology
- /
- v.33 no.2
- /
- pp.78-85
- /
- 2024
We proposed a method for visual place recognition that represents images using objects as visual words. Visual words represent the various objects present in urban environments. To detect various objects within the images, we implemented and used a zero-shot detector based on a large-scale image language model. This zero-shot detector enables the detection of various objects in urban environments without additional training. In the process of creating histograms using the proposed method, frequency-based weighting was applied to consider the importance of each object. Through experiments with open datasets, the potential of the proposed method was demonstrated by comparing it with another method, even in situations involving environmental or viewpoint changes.
https://doi.org/10.46670/JSST.2024.33.2.78 인용 PDF

Search Result 90, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)