Search | Korea Science

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
- Journal of Intelligence and Information Systems
- /
- v.29 no.1
- /
- pp.143-174
- /
- 2023
With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.
https://doi.org/10.13088/jiis.2023.29.1.143 인용 PDF

KorQuAD 2.0: Korean QA Dataset for Web Document Machine Comprehension (KorQuAD 2.0: 웹문서 기계독해를 위한 한국어 질의응답 데이터셋)

Kim, Youngmin;Lim, Seungyoung;Lee, Hyunjeong;Park, Soyoon;Kim, Myungji
- Annual Conference on Human and Language Technology
- /
- 2019.10a
- /
- pp.97-102
- /
- 2019
KorQuAD 2.0은 총 100,000+ 쌍으로 구성된 한국어 질의응답 데이터셋이다. 기존 질의응답 표준 데이터인 KorQuAD 1.0과의 차이점은 크게 세가지가 있는데 첫 번째는 주어지는 지문이 한두 문단이 아닌 위키백과 한 페이지 전체라는 점이다. 두 번째로 지문에 표와 리스트도 포함되어 있기 때문에 HTML tag로 구조화된 문서에 대한 이해가 필요하다. 마지막으로 답변이 단어 혹은 구의 단위뿐 아니라 문단, 표, 리스트 전체를 포괄하는 긴 영역이 될 수 있다. Baseline 모델로 구글이 오픈소스로 공개한 BERT Multilingual을 활용하여 실험한 결과 F1 스코어 46.0%의 성능을 확인하였다. 이는 사람의 F1 점수 85.7%에 비해 매우 낮은 점수로, 본 데이터가 도전적인 과제임을 알 수 있다. 본 데이터의 공개를 통해 평문에 국한되어 있던 질의응답의 대상을 다양한 길이와 형식을 가진 real world task로 확장하고자 한다.
PDF

A Study on the Effective Preprocessing Methods for Accelerating Point Cloud Registration

Chungsu, Jang;Yongmin, Kim;Taehyun, Kim;Sunyong, Choi;Jinwoo, Koh;Seungkeun, Lee
- Korean Journal of Remote Sensing
- /
- v.39 no.1
- /
- pp.111-127
- /
- 2023
In visual slam and 3D data modeling, the Iterative Closest Point method is a primary fundamental algorithm, and many technical fields have used this method. However, it relies on search methods that take a high search time. This paper solves this problem by applying an effective point cloud refinement method. And this paper also accelerates the point cloud registration process with an indexing scheme using the spatial decomposition method. Through some experiments, the results of this paper show that the proposed point cloud refinement method helped to produce better performance.
https://doi.org/10.7780/kjrs.2023.39.1.8 인용 PDF HTML

Korean Question Generation Using Co-Attention Layer of Answer and Passage (정답과 구절의 공동 주의 집중 계층을 이용한 한국어 질문 생성)

Kim, Jintae;Noh, Hyungjong;Lee, Yeonsoo;Kim, Harksoo
- Annual Conference on Human and Language Technology
- /
- 2019.10a
- /
- pp.315-320
- /
- 2019
질문 생성이란 구절이 입력되면 구절에서 답을 찾을 수 있는 질문을 자동으로 생성하는 작업으로 교육용 시스템, 대화 시스템, QA 시스템 등 다양한 분야에서 중요한 역할을 한다. 질문 생성에서 정답의 단어가 질문에 포함되는 문제점을 해결하기 위해 구절과 정답을 분리한다. 하지만 구절과 정답을 분리하게 되면 구절에서 정답의 정보가 손실되고, 정답에서는 구절의 문맥 정보가 손실되어 정답 유형에 맞는 질문을 생성할 수 없는 문제가 발생된다. 본 논문은 이러한 문제를 해결하기 위해 분리된 정답과 구절의 정보를 연결시켜주는 정답과 구절의 공동 주의 집중 계층을 제안한다. 23,658개의 질문-응답 쌍의 말뭉치를 이용한 실험에서 정답과 구절의 공동 주의 집중 계층이 성능 향상에 기여해 우수한 성능(BLEU-26.7, ROUGE-57.5)을 보였다.
PDF

Artificial Intelligence in Gastric Cancer Imaging With Emphasis on Diagnostic Imaging and Body Morphometry

Kyung Won Kim;Jimi Huh ;Bushra Urooj ;Jeongjin Lee ;Jinseok Lee ;In-Seob Lee ;Hyesun Park ;Seongwon Na ;Yousun Ko
- Journal of Gastric Cancer
- /
- v.23 no.3
- /
- pp.388-399
- /
- 2023
Gastric cancer remains a significant global health concern, coercing the need for advancements in imaging techniques for ensuring accurate diagnosis and effective treatment planning. Artificial intelligence (AI) has emerged as a potent tool for gastric-cancer imaging, particularly for diagnostic imaging and body morphometry. This review article offers a comprehensive overview of the recent developments and applications of AI in gastric cancer imaging. We investigated the role of AI imaging in gastric cancer diagnosis and staging, showcasing its potential to enhance the accuracy and efficiency of these crucial aspects of patient management. Additionally, we explored the application of AI body morphometry specifically for assessing the clinical impact of gastrectomy. This aspect of AI utilization holds significant promise for understanding postoperative changes and optimizing patient outcomes. Furthermore, we examine the current state of AI techniques for the prognosis of patients with gastric cancer. These prognostic models leverage AI algorithms to predict long-term survival outcomes and assist clinicians in making informed treatment decisions. However, the implementation of AI techniques for gastric cancer imaging has several limitations. As AI continues to evolve, we hope to witness the translation of cutting-edge technologies into routine clinical practice, ultimately improving patient care and outcomes in the fight against gastric cancer.
https://doi.org/10.5230/jgc.2023.23.e30 인용 PDF

A Study on User Switching Intention from Contact Center-oriented to AI Chatbot-Oriented Customer Services (컨택센터 중심에서 인공지능 챗봇 중심 고객 서비스로의 사용자 전환의도에 관한 연구)

Ann Seunggyu;Ahn Hyunchul
- Journal of Korea Society of Digital Industry and Information Management
- /
- v.19 no.1
- /
- pp.57-76
- /
- 2023
This study analyzes the factors and effects on the users' intention to switch from contact center-oriented to AI chatbot-oriented customer services by combining Push-Pull-Mooring Model and provides insights for companies considering the adoption of AI chatbots. To test the model, we surveyed users with experience using chatbots at least once across different age groups. Finally, we analyzed 176 cases for the analysis using IBM SPSS Statistics and SmartPLS 4.0. The results of hypotheses testing rejected the hypotheses for variables of inconsistent quality and low availability of push factors and low switching cost of mooring factor while accepting the hypotheses for the tardy response of push factors and all pull factors. Therefore, these findings provide important implications for researchers and practitioners who wish to conduct research or adopt AI chatbots. In conclusion, users do not feel inconvenienced by the contact center-oriented service but also perceive high trust and convenience with AI chatbot-oriented service. However, despite low switching costs, users consider chatbots a complementary tool rather than an alternative. So, companies adopting AI chatbots should consider what features the users expect from AI chatbots and facilitate these features when implementing AI chatbots.
https://doi.org/10.17662/ksdim.2023.19.1.057 인용 PDF HTML

Question Generation of Machine Reading Comprehension for Data Augmentation and Domain Adaptation (추가 데이터 및 도메인 적응을 위한 기계독해 질의 생성)

Lee, Hyeon-gu;Jang, Youngjin;Kim, Jintae;Wang, JiHyun;Shin, Donghoon;Kim, Harksoo
- Annual Conference on Human and Language Technology
- /
- 2019.10a
- /
- pp.415-418
- /
- 2019
기계독해 모델에 새로운 도메인을 적용하기 위해서는 도메인에 맞는 데이터가 필요하다. 그러나 추가 데이터 구축은 많은 비용이 발생한다. 사람이 직접 구축한 데이터 없이 적용하기 위해서는 자동 추가 데이터 확보, 도메인 적응의 문제를 해결해야한다. 추가 데이터 확보의 경우 번역, 질의 생성의 방법으로 연구가 진행되었다. 그러나 도메인 적응을 위해서는 새로운 정답 유형에 대한 질의가 필요하며 이를 위해서는 정답 후보 추출, 추출된 정답 후보로 질의를 생성해야한다. 본 논문에서는 이러한 문제를 해결하기 위해 듀얼 포인터 네트워크 기반 정답 후보 추출 모델로 정답 후보를 추출하고, 포인터 제너레이터 기반 질의 생성 모델로 새로운 데이터를 생성하는 방법을 제안한다. 실험 결과 추가 데이터 확보의 경우 KorQuAD, 경제, 금융 도메인의 데이터에서 모두 성능 향상을 보였으며, 도메인 적응 실험에서도 새로운 도메인의 문맥만을 이용해 데이터를 생성했을 때 기존 도메인과 다른 도메인에서 모두 기계독해 성능 향상을 보였다.
PDF

Proposed of AI-Model Information Management Structure for Media Service Construction based on Edge (엣지 기반 미디어 서비스 구성을 위한 AI모델 정보 관리구조의 제안)

Yeom, Jeongcheol;Kum, Seungwoo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2022.06a
- /
- pp.84-86
- /
- 2022
최근 미디어, 금융 등 다양한 분야의 기업들이 AI를 활용해 제공하는 서비스가 늘어남에 따라 학습된 모델을 엣지 자원에 배포하여 기능을 제공하는 서비스형태 또한 늘어나고 있다. AI-Application이 동작하기 위해서는 AI-Model 파일뿐 아니라 동작을 위한 설정 파일들이 필요하여 AI-Application이 사용 중인 AI-Model의 정보를 수집, 관리하는 것은 중요한 이슈라고 할 수 있다. 하지만 단일 서비스서버에서 동작하는 형태가 아닌 각 자원이 산재되어 다양한 형태로 서비스를 제공하는 엣지컴퓨팅의 구조적인 특성상 AI-Application의 기존 서비스구조, 기능을 수정하지 않고 정보를 수집하는 과정은 다양한 문제에 부딪치게 된다. 이에 따라 본 논문에서는 기존 서비스구조를 변경하지 않고 독립적으로 AI-Application에서 사용중인 AI-Model의 정보를 파악하고, 사용자 요청에 대응할 수 있는 관리구조를 제안한다.
PDF

A Study of AI model extraction attack and defense techniques (AI 모델 탈취 공격 및 방어 기법들에 관한 연구)

Jun, So-Hee;Lee, Young-Han;Kim, Hyun-Jun;Paek, Yun-Heung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2021.05a
- /
- pp.382-384
- /
- 2021
AI (Artificial Intelligence)기술이 상용화되면서 최근 기업들은 AI 모델의 기능을 서비스화하여 제공하고 있다. 하지만 최근 이러한 서비스를 이용하여 기업이 자본을 투자해 학습시킨 AI 모델을 탈취하는 공격이 등장하여 위협이 되고 있다. 본 논문은 최근 연구되고 있는 이러한 모델 탈취 공격들에 대해 공격자의 정보를 기준으로 분류하여 서술한다. 또한 본 논문에서는 모델 탈취 공격에 대응하기 위해 다양한 관점에서 시도되는 방어 기법들에 대해 서술한다.
https://doi.org/10.3745/PKIPS.y2021m05a.382 인용 PDF

Cost Effective 60Hz FHD LCD with 800Mbps AiPi Technology

Nam, Hyoung-Sik;Oh, Kwan-Young;Kim, Seon-Ki;Kim, Nam-Deog;Berkeley, Brian H.;Kim, Sang-Soo;Lee, Yong-Jae;Nakajima, Keiichi
- 한국정보디스플레이학회:학술대회논문집
- /
- 2008.10a
- /
- pp.677-680
- /
- 2008
AiPi technology incorporates an embedded clock and control scheme with a point-to-point bus topology, achieving the smallest possible number of interface lines between a timing controller and source drivers. A 46" AiPi-based 10-bit FHD prototype requires only 20 interface lines, compared to 38 lines for mini-LVDS. The measured maximum data rate per one data pair is more than 800Mbps.
PDF

Search Result 661, Processing Time 0.035 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)