Search | Korea Science

Candidate Word List and Probability Score Guided for Korean Scene Text Recognition (후보 단어 리스트와 확률 점수에 기반한 한국어 문자 인식 모델)

Lee, Yoonji;Lee, Jong-Min
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.05a
- /
- pp.73-75
- /
- 2022
Scene Text Recognition is a technology used in the field of artificial intelligence that requires manless robot, automatic vehicles and human-computer interaction. Though scene text images are distorted by noise interference, such as illumination, low resolution and blurring. Unlike previous studies that recognized only English, this paper shows a strong recognition accuracy including various characters, English, Korean, special character and numbers. Instead of selecting only one class having the highest probability value, a candidate word can be generated by considering the probability value of the second rank as well, thus a method can be corrected an existing language misrecognition problem.
PDF

KorQuAD 2.0: Korean QA Dataset for Web Document Machine Comprehension (KorQuAD 2.0: 웹문서 기계독해를 위한 한국어 질의응답 데이터셋)

Kim, Youngmin;Lim, Seungyoung;Lee, Hyunjeong;Park, Soyoon;Kim, Myungji
- Annual Conference on Human and Language Technology
- /
- 2019.10a
- /
- pp.97-102
- /
- 2019
KorQuAD 2.0은 총 100,000+ 쌍으로 구성된 한국어 질의응답 데이터셋이다. 기존 질의응답 표준 데이터인 KorQuAD 1.0과의 차이점은 크게 세가지가 있는데 첫 번째는 주어지는 지문이 한두 문단이 아닌 위키백과 한 페이지 전체라는 점이다. 두 번째로 지문에 표와 리스트도 포함되어 있기 때문에 HTML tag로 구조화된 문서에 대한 이해가 필요하다. 마지막으로 답변이 단어 혹은 구의 단위뿐 아니라 문단, 표, 리스트 전체를 포괄하는 긴 영역이 될 수 있다. Baseline 모델로 구글이 오픈소스로 공개한 BERT Multilingual을 활용하여 실험한 결과 F1 스코어 46.0%의 성능을 확인하였다. 이는 사람의 F1 점수 85.7%에 비해 매우 낮은 점수로, 본 데이터가 도전적인 과제임을 알 수 있다. 본 데이터의 공개를 통해 평문에 국한되어 있던 질의응답의 대상을 다양한 길이와 형식을 가진 real world task로 확장하고자 한다.
PDF

Survey on DGA Botnet Domain Detection and Family Classification (DGA 봇넷 도메인 감지 및 패밀리 분류 연구 동향)

Jungmin Lee;Minjae Kang;Yeonjoon Lee
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.11a
- /
- pp.543-546
- /
- 2023
봇넷은 지속적으로 사이버 범죄에 이용되고 있으며 네트워크 환경에 큰 위협이 되고 있다. 기존에는 봇들이 C&C 서버와 통신하는 것을 방지하기 위해 블랙리스트를 기반으로 DNS 서버에서 봇넷 도메인을 탐지하는 방식을 주로 사용하였다. 그러나 도메인 생성 알고리즘(DGA)을 이용하는 봇넷이 증가하면서 기존에 사용하던 블랙리스트 기반의 도메인 차단 방식으로는 더 이상 봇넷 도메인을 효율적으로 차단하기 어려워졌다. 이에 따라 봇넷 도메인 생성 알고리즘을 통해 생성되는 도메인의 특성을 분석하고 이를 토대로 봇넷 도메인을 식별하고 차단하고자 하는 시도가 계속되고 있다. 특히 연속적인 데이터 처리에 주로 사용되는 딥러닝 알고리즘을 이용하여 봇넷 도메인의 특징을 효과적으로 추출하고 정확도가 높은 탐지 모델을 구축하고자 하는 연구가 주를 이루고 있으며, 탐지뿐만 아니라 봇넷 그룹(Family) 분류까지 연구가 확장되고 있다. 이에 본 논문에서는 봇넷 도메인 생성 알고리즘에 의해 생성되는 봇넷 도메인을 식별 및 분류하기 위해 딥러닝 기술을 적용한 최근 연구 동향을 조사하고 앞으로의 연구 방향성을 논의하고자 한다.
https://doi.org/10.3745/PKIPS.y2023m11a.543 인용 PDF

A Study on Simplification of Machine Learning Model (기계학습 모델의 간략화 방법에 대한 연구)

Lee, Gye-Sung;Kim, In-Kook
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.16 no.4
- /
- pp.147-152
- /
- 2016
One of major issues in machine learning that extracts and acquires knowledge implicit in data is to find an appropriate way of representing it. Knowledge can be represented by a number of structures such as networks, trees, lists, and rules. The differences among these exist not only in their structures but also in effectiveness of the models for their problem solving capability. In this paper, we propose partition utility as a criterion function for clustering that can lead to simplification of the model and thus avoid overfitting problem. In addition, a heuristic is proposed as a way to construct balanced hierarchical models.
https://doi.org/10.7236/JIIBC.2016.16.4.147 인용 PDF KSCI

Differences in the analysis of a model's relationship marketing factors for TV home shopping fashion stylist (TV 홈쇼핑 패션스타일리스트에 대한 모델의 관계마케팅 요인 분석의 차이)

An, Si Hyun;Chung, Sung Jee
- Journal of the Korea Fashion and Costume Design Association
- /
- v.20 no.2
- /
- pp.63-71
- /
- 2018
The purpose of this study is to establish a relationship between TV home shopping model and the marketing data of the TV home shopping industry. Differences in relationship marketing factors, trust, and intent to reuse depending on the experience of the model have resulted in a higher assessment of both the expertise and service factors of fashion stylists than groups with 10 years or less experience. In addition, model groups with 15 or more broadcasts in one month rated the professionalism, communication, ties, trust, and intent to reuse fashion stabilists more than 15 model groups. The difference in marketing factors, services, communications and ties between the professional and use of home shopping models was found to be 40 years old as compared to those in their 20s and 30s. Finally, in terms of the gender of the home shopping model, the difference between the marketing factors and the reliability of the relationship and the intent to reuse, the professional, communication and bonding, trust, and re-purpose factors were all rated higher by the female group than the male group. The results of the study suggest that a relationship marketing strategy needs to be established between a fashion stylist and a TV home shopping model, and fashion stylists should be judged based on the characteristics of a TV home shopping model.
https://doi.org/10.30571/kfcda.2018.20.2.63 인용 PDF

A Natural Language Information Retrieval Model using Automatic Network and Two-level Document Ranking (자동 키워드망과 2단계 문서 순위 결정에 의한 자연어 정보검색 모델)

Kang, Hyun-Kyu;Park, Se-Young;Choi, Key-Sun
- Annual Conference on Human and Language Technology
- /
- 1995.10a
- /
- pp.8-12
- /
- 1995
본 논문은 정보검색에서 사용자에게 순서화된 문서를 제시하기 이전에 1차로 검색된 문서들에 대하여 자동 키워드망과 2단계로 문서 순위 결정하는 모델에 대하여 논하였다. 자연어 검색을 위한 색인은 자동으로 구축된 키워드 색인으로 1차로 자연어 검색을 하고, 2차로 자동 키워드망을 이용한 순위재조정을 통해 검색효율의 향상에 관해 검색 효율을 평가하여 1차 검색 결과보다 최대 10.9%의 검색효율 향상을 보였다. 또한 문서 순위 조정 방법에 있어서 여러 가지 공식을 비교 분석하였으며 내용 검색을 반영하는 공식을 찾았다. 본 논문에서 제시한 2단계 순위 결정 방법은 리스트를 기반으로 하는 정보 검색의 분야에 적용되어 검색효율을 높일 수 있는 한가지 방법이 될 수 있을 것이다.
PDF

Effective Cultural Properties management and Accident Prevention Using GIS (GIS를 활용한 효율적인 문화재관리 및 사고예방)

Song, Sang-Hun;Jeong, Jong-Pil
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2010.07a
- /
- pp.307-310
- /
- 2010
본 연구에서는 GIS(Geographic Information System : 지리정보시스템)을 활용한 문화재 유형 및 위험요인 분석을 통해 문화재사고 위험성 분석 지역을 선정하였다. 선정된 지역의 문화재 방재 시스템 구축현황 분석, 체크리스트 평가 모델에 의한 문화재사고위험성평가를 통해 도출된 결과로 문화재 관리에 대한 문제점과 종합적인 대책 및 사고 예방을 위한 개선방안 제시로 사고발생을 최소화 할 수 있을 것으로 사료된다.
PDF

An Implementation of the Multimedia Dynamic Authoring System based on Causality Model (인과성 모델에 기반 한 멀티미디어 동적 저작시스템 구현)

Shin Hyun-san
- Journal of Internet Computing and Services
- /
- v.5 no.6
- /
- pp.67-77
- /
- 2004
In this paper, we implement the multimedia dynamic authoring system based on causality model. we define two specifications which support user to specify intuitively and naturally what he/she wants, The temporal specification describes causal-based temporal relationships between presentation objects, and the spatial specification describes relative layout structure among objects on the screen. Using the specifications, the system processes for multimedia documentation are one-dimensional string list, relational trees, such as temporal. spatial, and annotated composition tree generation phases.
PDF

Music Recommender System Weighting Similar Users' Preference in the Temporal Context (유사 취향 사용자의 시간 상황에 따른 선호 아이템에 가중치를 둔 음악 추천)

Park, Sung-Eun;Lee, Dong-Joo;Kahng, Min-Suk;Lee, Sang-Goo
- Proceedings of the Korean Information Science Society Conference
- /
- 2010.06c
- /
- pp.122-125
- /
- 2010
사용자와 취향이 비슷한 사용자를 찾고, 이 유사 사용자가 선호한 아이템을 추천하는 협력적 필터링방식은 일반적으로 많이 사용되는 추천 방식이다. 하지만 협력적 필터링 방식은 어떤 상황적 요소도 고려하지 않아 모든 상황에서 동일한 추천 결과를 제시하게 된다. 반면, 상황을 고려한 추천 방식은 다른 상황에서 그 상황에 적합하다고 판단되는 추천 리스트를 보여주는 다양성을 가지지만 개인의 선호를 반영하지 못하는 한계를 가진다. 이에 협력적 필터링 방식과 상황에 따른 추천 방식을 함께 고려하려는 시도가 있다. 본 논문에서는 시간 상황에 따른 음악 추천 시, 전체 상황에서 가장 유사한 사용자를 찾고 이 유사 사용자의 현재 상황에서의 선호 아이템을 추천하는 모델을 제시하고 실험을 통하여 이 모델의 한계와 실용 가능한 상황을 제시한다.
PDF

Design of an XMl Document Storage System using Object Oriented Database (객체지향형 데이터베이스를 이용한 XML 문서 저장 시스템 설계)

김영일;신동욱;권택근;김형선
- Proceedings of the Korean Information Science Society Conference
- /
- 1999.10a
- /
- pp.63-65
- /
- 1999
최근 인터넷을 통한 정보 교환을 위해 XML(eXtensible Markup Language)에 대한 저장 및 검색에 대한 연구가 활발히 진행되고 있다. 본 연구에서는 객체지향형 데이터베이스를 이용하여 대량의 XML문서에 대한 저장 및 검색을 지원하는 XML 문서 저장 시스템을 설계하였다. 제안하는 데이터 모델은 XML 문서의 삽입 및 갱신이 용이하도록 분할 방식을 사용하였으며, 객체지향형 데이터베이스에서 구조정보를 추출하기 위한 새로운 모델을 제시하고 있다. XML 문서의 주된 구조정보를 갖는 엘리먼트와 에트리뷰트를 DTD별로 저장하고, 하나의 DTD를 따르는 문서 인스턴스들에 대한 관계를 리스트롤 이용하여 저장해 둠으로서 객체지향형 데이터베이스 내에서 임의의 위치에 존재하는 객체에 대한 접근을 빠르게 지원할 수 있도록 설계하였다.
PDF

Search Result 184, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)