• Title/Summary/Keyword: text recognition

A Study on Hangul Handwriting Generation and Classification Mode for Intelligent OCR System (지능형 OCR 시스템을 위한 한글 필기체 생성 및 분류 모델에 관한 연구)

  • Jin-Seong Baek;Ji-Yun Seo;Sang-Joong Jung;Do-Un Jeong
    • Journal of the Institute of Convergence Signal Processing / v.23 no.4 / pp.222-227 / 2022
  • In this paper, we implemented a Korean text generation and classification model based on deep learning that can be applied to various industries. The system consists of two models: a GAN-based Korean handwriting generation model and a CNN-based Korean handwriting classification model. The GAN model consists of a generator that produces fake Korean handwriting data and a discriminator that distinguishes fake handwriting data from real data. The CNN model was trained on the 'PHD08' dataset, and it was confirmed that it classified Korean handwriting with 92.45% accuracy. When the Korean handwriting data generated by the GAN model was combined with the CNN model's existing training dataset, the classification performance reached 96.86%, which was superior to the existing classification performance.

Development of Dog Name Recommendation System for the Image Abstraction (이미지 추상화 기법을 이용한 반려견 이름 추천 시스템 개발)

  • Jae-Heon Lee;Ye-Rin Jeong;Mi-Kyeong Moon;Seung-Min Park
    • The Journal of the Korea institute of electronic communication sciences / v.18 no.2 / pp.313-320 / 2023
  • The cumulative number of registered dogs rose from 1.07 million in 2016 to 2.32 million in 2020. Animal registration is increasing by more than 10% every year, and a name must be decided when a dog is registered. Owners want a name that fits the dog's appearance, but choosing one is often difficult. This paper describes the development of a system that recognizes dog images and recommends dog names based on similar objects or foods. The system extracts similarities to a dog's image using models trained on images of various objects and foods, and recommends names based on those similarities. In addition, by recommending additional related words based on the image data of the result, it gives users a wider range of options and increases convenience, interest, and fun. The system is expected to help users resolve their concerns about naming their dogs, easily check names that suit their dogs, and increase satisfaction by offering a variety of recommended names.

A Korean Multi-speaker Text-to-Speech System Using d-vector (d-vector를 이용한 한국어 다화자 TTS 시스템)

  • Kim, Kwang Hyeon;Kwon, Chul Hong
    • The Journal of the Convergence on Culture Technology / v.8 no.3 / pp.469-475 / 2022
  • Training a deep learning-based single-speaker TTS model requires a speech DB of tens of hours and a long training time. This makes training multi-speaker or personalized TTS models inefficient in terms of time and cost. The voice cloning method uses a speaker encoder model to build the TTS model of a new speaker. The trained speaker encoder creates a speaker embedding vector representing the timbre of the new speaker from a small amount of that speaker's speech data, which was not used for training. In this paper, we propose a multi-speaker TTS system to which voice cloning is applied. The proposed TTS system consists of a speaker encoder, a synthesizer, and a vocoder. The speaker encoder applies the d-vector technique used in the speaker recognition field. The timbre of the new speaker is expressed by adding the d-vector derived from the trained speaker encoder as an input to the synthesizer. The experimental results from the MOS and timbre similarity listening tests show that the proposed TTS system performs excellently.
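
The speaker-conditioning step described in this abstract can be illustrated with a minimal sketch. The abstract does not say how the d-vector is computed or how it is combined with the synthesizer input, so two common conventions are assumed here: the d-vector is the L2-normalized average of frame-level speaker-encoder outputs, and it is concatenated to every timestep of the text encoding. The function names `d_vector` and `condition_on_speaker` are hypothetical.

```python
import math


def d_vector(frame_embeddings):
    """Average frame-level embeddings and L2-normalize the result.

    Assumption: averaging followed by normalization is one common way to
    build a d-vector; the paper's exact procedure is not given in the abstract.
    """
    dim = len(frame_embeddings[0])
    count = len(frame_embeddings)
    mean = [sum(frame[i] for frame in frame_embeddings) / count for i in range(dim)]
    norm = math.sqrt(sum(x * x for x in mean)) or 1.0  # avoid division by zero
    return [x / norm for x in mean]


def condition_on_speaker(text_encodings, speaker_vector):
    """Append the speaker vector to each timestep of the text encoding.

    Assumption: concatenation is used here for illustration; the abstract
    only says the d-vector is added as an input to the synthesizer.
    """
    return [list(step) + list(speaker_vector) for step in text_encodings]


# Toy data standing in for speaker-encoder outputs and text-encoder outputs.
frames = [[3.0, 0.0], [3.0, 8.0]]
speaker = d_vector(frames)
conditioned = condition_on_speaker([[1.0, 2.0], [3.0, 4.0]], speaker)
```

Each conditioned timestep carries both the text features and the same speaker vector, which is what lets a single synthesizer produce different timbres.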

Using Roots and Patterns to Detect Arabic Verbs without Affixes Removal

  • Abdulmonem Ahmed;Aybaba Hancrliogullari;Ali Riza Tosun
    • International Journal of Computer Science & Network Security / v.23 no.4 / pp.1-6 / 2023
  • Morphological analysis, a branch of natural language processing, is now a rapidly growing field. The fundamental tenet of morphological analysis is that it can establish the roots or stems of words and enable comparison to the original term. Arabic is a highly inflected and derivational language with a strong structure. Each root or stem can have a large number of affixes attached to it due to the non-concatenative nature of Arabic morphology, increasing the number of possible inflected words that can be created. Accurate verb recognition and extraction are necessary for nearly all problems in well-known research areas, including Web Search, Information Retrieval, Machine Translation, Question Answering, and so forth. In this work, we designed and implemented an algorithm to detect and recognize Arabic verbs in Arabic text. The suggested technique was created with "Python" and the "pyqt5" visual package, allowing for quick modification and easy addition of new patterns. We employed 17 alternative patterns to represent all verbs in terms of singular, plural, masculine, and feminine pronouns as well as past, present, and imperative verb tenses. All of the verbs that matched these patterns were used when a verb has a root, and the outcomes were reliable. The approach is able to recognize all verbs with the same structure without requiring any alterations to the code or design. The verbs that are not recognized by our method have no antecedents in the Arabic roots. According to our work, the strategy can rapidly and precisely identify verbs with roots, but it cannot be used to identify verbs that are not in the Arabic language. As a result, we advise employing a hybrid approach that combines many principles.

Visualizing Unstructured Data using a Big Data Analytical Tool R Language (빅데이터 분석 도구 R 언어를 이용한 비정형 데이터 시각화)

  • Nam, Soo-Tai;Chen, Jinhui;Shin, Seong-Yoon;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.05a / pp.151-154 / 2021
  • Big data analysis is the process of discovering meaningful new correlations, patterns, and trends in large volumes of data stored in data stores and creating new value. Most big data analysis techniques therefore include data mining, machine learning, natural language processing, and pattern recognition as used in existing statistics and computer science. Using the R language, a big data tool, analysis results can be expressed through various visualization functions applied to pre-processed text data. The data analyzed in this study came from 21 papers published in March 2021 in a journal of the Korea Institute of Information and Communication Engineering. In the final analysis results, the most frequently mentioned keyword was "Data", which ranked first with 305 occurrences. Based on the results of the analysis, the limitations of the study and theoretical implications are suggested.
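
The keyword-frequency step behind a result such as "Data" ranking first can be sketched as follows. The study itself used R; this Python sketch is only an illustration, and the tokenization rule, the stopword set, and the function name `keyword_frequencies` are assumptions rather than the authors' procedure.

```python
import re
from collections import Counter


def keyword_frequencies(texts, stopwords=frozenset()):
    """Count lowercase alphabetic tokens across texts, skipping stopwords.

    Assumption: simple regex tokenization stands in for whatever
    pre-processing the study applied to its text data.
    """
    counts = Counter()
    for text in texts:
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(token for token in tokens if token not in stopwords)
    return counts


# Toy sentences standing in for the pre-processed paper texts.
sample_texts = ["Big data analysis of data", "Data mining"]
frequencies = keyword_frequencies(sample_texts, stopwords={"of"})
top_keyword, top_count = frequencies.most_common(1)[0]
```

The resulting counts are the kind of table that a word cloud or bar chart would then visualize.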

Visualizing Article Material using a Big Data Analytical Tool R Language (빅데이터 분석 도구 R 언어를 이용한 논문 데이터 시각화)

  • Nam, Soo-Tai;Shin, Seong-Yoon;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.05a / pp.326-327 / 2021
  • Recently, the use of big data has attracted wide interest across a variety of industrial fields. Big data analysis is the process of discovering meaningful new correlations, patterns, and trends in large volumes of data stored in data stores and creating new value. Most big data analysis techniques therefore include data mining, machine learning, natural language processing, and pattern recognition as used in existing statistics and computer science. Using the R language, a big data tool, analysis results can be expressed through various visualization functions applied to pre-processed text data. The data analyzed in this study came from 29 papers in a specific journal. In the final analysis results, the most frequently mentioned keyword was "Research", which ranked first with 743 occurrences. Based on the results of the analysis, the limitations of the study and theoretical implications are suggested.

Design of Artificial Intelligence Course for Humanities and Social Sciences Majors

  • KyungHee Lee
    • Journal of the Korea Society of Computer and Information / v.28 no.4 / pp.187-195 / 2023
  • This study proposes an artificial intelligence liberal arts course for college students majoring in the humanities and social sciences, using the entry artificial intelligence model. A group of experts in computing, artificial intelligence, and pedagogy was formed, and the final course was developed through analysis of previous research and the Delphi technique. The resulting educational topics fall into four categories: image classification, image recognition, text classification, and sound classification. The training consists of 1) understanding the principles of artificial intelligence, 2) practice using the entry artificial intelligence model, 3) identifying the ethical impact, and 4) team idea meetings to solve real-life problems based on what was learned. Through this course, students can directly implement the principles of core artificial intelligence technology using the entry artificial intelligence model. Furthermore, the experience of solving various real-life problems with artificial intelligence is expected to contribute positively to understanding the technology and exploring the ethics needed in the artificial intelligence era.

Study on a Smart Cane for the Visually Impaired utilizing ESP32-CAM for Enhanced Safety (안전성 강화를 위한 ESP32-CAM을 활용한 시각장애인용 스마트지팡이에 대한 연구)

  • Doo-Hyeon-Hong;Jong-Hwan-Lim;Jun-Sun-Yu;Seung-Hyeop-Beak;Jae-Wook Kim
    • The Journal of the Korea institute of electronic communication sciences / v.18 no.6 / pp.1379-1386 / 2023
  • In this paper, research was conducted to prevent various safety accidents that may occur with infant carriages carrying children and to make infant carriages easier to use. To prevent the carriage from moving while unattended, a brake function using a pressure sensor and a servo motor is installed on the wheels. A pressure sensor and an LCD are used to determine whether the seat belt is fastened, preventing the child from falling out of the carriage. In addition, the carriage is designed to use an LCD and LEDs to turn on a warning light when the temperature and humidity exceed a certain level, so that infants remain in a comfortable environment while using it.

COVID-19 vaccine-induced immune thrombotic thrombocytopenia: a review

  • Siti Nur Atikah Aishah Suhaimi;Izzati Abdul Halim Zaki;Zakiah Mohd Noordin;Nur Sabiha Md Hussin;Long Chiau Ming;Hanis Hanum Zulkifly
    • Clinical and Experimental Vaccine Research / v.12 no.4 / pp.265-290 / 2023
  • Rare but serious thrombotic incidents in relation to thrombocytopenia, termed vaccine-induced immune thrombotic thrombocytopenia (VITT), have been observed since the vaccine rollout, particularly among recipients of replication-defective adenoviral vector-based severe acute respiratory syndrome coronavirus 2 vaccines. Herein, we comprehensively reviewed and summarized reported studies of VITT following coronavirus disease 2019 (COVID-19) vaccination to determine its prevalence, clinical characteristics, and management. A literature search up to October 1, 2021 using PubMed and SCOPUS identified a combined total of 720 articles. Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guideline, after screening the titles and abstracts based on the eligibility criteria, the remaining 47 full-text articles were assessed for eligibility and 29 studies were included. Findings revealed that VITT cases are strongly related to viral vector-based vaccines, namely the AstraZeneca COVID-19 vaccine (95%) and the Janssen COVID-19 vaccine (4%), with much rarer reports involving messenger RNA-based vaccines such as the Moderna COVID-19 vaccine (0.2%) and the Pfizer COVID-19 vaccine (0.2%). The most severe manifestation of VITT is cerebral venous sinus thrombosis, with 317 cases (70.4%), and the earliest primary symptom in the majority of cases is headache. Intravenous immunoglobulin and non-heparin anticoagulants are the main therapeutic options for managing immune responses and thrombosis, respectively. As knowledge of VITT is emerging and the published guidelines are being refined, this review may assist the medical community in early VITT recognition and in understanding its clinical presentations, diagnostic criteria, and management, offering a window of opportunity to VITT patients. Trials with larger sample sizes could further elucidate the link and safety profile.

An Analysis of News Media Coverage of the QRcode: Based on 2008-2023 News Big Data (QR코드에 대한 언론 보도 경향: 2008-2023년 뉴스 빅데이터 분석)

  • Sunjeong Kim;Jisu Lee
    • Journal of the Korean Society for Information Management / v.41 no.2 / pp.269-294 / 2024
  • This study analyzed the news media coverage of QRcodes in Korea over a 16-year period (2008 to 2023). A total of 13,335 articles were extracted from the Korea Press Foundation's BigKinds. Quantitative and content analyses were conducted on the news frames. The results indicated that the quantity of news coverage has increased. The greatest quantity of news coverage was observed in 2020, and the most frequently discussed topic in the news was 'IT_Science'. The keyword analysis indicated that the primary words were 'QRcode', 'smartphone', 'service', 'application', and 'payment'. The news media primarily focused on the QRcode's ability to provide instant access and on its recognition technology. This study demonstrates that advanced information and communication technologies and the increased prevalence of mobile devices have led to a rise in the use of QRcodes. Furthermore, QRcodes have become a significant information medium in contemporary society.