• Title/Summary/Keyword: Deep Learning AI

Methods of Classification and Character Recognition for Table Items through Deep Learning (딥러닝을 통한 문서 내 표 항목 분류 및 인식 방법)

  • Lee, Dong-Seok;Kwon, Soon-Kak
    • Journal of Korea Multimedia Society / v.24 no.5 / pp.651-658 / 2021
  • In this paper, we propose methods for character recognition and classification of table items through deep learning. First, table areas are detected in a document image using a CNN. The table areas are then divided along separators such as vertical lines. The text in the document is recognized by a neural network that combines a CNN and an RNN. To correct character recognition errors, multiple candidates for the recognized result are provided for any sentence with low recognition accuracy.

A study on real-time internet comment system through sentiment analysis and deep learning application

  • Hae-Jong Joo;Ho-Bin Song
    • Journal of Platform Technology / v.11 no.2 / pp.3-14 / 2023
  • This paper proposes a big data sentiment analysis method and a deep learning implementation that power a webtoon comment analysis web page, so that webtoon writers can conveniently check comments and receive feedback, in support of the cartoon industry within the video animation field. To overcome the difficulty of automatically analyzing Internet comments and to provide varied sentiment analysis information, an LSTM (Long Short-Term Memory) algorithm, a ranking algorithm, and a word2vec algorithm are applied in parallel, and actual popular works are used to verify validity. The proposed analysis method is easy to extend to other domestic and overseas platforms and is expected to be usable in various video animation content fields, not only webtoons.

Methodology for Apartment Space Arrangement Based on Deep Reinforcement Learning

  • Cheng Yun Chi;Se Won Lee
    • Architectural research / v.26 no.1 / pp.1-12 / 2024
  • This study introduces a deep reinforcement learning (DRL)-based methodology for optimizing apartment space arrangements, addressing the limitations of human capability in evaluating all potential spatial configurations. Leveraging computational power, the methodology facilitates the autonomous exploration and evaluation of innovative layout options, considering architectural principles, legal standards, and client requirements. Through comprehensive simulation tests across various apartment types, the research demonstrates the DRL approach's effectiveness in generating efficient spatial arrangements that align with current design trends and meet predefined performance objectives. The comparative analysis of AI-generated layouts with those designed by professionals validates the methodology's applicability and potential in enhancing architectural design practices by offering novel, optimized spatial configuration solutions.

Generating 2D LEGO Instruction Manual Using Deep Learning Model (딥러닝 모델을 이용한 2D 레고 조립 설명서 생성)

  • Jongseok Ahn;Seunghyeon Lee;Cheolhee Kim;Donghee Kang
    • Proceedings of the Korean Society of Computer Information Conference / 2024.01a / pp.481-484 / 2024
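  • In this paper, we propose an assembly and manual generation system that uses deep learning to generate LEGO® assembly instruction manuals. The system automatically generates a LEGO assembly manual from a single image provided by the user. It uses deep learning-based image segmentation to separate the object from the background and generates the assembly manual from the result, and an assembly algorithm was newly designed for this purpose. The system overcomes the limitations of existing LEGO products and lets users freely assemble a variety of models from the parts they have. It also simplifies the complex LEGO assembly process and helps lower the barrier to assembly.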

A review of artificial intelligence based demand forecasting techniques (인공지능 기반 수요예측 기법의 리뷰)

  • Jeong, Hyerin;Lim, Changwon
    • The Korean Journal of Applied Statistics / v.32 no.6 / pp.795-835 / 2019
  • Big data is being generated in various fields. Many companies now try to profit by building systems that analyze big data with artificial intelligence (AI) techniques. Integrating AI technology has made analyzing and utilizing vast amounts of data increasingly valuable. In particular, accurate demand forecasting is critical to government and business management in fields such as finance, procurement, production, and marketing. In this case, it is important to apply an appropriate model that considers the demand pattern of each field. It is possible to analyze complex patterns of real data that can also be enlarged by a traditional time series model or regression model. However, choosing the right model among the many available is difficult without prior knowledge. Many studies based on AI techniques such as machine learning and deep learning have been shown to overcome these problems. In addition, demand forecasting that analyzes structured data together with unstructured data such as images or text has also shown high accuracy. This paper introduces important areas where demand forecasting is relatively active, along with machine learning and deep learning techniques that consider the characteristics of each field.

Implementation of the Stone Classification with AI Algorithm Based on VGGNet Neural Networks (VGGNet을 활용한 석재분류 인공지능 알고리즘 구현)

  • Choi, Kyung Nam
    • Smart Media Journal / v.10 no.1 / pp.32-38 / 2021
  • Image classification of photographs through deep learning has been a very active research field for the past several years. In this paper, we propose a method for automatically discriminating images of domestically sourced stone through deep learning. Python's hash library is used to scan 300×300 pixel photographs of granites such as Hwangdeungseok, Goheungseok, and Pocheonseok. As data preprocessing, the images for each stone are checked for duplicates, images with the same hash value are removed, and the remaining images form the training images for each stone. To use VGGNet, the images for each stone are then resized to 224×224 pixels and learned with VGG16, with training and validation data split 80% to 20%. After training, the loss function graph and the accuracy graph were generated, and the prediction results of the deep learning model were output for the three kinds of stone images.
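
The hash-based duplicate removal and the 80%/20% split described in this abstract can be sketched as follows. This is a minimal illustration, not the author's code: the abstract only says that Python's hash library is used, so the SHA-256 digest, the in-memory `images` dictionary, the file names, and the helper names `deduplicate_by_hash` and `split_train_validation` are all assumptions.

```python
import hashlib
import random


def deduplicate_by_hash(images):
    """Keep the first image seen for each distinct content hash.

    `images` maps a file name to raw image bytes (hypothetical input format).
    """
    seen = set()
    unique = {}
    for name, data in images.items():
        # SHA-256 is an assumption; the abstract does not name the hash function.
        digest = hashlib.sha256(data).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique[name] = data
    return unique


def split_train_validation(names, train_ratio=0.8, seed=0):
    """Shuffle file names reproducibly and split them 80% / 20% by default."""
    shuffled = sorted(names)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_ratio)
    return shuffled[:cut], shuffled[cut:]


# Toy data: two of the three "images" carry identical bytes.
images = {
    "hwangdeungseok_001.jpg": b"\x01\x02\x03",
    "hwangdeungseok_002.jpg": b"\x01\x02\x03",  # duplicate content
    "hwangdeungseok_003.jpg": b"\x04\x05\x06",
}
unique = deduplicate_by_hash(images)
train_names, validation_names = split_train_validation(unique.keys())
```

Hashing the file bytes only catches exact duplicates; resized or re-encoded copies of the same photograph would need a perceptual hash instead.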

DQN Reinforcement Learning for Mountain-Car in OpenAI Gym Environment (OpenAI Gym 환경의 Mountain-Car에 대한 DQN 강화학습)

  • Myung-Ju Kang
    • Proceedings of the Korean Society of Computer Information Conference / 2024.01a / pp.375-377 / 2024
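  • In this paper, we apply DQN (Deep Q-Networks) reinforcement learning to the Mountain-Car-v0 game, which can be controlled programmatically in a simple way in the OpenAI Gym environment. The DQN network used in this paper consists of one input layer, three hidden layers, and one output layer; ReLU is used as the activation function in the input and hidden layers, and a linear function in the output layer. In the experiment, we examined the reward obtained in each episode when DQN reinforcement learning was run on Mountain-Car-v0 and analyzed how many episodes fell into each reward range. As a result, 85 of the 100 episodes obtained a reward of 50 or more.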
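
The layer layout in this abstract (ReLU in the input and hidden layers, a linear output layer) can be sketched as a plain-Python forward pass. This is an illustration under assumptions, not the paper's implementation: the layer width of 24, the random initialization, and the helper names are hypothetical, and the "input layer" is read here as a dense layer of its own, which is one possible interpretation. The input size of 2 (position, velocity) and the 3 discrete actions follow the Mountain-Car environment definition.

```python
import random


def relu(values):
    return [max(0.0, v) for v in values]


def dense(inputs, weights, biases):
    """Fully connected layer: one weight row and one bias per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def build_q_network(layer_sizes, seed=0):
    """Create randomly initialized (weights, biases) pairs for each layer."""
    rng = random.Random(seed)
    network = []
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_out)]
        network.append((weights, [0.0] * n_out))
    return network


def q_values(network, state):
    """Apply ReLU after every layer except the last, which stays linear."""
    activations = list(state)
    for index, (weights, biases) in enumerate(network):
        activations = dense(activations, weights, biases)
        if index < len(network) - 1:
            activations = relu(activations)
    return activations


def select_action(network, state, epsilon, rng):
    """Epsilon-greedy: random action with probability epsilon, else argmax Q."""
    if rng.random() < epsilon:
        return rng.randrange(len(network[-1][1]))
    q = q_values(network, state)
    return max(range(len(q)), key=q.__getitem__)


# Assumed widths: 2 observations -> input layer -> 3 hidden layers -> 3 actions.
network = build_q_network([2, 24, 24, 24, 24, 3])
state = [-0.5, 0.0]  # example (position, velocity) pair
action = select_action(network, state, epsilon=0.0, rng=random.Random(1))
```

Training (replay buffer, target network, and the temporal-difference update) is omitted; the sketch covers only how a network of this shape maps a state to one Q-value per action.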

Research on Developing a Conversational AI Callbot Solution for Medical Counselling

  • Won Ro LEE;Jeong Hyon CHOI;Min Soo KANG
    • Korean Journal of Artificial Intelligence / v.11 no.4 / pp.9-13 / 2023
  • In this study, we explored the potential of integrating interactive AI callbot technology into the medical consultation domain as part of a broader service development initiative. Aimed at enhancing patient satisfaction, the AI callbot was designed to efficiently address queries from hospitals' primary users, especially the elderly and those using phone services. By incorporating an AI-driven callbot into the hospital's customer service center, routine tasks such as appointment modifications and cancellations were efficiently managed by the AI Callbot Agent. Tasks requiring more detailed attention or specialization were handled by Human Agents, ensuring a balanced and collaborative approach. The deep learning model for voice recognition in this study was based on the Transformer model and fine-tuned from a pre-trained model to fit the medical field. Existing recording files were converted into training data, and an SSL (self-supervised learning) model was implemented. An ANN (artificial neural network) model was used to analyze voice signals and interpret them as text, and after actual deployment, the intent data was enriched through reinforcement learning to continuously improve accuracy. For TTS (Text To Speech), the Transformer model was applied to the text analysis, acoustic model, and vocoder stages, and Google's Natural Language API was applied to recognize intent. As the research progresses, there are challenges to solve, such as interconnection issues between various EMR providers, problems with doctors' time slots, problems with appointments at two or more hospitals, and problems with patient use. However, there are specialized problems that are easy to make reservations. The callbot service appears to be immediately applicable in hospitals.

Remote Multi-control Smart Farm with Deep Learning Growth Diagnosis Function

  • Kim, Mi-jin;Kim, Ji-ho;Lee, Dong-hyeon;Han, Jung-hoon
    • Journal of the Korea Society of Computer and Information / v.27 no.9 / pp.49-57 / 2022
  • Currently, food shortage is emerging as a problem in our society due to climate problems and a growing world population. As a solution to this problem, we propose a multi-remote control smart farm that combines artificial intelligence (AI) and information and communication technology (ICT). The proposed smart farm integrates ICT to remotely control and manage crops without restrictions of space and time, and to multi-control the growing environment of crops. In addition, using Arduino and deep learning technology, a smart farm capable of multiple control through a smartphone application (APP) was proposed, and AI technology that secures various data and provides diagnosis functions while observing crop growth in real time was included. Various sensors in the smart farm are controlled using the Arduino, and the sensor values are stored in a purpose-built database so that the user can check the stored data with the APP. For multiple control of multiple crops, a separate LED, cooling fan, and water pump were applied to each of two or more growing environments so that the user could control them conveniently. By implementing an APP that diagnoses the growth stage through the TensorFlow framework using deep learning technology, we developed an application that helps users easily diagnose the growth status of the current crop.

Characterization of Deep Learning-Based and Hybrid Iterative Reconstruction for Image Quality Optimization at Computer Tomography Angiography (전산화단층촬영조영술에서 화질 최적화를 위한 딥러닝 기반 및 하이브리드 반복 재구성의 특성분석)

  • Pil-Hyun, Jeon;Chang-Lae, Lee
    • Journal of the Korean Society of Radiology / v.17 no.1 / pp.1-9 / 2023
  • For optimal image quality in computed tomography angiography (CTA), different iodine concentrations and scan parameters were applied to quantitatively evaluate the image quality characteristics of filtered back projection (FBP), hybrid-iterative reconstruction (hybrid-IR), and deep learning reconstruction (DLR). A 320-row-detector CT scanner scanned a phantom with various iodine concentrations (1.2, 2.9, 4.9, 6.9, 10.4, 14.3, 18.4, and 25.9 mg/mL) located at the edge of a cylindrical water phantom with a diameter of 19 cm. Data obtained using each reconstruction technique were analyzed in terms of noise, coefficient of variation (COV), and root mean square error (RMSE). As the iodine concentration increased, the CT number increased, but the noise showed no particular trend. COV decreased with increasing iodine concentration for FBP, adaptive iterative dose reduction (AIDR) 3D, and advanced intelligent clear-IQ engine (AiCE) at various tube voltages and tube currents. In addition, when the iodine concentration was low, there was a slight difference in COV between the reconstruction techniques, but there was little difference as the iodine concentration increased. For AiCE, RMSE decreased as the iodine concentration increased but then increased again beyond a specific concentration (4.9 mg/mL). Therefore, for optimal CTA image acquisition, the user will have to consider scan parameters such as tube current and tube voltage, as well as iodine concentration, according to the reconstruction technique.
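
The two summary metrics named in this abstract can be sketched with their common textbook definitions: COV as standard deviation divided by mean, and RMSE as the square root of the mean squared difference from a reference. The abstract does not state the exact formulas used in the paper, so the population standard deviation, the choice of reference values, and the function names below are assumptions.

```python
import math
import statistics


def coefficient_of_variation(roi_values):
    """Standard deviation of the region-of-interest values divided by their mean.

    Uses the population standard deviation (an assumption). The mean must be
    nonzero, which holds for iodine inserts with clearly positive CT numbers.
    """
    return statistics.pstdev(roi_values) / statistics.fmean(roi_values)


def root_mean_square_error(measured, reference):
    """Square root of the mean squared difference between paired values."""
    if len(measured) != len(reference):
        raise ValueError("measured and reference must have the same length")
    squared = [(m - r) ** 2 for m, r in zip(measured, reference)]
    return math.sqrt(statistics.fmean(squared))


# Made-up CT numbers for illustration only; not data from the paper.
roi = [90.0, 110.0]
cov = coefficient_of_variation(roi)
rmse = root_mean_square_error([101.0, 99.0], [100.0, 100.0])
```

Because COV divides by the mean CT number, it naturally falls as iodine concentration (and therefore mean CT number) rises when noise stays roughly constant, which is consistent with the trend the abstract reports.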