• Title/Summary/Keyword: learning through the image

Search Result 925, Processing Time 0.032 seconds

A dual path encoder-decoder network for placental vessel segmentation in fetoscopic surgery

  • Yunbo Rao;Tian Tan;Shaoning Zeng;Zhanglin Chen;Jihong Sun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.1
    • /
    • pp.15-29
    • /
    • 2024
  • A fetoscope is an optical endoscope, which is often applied in fetoscopic laser photocoagulation to treat twin-to-twin transfusion syndrome. In an operation, the clinician needs to observe the abnormal placental vessels through the endoscope, so as to guide the operation. However, low-quality imaging and narrow field of view of the fetoscope increase the difficulty of the operation. Introducing an accurate placental vessel segmentation of fetoscopic images can assist the fetoscopic laser photocoagulation and help identify the abnormal vessels. This study proposes a method to solve the above problems. A novel encoder-decoder network with a dual-path structure is proposed to segment the placental vessels in fetoscopic images. In particular, we introduce a channel attention mechanism and a continuous convolution structure to obtain multi-scale features with their weights. Moreover, a switching connection is inserted between the corresponding blocks of the two paths to strengthen their relationship. According to the results of a set of blood vessel segmentation experiments conducted on a public fetoscopic image dataset, our method has achieved higher scores than the current mainstream segmentation methods, raising the dice similarity coefficient, intersection over union, and pixel accuracy by 5.80%, 8.39% and 0.62%, respectively.

A Study on the Recognition of Face Based on CNN Algorithms (CNN 알고리즘을 기반한 얼굴인식에 관한 연구)

  • Son, Da-Yeon;Lee, Kwang-Keun
    • Korean Journal of Artificial Intelligence
    • /
    • v.5 no.2
    • /
    • pp.15-25
    • /
    • 2017
  • Recently, technologies are being developed to recognize and authenticate users using bioinformatics to solve information security issues. Biometric information includes face, fingerprint, iris, voice, and vein. Among them, face recognition technology occupies a large part. Face recognition technology is applied in various fields. For example, it can be used for identity verification, such as a personal identification card, passport, credit card, security system, and personnel data. In addition, it can be used for security, including crime suspect search, unsafe zone monitoring, vehicle tracking crime.In this thesis, we conducted a study to recognize faces by detecting the areas of the face through a computer webcam. The purpose of this study was to contribute to the improvement in the accuracy of Recognition of Face Based on CNN Algorithms. For this purpose, We used data files provided by github to build a face recognition model. We also created data using CNN algorithms, which are widely used for image recognition. Various photos were learned by CNN algorithm. The study found that the accuracy of face recognition based on CNN algorithms was 77%. Based on the results of the study, We carried out recognition of the face according to the distance. Research findings may be useful if face recognition is required in a variety of situations. Research based on this study is also expected to improve the accuracy of face recognition.

Curriculum Design for Digital Fashion Film Making (디지털 패션필름 제작 교과에 관한 커리큘럼 개발)

  • Mikyung Kim;Eunhyuk Yim
    • Fashion & Textile Research Journal
    • /
    • v.25 no.4
    • /
    • pp.429-438
    • /
    • 2023
  • In the 21st century fashion industry, the rise of digital environments has transformed it into a dynamic medium, expanding the horizons of media utilization. Consequently, digital fashion film has emerged as a pivotal tool for fashion communication. Functioning as a visual expression medium, fashion film animates fashion concepts into immersive moving images. Proficiency in digital fashion communication has become imperative, considering the attributes of fashion media. Notably, the role of creative directors in ensuring coherent communication across diverse fashion media platforms has gained prominence, underscoring the need for systematic fashion education to nurture specialized talent. This study, therefore, devised a comprehensive curriculum amalgamating fashion communication and practical digital media skills, implemented within fashion major courses. Through this approach, students gained experimental media proficiency and explored innovative approaches to crafting fashion films that eloquently convey fashion narratives. The participants were exposed to the entire spectrum of fashion media production, encompassing digital storytelling, fashion film conceptualization, filming techniques, meticulous editing, and adept utilization of special effects technology. The study's pedagogical strategy, characterized by a focused learning trajectory, garnered significant acclaim. In essence, this study holds significance by formulating a curriculum that nurtures the imaginative and pragmatic aptitudes of fashion majors, immersing them in the dynamic realm of rapidly evolving digital fashion films and their integration with fashion content.

Implementation of Optical Pattern Recognition System Based on Perceptron Neural Network (Perceptron 신경회로망에 근거한 광 패턴인식 시스템의 구현)

  • 한종욱;용상순;이진호;이기서;김은수
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.16 no.6
    • /
    • pp.545-555
    • /
    • 1991
  • In this paper, We discuss optical implementation of new optical adaptive patern recognition system based on single layer perception with learning capability and associative memory model having error corrective capability. The single layer perceptron is optically implemented by using 2 D LCTV spatial light modulators through the nonlinear quantization and polarization encoding methods, and 2 D hopfield associative memory is also implemented by using multifocus holographic lens. From some experimental results on classfication of Arabic numbers into even & edd numbers, it is shown that the proposed system can classify the patterns to the right classes correctly even for the partial and erronenous input patterns. Accordingly, the proposed optical adaptive pattern recognition system can be suggested for practical application in the fields of image processing and pattern recognition.

  • PDF

A study of intelligent system to improve the accuracy of pattern recognition (패턴인식의 정화성을 향상하기 위한 지능시스템 연구)

  • Chung, Sung-Boo;Kim, Joo-Woong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.7
    • /
    • pp.1291-1300
    • /
    • 2008
  • In this paper, we propose a intelligent system to improve the accuracy of pattern recognition. The proposed intelligent system consist in SOFM, LVQ and FCM algorithm. We are confirmed the effectiveness of the proposed intelligent system through the several experiments that classify Fisher's Iris data and face image data that offered by ORL of Cambridge Univ. and EMG data. As the results of experiments, the proposed intelligent system has better accuracy of pattern recognition than general LVQ.

User Data Collection and Personalization Services in Mobile Shopping Environment (모바일 쇼핑 환경에서 사용자 데이터 수집 및 개인화 서비스 방법)

  • Kim, Sung-jin;Kim, Sung-gyu;Oh, Chang-heon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.560-561
    • /
    • 2018
  • The spread of smartphones is increasing the proportion of mobile shopping in the online shopping market. Most mobile shopping services are delivered through applications. However, personalization services are very important for user data collection and analysis. Therefore, in this paper, we implemented the product barcode recognition function and machine learning-based product image recognition function using smartphones camera to collect user data in mobile shopping environment. The implemented function and push notification services enabled the collection and analysis of user data and personalization services for online shopping platform applications.

  • PDF

An ANN-based gesture recognition algorithm for smart-home applications

  • Huu, Phat Nguyen;Minh, Quang Tran;The, Hoang Lai
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.5
    • /
    • pp.1967-1983
    • /
    • 2020
  • The goal of this paper is to analyze and build an algorithm to recognize hand gestures applying to smart home applications. The proposed algorithm uses image processing techniques combing with artificial neural network (ANN) approaches to help users interact with computers by common gestures. We use five types of gestures, namely those for Stop, Forward, Backward, Turn Left, and Turn Right. Users will control devices through a camera connected to computers. The algorithm will analyze gestures and take actions to perform appropriate action according to users requests via their gestures. The results show that the average accuracy of proposal algorithm is 92.6 percent for images and more than 91 percent for video, which both satisfy performance requirements for real-world application, specifically for smart home services. The processing time is approximately 0.098 second with 10 frames/sec datasets. However, accuracy rate still depends on the number of training images (video) and their resolution.

Study on hole-filling technique of motion capture images using GANs (Generative Adversarial Networks) (GANs(Generative Adversarial Networks)를 활용한 모션캡처 이미지의 hole-filling 기법 연구)

  • Shin, Kwang-Seong;Shin, Seong-Yoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.160-161
    • /
    • 2019
  • As a method for modeling a three-dimensional object, there are a method using a 3D scanner, a method using a motion capture system, and a method using a Kinect system. Through this method, a portion that is not captured due to occlusion occurs in the process of creating a three-dimensional object. In order to implement a perfect three-dimensional object, it is necessary to arbitrarily fill the obscured part. There is a technique to fill the unexposed part by various image processing methods. In this study, we propose a method using GANs, which is the latest trend of unsupervised machine learning, as a method for more natural hole-filling.

  • PDF

Classification Performance Analysis of Silicon Wafer Micro-Cracks Based on SVM (SVM 기반 실리콘 웨이퍼 마이크로크랙의 분류성능 분석)

  • Kim, Sang Yeon;Kim, Gyung Bum
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.33 no.9
    • /
    • pp.715-721
    • /
    • 2016
  • In this paper, the classification rate of micro-cracks in silicon wafers was improved using a SVM. In case I, we investigated how feature data of micro-cracks and SVM parameters affect a classification rate. As a result, weighting vector and bias did not affect the classification rate, which was improved in case of high cost and sigmoid kernel function. Case II was performed using a more high quality image than that in case I. It was identified that learning data and input data had a large effect on the classification rate. Finally, images from cases I and II and another illumination system were used in case III. In spite of different condition images, good classification rates was achieved. Critical points for micro-crack classification improvement are SVM parameters, kernel function, clustered feature data, and experimental conditions. In the future, excellent results could be obtained through SVM parameter tuning and clustered feature data.

Real-time Abnormal Behavior Analysis System Based on Pedestrian Detection and Tracking (보행자의 검출 및 추적을 기반으로 한 실시간 이상행위 분석 시스템)

  • Kim, Dohun;Park, Sanghyun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.25-27
    • /
    • 2021
  • With the recent development of deep learning technology, computer vision-based AI technologies have been studied to analyze the abnormal behavior of objects in image information acquired through CCTV cameras. There are many cases where surveillance cameras are installed in dangerous areas or security areas for crime prevention and surveillance. For this reason, companies are conducting studies to determine major situations such as intrusion, roaming, falls, and assault in the surveillance camera environment. In this paper, we propose a real-time abnormal behavior analysis algorithm using object detection and tracking method.

  • PDF