• Title/Summary/Keyword: facial recognition

Search Result 711, Processing Time 0.026 seconds

Research on Generative AI for Korean Multi-Modal Montage App (한국형 멀티모달 몽타주 앱을 위한 생성형 AI 연구)

  • Lim, Jeounghyun;Cha, Kyung-Ae;Koh, Jaepil;Hong, Won-Kee
    • Journal of Service Research and Studies
    • /
    • v.14 no.1
    • /
    • pp.13-26
    • /
    • 2024
  • Multi-modal generation is the process of generating results based on a variety of information, such as text, images, and audio. With the rapid development of AI technology, there is a growing number of multi-modal based systems that synthesize different types of data to produce results. In this paper, we present an AI system that uses speech and text recognition to describe a person and generate a montage image. While the existing montage generation technology is based on the appearance of Westerners, the montage generation system developed in this paper learns a model based on Korean facial features. Therefore, it is possible to create more accurate and effective Korean montage images based on multi-modal voice and text specific to Korean. Since the developed montage generation app can be utilized as a draft montage, it can dramatically reduce the manual labor of existing montage production personnel. For this purpose, we utilized persona-based virtual person montage data provided by the AI-Hub of the National Information Society Agency. AI-Hub is an AI integration platform aimed at providing a one-stop service by building artificial intelligence learning data necessary for the development of AI technology and services. The image generation system was implemented using VQGAN, a deep learning model used to generate high-resolution images, and the KoDALLE model, a Korean-based image generation model. It can be confirmed that the learned AI model creates a montage image of a face that is very similar to what was described using voice and text. To verify the practicality of the developed montage generation app, 10 testers used it and more than 70% responded that they were satisfied. The montage generator can be used in various fields, such as criminal detection, to describe and image facial features.

Caricaturing using Local Warping and Edge Detection (로컬 와핑 및 윤곽선 추출을 이용한 캐리커처 제작)

  • Choi, Sung-Jin;Bae, Hyeon;Kim, Sung-Shin;Woo, Kwang-Bang
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.4
    • /
    • pp.403-408
    • /
    • 2003
  • A general meaning of caricaturing is that a representation, especially pictorial or literary, in which the subject's distinctive features or peculiarities are deliberately exaggerated to produce a comic or grotesque effect. In other words, a caricature is defined as a rough sketch(dessin) which is made by detecting features from human face and exaggerating or warping those. There have been developed many methods which can make a caricature image from human face using computer. In this paper, we propose a new caricaturing system. The system uses a real-time image or supplied image as an input image and deals with it on four processing steps and then creates a caricatured image finally. The four Processing steps are like that. The first step is detecting a face from input image. The second step is extracting special coordinate values as facial geometric information. The third step is deforming the face image using local warping method and the coordinate values acquired in the second step. In fourth step, the system transforms the deformed image into the better improved edge image using a fuzzy Sobel method and then creates a caricatured image finally. In this paper , we can realize a caricaturing system which is simpler than any other exiting systems in ways that create a caricatured image and does not need complex algorithms using many image processing methods like image recognition, transformation and edge detection.

Study on the Development of Program for Measuring Preference of Portrait based on Sensibility (감성기반 인물사진 선호도 측정 프로그램 개발 연구)

  • Lee, Chang-Seop;Har, Dong-Hwan
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.2
    • /
    • pp.178-187
    • /
    • 2018
  • This study aimed to develop a model of the program for automation measuring the preference of the portraits based on the relationship between the image quality factors and the preferences in the portraits for manufacturers aiming at high utilization of the users. in order to proceed with the evaluation, the image quality measurement was divided into objective and subjective items, and the evaluation was done through image processing and statistical methods. the image quality measurement items can be divided into objective evaluation items and subjective evaluation items. RSC Contrast, Dynamic Range and Noise were selected for the objective evaluation items, and the numerical values were statistically analyzed and evaluated through the program. Exposure, Color Tone, composition of person, position of person, and out of focus were selected for subjective evaluation items and evaluated by image processing method. By applying objective and subjective assessment items, the results were very accurate, with the results obtained by the developed program and the results of the actual visual inspection. but since the currently developed program can be evalua ted only after facial recognition of the person, future research will need to develop a program that can evaluate all kinds of portraits.

Review of Research Trends on Virtual Reality-Based Intervention for Students with Autism Spectrum Disorders and Intervention Characteristics (자폐 범주성 학생을 위한 가상현실 기반 중재 연구동향 및 중재 특성 고찰)

  • Yang, Yi;Lee, Suk-Hyang;Suh, Min-Kyung
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.2
    • /
    • pp.623-636
    • /
    • 2017
  • The use of virtual reality(VR)-based interventions for students with autism spectrum disorders(ASD) has received special attention as evidence-based practices for its feasiblity, practicality, and appropriateness. However, there is little research to investigate the effects of VR-based intervention for students with ASD in Korea. This study identifies and reviews studies applying VR-based interventions. In total, 13 experimental studies were found that examine the effects of VR interventions published from 1990 to 2016. The selected studies were analyzed by 6 variables including publication year, participants, research design, independent variable, dependent variable, and outcome. The results of this study showed the feasibility of the implementing VR-based interventions in various age group students with ASD. In addition, the utilization of VR techniques was particularly effective in improving a wide range of social communication skills including facial recognition, empathy, joint attention, understanding social context, and resolving issues due to limited cognitive abilities. Several recommendations for the future study on VR-based intervention for students with ASD such as interdisciplinary approach to VR-based interventions, support needs regarding characteristics of ASD, generalization and maintenance of acquired technology, and consideration for participants' cultural background. were discussed.

Automatic Face and Eyes Detection: A Scale and Rotation Invariant Approach based on Log-Polar Mapping (Log-Polar 사상의 크기와 회전 불변 특성을 이용한 얼굴과 눈 검출)

  • Choi, Il;Chien, Sung-Il
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.36S no.8
    • /
    • pp.88-100
    • /
    • 1999
  • Detecting human face and facial landmarks automatically in an image is as essential step to a fully automatic face recognition system. In this paper, we present a new approach to detect automatically face and its eyes of input image with scale and rotation variations of faces by using an intensity based template matching with a single log-polar face template. In a template-based matching it is necessary to normalize the scale changes and rotations of an input image to a template ones. The log-polar mapping which simulates space-variant human visual system converts scale changes and rotations of input image into constant horizontal and cyclic vertical shifts in the output plane. Intelligent use of this property allows us to shift of the candidate log-polar faces mapped at various fixation points of an input image to be matched to a template over the log-polar plane. Thus, the proposed method eliminates the need of adapting multitemplate and multiresolution schemes, which inevitably give rise to intensive computation involved to cope with scale and rotation variations of faces. Through this scale and rotation involved to cope with scale and method can lead to detecting face and its eyes simultaneously. Experimental results on a database of 795 images show over 98% detection rate.

  • PDF

An Analysis on the Empathic Changing Process of the Members in Empathy Training Program (공감훈련프로그램 참여아동의 공감표현 변화과정 분석)

  • Kim, Mi-Young
    • The Korean Journal of Elementary Counseling
    • /
    • v.7 no.1
    • /
    • pp.205-226
    • /
    • 2008
  • The purpose of the study you have seen is to verify the effectiveness of existing quantitative research and to put the Empathy Training Program to practical use for participating children. From looking into this, the changes in empathic understanding that came to light in relationships between teacher and children and children and children are sure to have that effect. For this work, I established the following subject of inquiry: What kind of changing processes can be seen in the empathic understanding of participating children in the Empathy Training Program? To resolve the above line of inquiry, six female sixth grade elementary school students were chosen and they progressed through twelve sessions of the Empathy Training Program. The children were given a sentence completion exam, recognition work, neat writing exam and a school adaptation exam both before and after participation in the program, making data for analysis. To analyze, first, participants had one or two meetings of forty to fifty minutes each. Progress through the program's curriculum was recorded and through the repeating and copying method, to be sure participating children's empathic understanding was revealed, empathic language and behavior was routinely chosen. Next, according the above criteria I looked into visible changes of the participating children's empathic expressions, classifying and analyzing changes in empathic understanding and six instances of common changes in the emphatic understanding of the participants relationships were analyzed and put together. Next I will summarize the findings we have seen in this research: First, if we look into changes in common empathic understanding from the beginning, using the criteria of empathic language, each individual showed understanding at the beginning and passed and progressed through stages of care, insight and emotional expressions. Second, when we looked at the criteria of empathic behavior from the beginning to the end, one's line of vision and ability to concentrate one's attention was connected. Next, the act of nodding one's head looked like a brief nod at first but at the end, it was not just a simple nod but rather they could feel deep empathy. The condition and substance of the facial expression was seen to match and at the very end the child was expressive and stretched out arms to hold and pat the other person and the act of holding hands could also be seen. Among lots of empathic behavior the final stage was shown by half of the children. Third, from the first stage to the last stage there were many cases revealed. The more the children went the more complete their empathic language became. Their vocabulary increased and became more diverse with empathic actions. Also, when comparing actions and expressions from the beginning with the end, visible expressions became more natural and sincere at the end. The result of the research we have seen is that through receiving experience of empathic understanding, participating children showed a sense of self-confidence and they looked to make peaceful expressions while not being aggressive or defensive about problems. In addition, from understanding empathic expressions, participating children's relationships felt closer. This outcome within this group in this case will be applied and the formation of empathic understanding can be used by the children internally to solve their own problems, acquire close relationships with their teachers and others. It will also contribute to smooth classroom management.

  • PDF

Optimization of In Vivo Stickiness Evaluation for Cosmetic Creams Using Texture Analyzer (Texture Analyzer (TA)를 이용한 화장품 크림의 In Vivo 끈적임 평가법의 최적화)

  • Ryoo, Joo-Yeon;Bae, Jung-Eun;Kang, Nae-Gyu
    • Journal of the Society of Cosmetic Scientists of Korea
    • /
    • v.46 no.4
    • /
    • pp.371-382
    • /
    • 2020
  • There have been continuous attempts to quantify sensory attributes of cosmetic products by measuring relevant physical properties. The most representative method to evaluate stickiness is to measure axial force using texture analyzer. Stickiness is known to correlate with AUC which abbreviates area under curve in the obtained axial force curve as a function of time. Recently, Normandie University research group developed in vivo stickiness evaluation method considering the characteristics of skin along with established evaluation method[8]. Based on the study, we tried to optimize in vivo stickiness evaluation method especially for cosmetic creams. The experiment was carried out on 5 different facial creams products by changing the amount and the times of rolling of creams, and the shape and material of probes. Based on the results of the sensory evaluation, the most consistent conditions were established as the optimal evaluation method. As a result, applying 70 μL of cream and rubbing 10 times for 7 s inside the 3.4 cm circle were judged to be suitable. As for the probes, spherical metallic probe was more proper due to its reproducibility. We conducted the settled method on 10 subjects to check its validity. Although the absolute values of AUC differed depending on the individuals, the AUC values were all ranked the same. Finally, for the standardization of stickiness of AUC, polyvinylpyrrolidone (PVP) was set as a reference material and we measured AUC of its aqueous solution by changing concentration. Then, the degree of stickiness recognition for 5 different creams was surveyed to check the correlation between AUC and stickiness.

Multifaceted Evaluation Methodology for AI Interview Candidates - Integration of Facial Recognition, Voice Analysis, and Natural Language Processing (AI면접 대상자에 대한 다면적 평가방법론 -얼굴인식, 음성분석, 자연어처리 영역의 융합)

  • Hyunwook Ji;Sangjin Lee;Seongmin Mun;Jaeyeol Lee;Dongeun Lee;kyusang Lim
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2024.01a
    • /
    • pp.55-58
    • /
    • 2024
  • 최근 각 기업의 AI 면접시스템 도입이 증가하고 있으며, AI 면접에 대한 실효성 논란 또한 많은 상황이다. 본 논문에서는 AI 면접 과정에서 지원자를 평가하는 방식을 시각, 음성, 자연어처리 3영역에서 구현함으로써, 면접 지원자를 다방면으로 분석 방법론의 적절성에 대해 평가하고자 한다. 첫째, 시각적 측면에서, 면접 지원자의 감정을 인식하기 위해, 합성곱 신경망(CNN) 기법을 활용해, 지원자 얼굴에서 6가지 감정을 인식했으며, 지원자가 카메라를 응시하고 있는지를 시계열로 도출하였다. 이를 통해 지원자가 면접에 임하는 태도와 특히 얼굴에서 드러나는 감정을 분석하는 데 주력했다. 둘째, 시각적 효과만으로 면접자의 태도를 파악하는 데 한계가 있기 때문에, 지원자 음성을 주파수로 환산해 특성을 추출하고, Bidirectional LSTM을 활용해 훈련해 지원자 음성에 따른 6가지 감정을 추출했다. 셋째, 지원자의 발언 내용과 관련해 맥락적 의미를 파악해 지원자의 상태를 파악하기 위해, 음성을 STT(Speech-to-Text) 기법을 이용하여 텍스트로 변환하고, 사용 단어의 빈도를 분석하여 지원자의 언어 습관을 파악했다. 이와 함께, 지원자의 발언 내용에 대한 감정 분석을 위해 KoBERT 모델을 적용했으며, 지원자의 성격, 태도, 직무에 대한 이해도를 파악하기 위해 객관적인 평가지표를 제작하여 적용했다. 논문의 분석 결과 AI 면접의 다면적 평가시스템의 적절성과 관련해, 시각화 부분에서는 상당 부분 정확도가 객관적으로 입증되었다고 판단된다. 음성에서 감정분석 분야는 면접자가 제한된 시간에 모든 유형의 감정을 드러내지 않고, 또 유사한 톤의 말이 진행되다 보니 특정 감정을 나타내는 주파수가 다소 집중되는 현상이 나타났다. 마지막으로 자연어처리 영역은 면접자의 발언에서 나오는 말투, 특정 단어의 빈도수를 넘어, 전체적인 맥락과 느낌을 이해할 수 있는 자연어처리 분석모델의 필요성이 더욱 커졌음을 판단했다.

  • PDF

Analysis of Success Cases of InsurTech and Digital Insurance Platform Based on Artificial Intelligence Technologies: Focused on Ping An Insurance Group Ltd. in China (인공지능 기술 기반 인슈어테크와 디지털보험플랫폼 성공사례 분석: 중국 평안보험그룹을 중심으로)

  • Lee, JaeWon;Oh, SangJin
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.71-90
    • /
    • 2020
  • Recently, the global insurance industry is rapidly developing digital transformation through the use of artificial intelligence technologies such as machine learning, natural language processing, and deep learning. As a result, more and more foreign insurers have achieved the success of artificial intelligence technology-based InsurTech and platform business, and Ping An Insurance Group Ltd., China's largest private company, is leading China's global fourth industrial revolution with remarkable achievements in InsurTech and Digital Platform as a result of its constant innovation, using 'finance and technology' and 'finance and ecosystem' as keywords for companies. In response, this study analyzed the InsurTech and platform business activities of Ping An Insurance Group Ltd. through the ser-M analysis model to provide strategic implications for revitalizing AI technology-based businesses of domestic insurers. The ser-M analysis model has been studied so that the vision and leadership of the CEO, the historical environment of the enterprise, the utilization of various resources, and the unique mechanism relationships can be interpreted in an integrated manner as a frame that can be interpreted in terms of the subject, environment, resource and mechanism. As a result of the case analysis, Ping An Insurance Group Ltd. has achieved cost reduction and customer service development by digitally innovating its entire business area such as sales, underwriting, claims, and loan service by utilizing core artificial intelligence technologies such as facial, voice, and facial expression recognition. In addition, "online data in China" and "the vast offline data and insights accumulated by the company" were combined with new technologies such as artificial intelligence and big data analysis to build a digital platform that integrates financial services and digital service businesses. Ping An Insurance Group Ltd. challenged constant innovation, and as of 2019, sales reached $155 billion, ranking seventh among all companies in the Global 2000 rankings selected by Forbes Magazine. Analyzing the background of the success of Ping An Insurance Group Ltd. from the perspective of ser-M, founder Mammingz quickly captured the development of digital technology, market competition and changes in population structure in the era of the fourth industrial revolution, and established a new vision and displayed an agile leadership of digital technology-focused. Based on the strong leadership led by the founder in response to environmental changes, the company has successfully led InsurTech and Platform Business through innovation of internal resources such as investment in artificial intelligence technology, securing excellent professionals, and strengthening big data capabilities, combining external absorption capabilities, and strategic alliances among various industries. Through this success story analysis of Ping An Insurance Group Ltd., the following implications can be given to domestic insurance companies that are preparing for digital transformation. First, CEOs of domestic companies also need to recognize the paradigm shift in industry due to the change in digital technology and quickly arm themselves with digital technology-oriented leadership to spearhead the digital transformation of enterprises. Second, the Korean government should urgently overhaul related laws and systems to further promote the use of data between different industries and provide drastic support such as deregulation, tax benefits and platform provision to help the domestic insurance industry secure global competitiveness. Third, Korean companies also need to make bolder investments in the development of artificial intelligence technology so that systematic securing of internal and external data, training of technical personnel, and patent applications can be expanded, and digital platforms should be quickly established so that diverse customer experiences can be integrated through learned artificial intelligence technology. Finally, since there may be limitations to generalization through a single case of an overseas insurance company, I hope that in the future, more extensive research will be conducted on various management strategies related to artificial intelligence technology by analyzing cases of multiple industries or multiple companies or conducting empirical research.

A Study on Correlations Among Cognitive Functions, Neurobehavioral Symptoms and Daily Living Functions in Patients with Non-Traumatic Subcortical Cerebrovascular Disease (비외상성 피질하 뇌혈관 질환 환자에서 인지기능, 정신행동 증상 및 일상 생활 기능간의 상관에 대한 연구)

  • Lee, Young-Ho;Park, Young-Soo;Choi, Hong;Choi, Young-Hee;Ko, Dae-Kwan;Chung, Young-Cho;Park, Byoung-Kwan;Kim, Soo-Ji;Chung, Suk-Hai;Ko, Byoung-Hee;Song, Il-Byoung;Park, Kun-Woo;Lee, Dae-Hee
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.4 no.2
    • /
    • pp.170-181
    • /
    • 1996
  • Objective : This study was tried to investigate the specific relationships among cognitve function, neurbehavioral symptoms, and daily living functions, as well as provide the guidline of more proper clinical approches for patients with subcortical cerebrovascular disease. Objects and Methods Subjects were 85 patients whose diagnosis was confirmed by brain CT or MRI and controls were 195 normal persons matched by educational level with the subjects. The cognitive functions were evaluated by BNA(Benton neuropsychiatric assessment), subjective neurobehavioral symptoms by SCL-90-R(Sympton Check List-90-Revised), objective neurobehavioral symptoms by NRS(Neurobehavioral Rating Scale), and daily living function symptoms by NRS(Neurobehavioral Rating Scale), and daily living function by GERRI(Geriatric Evaluation by Relative's Rating Instrument) and IADL(Instrumental Activities of Daily Living Scale). Results: 1) Subjects showed significantly lower cognitive functions than controls in all tests of BNA except Lt-Rt Orientation Test(p=0.09) and facial Recognition Test(p=0.186). 2) In subjective neurobehavioral symptoms, subjects showed significantly lower scores in all symptoms except anxiety(p=0.059), hostility(p=0.159), and phobic anxiety(p=0.849). But in objects neurobehavioral symptoms, subjects showed significantly higher in scores in psychoticism (p=0.000) and neuroticism(p=0.025) of NRS. 3) The score of social functioning of GERRI(p=0.000) and that of IADL(p=0.000) were significantly higher in subjects than in controls. 4) for correlation between cognitive and daily living functions, there were significant correlations between the scores of all items on BNA and the score of cognitive or social function of GERRI and the socre of MDL in corntrols, whereas in subjects, there were significant correlations only between the scores of BNA and the score of IADL. 5) for correlation between neuroehavioral symptoms and daily living functions, there were significant correlatons between the socre of subjective neurobehavioral symptoms and the scores of all subscales of GERRI and the score of MDL in controls. On the contrary, in subjects, there were significant correlations between the score of social function of GERRI and the score of objective neurobehavioral symptoms such as psychoticism, agitiation-hostility, and decrease d motivation-emotional withdrawl. Conclusion : Above results suggest that disturbances in specific function of brain may play a role as a predictor of impairments with specific daily living functions and also suggest that specific correlations among various functions may be useful as clinical parameters for setting of the treatment goal and for assessing the ongoing process in the treatment and rehavilitation of the patients with subcortical cerebrovascular disease.

  • PDF