• Title/Summary/Keyword: Audio Data

Search Result 883, Processing Time 0.031 seconds

Analysis of learning flow and learning satisfaction according to the non-face-to-face class operation method

  • You-Jung, Kim;Su-Jin, Won;Eun-Young, Choi
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.1
    • /
    • pp.195-202
    • /
    • 2023
  • This study is a comparative survey study conducted to explore the differences in learners' learning flow and learning satisfaction according to the non-face-to-face class operation methods implemented at universities. After implementing different class management methods for the same subject taught by the same instructor non-face-to-face for 15 weeks, each learning flow and learning satisfaction were compared and analyzed, and the collected data were analyzed with IBM SPSS 21.0. As a result of the study, learning flow was high in the order of lectures using real-time ZOOM and recorded lectures using self-studio(3.41±0.91, 3.28±1.01), and learning satisfaction was high in the order of lectures using real-time ZOOM and lectures using the automatic recording system of classes(3.40±0.80, 3.30±0.74). The item with the lowest score was the PPT audio recording lecture in both areas of learning flow and learning satisfaction(2.72±1.04, 1.73±1.04). Considering that system errors such as sound in the smart lecture environment operated for the first time in this study affected the research results, it is suggested that future research should be conducted by supplementing the corresponding part.

Diagnosing a Child with Autism using Artificial Intelligence

  • Alharbi, Abdulrahman;Alyami, Hadi;Alenzi, Saleh;Alharbi, Saud;bassfar, Zaid
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.6
    • /
    • pp.145-156
    • /
    • 2022
  • Children are the foundation and future of this society and understanding their impressions and behaviors is very important and the child's behavioral problems are a burden on the family and society as well as have a bad impact on the development of the child, and the early diagnosis of these problems helps to solve or mitigate them, and in this research project we aim to understand and know the behaviors of children, through artificial intelligence algorithms that helped solve many complex problems in an automated system, By using this technique to read and analyze the behaviors and feelings of the child by reading the features of the child's face, the movement of the child's body, the method of the child's session and nervous emotions, and by analyzing these factors we can predict the feelings and behaviors of children from grief, tension, happiness and anger as well as determine whether this child has the autism spectrum or not. The scarcity of studies and the privacy of data and its scarcity on these behaviors and feelings limited researchers in the process of analysis and training to the model presented in a set of images, videos and audio recordings that can be connected, this model results in understanding the feelings of children and their behaviors and helps doctors and specialists to understand and know these behaviors and feelings.

Development and Distribution of Deep Fake e-Learning Contents Videos Using Open-Source Tools

  • HO, Won;WOO, Ho-Sung;LEE, Dae-Hyun;KIM, Yong
    • Journal of Distribution Science
    • /
    • v.20 no.11
    • /
    • pp.121-129
    • /
    • 2022
  • Purpose: Artificial intelligence is widely used, particularly in the popular neural network theory called Deep learning. The improvement of computing speed and capability expedited the progress of Deep learning applications. The application of Deep learning in education has various effects and possibilities in creating and managing educational content and services that can replace human cognitive activity. Among Deep learning, Deep fake technology is used to combine and synchronize human faces with voices. This paper will show how to develop e-Learning content videos using those technologies and open-source tools. Research design, data, and methodology: This paper proposes 4 step development process, which is presented step by step on the Google Collab environment with source codes. This technology can produce various video styles. The advantage of this technology is that the characters of the video can be extended to any historical figures, celebrities, or even movie heroes producing immersive videos. Results: Prototypes for each case are also designed, developed, presented, and shared on YouTube for each specific case development. Conclusions: The method and process of creating e-learning video contents from the image, video, and audio files using Deep fake open-source technology was successfully implemented.

An Analysis on the Mathematical Creativity and Computational Thinking of Elementary School Mathematical Gifted Students in the Convergence Class Programs (융합 수업 프로그램에서 나타나는 초등 수학 영재들의 수학적 창의성과 컴퓨팅 사고 분석)

  • Kang, Joo Young;Kim, Dong Hwa;Seo, Hae Ae
    • East Asian mathematical journal
    • /
    • v.38 no.4
    • /
    • pp.463-496
    • /
    • 2022
  • The purpose of this study is to analyze the mathematical creativity and computational thinking of mathematically gifted elementary students through a convergence class using programming and to identify what it means to provide the convergence class using Python for the mathematical creativity and computational thinking of mathematically gifted elementary students. To this end, the content of the nine sessions of the Python-applied convergence programs were developed, exploratory and heuristic case study was conducted to observe and analyze the mathematical creativity and computational thinking of mathematically gifted elementary students. The subject of this study was a single group of sixteen students from the mathematics and science gifted class, and the content of the nine sessions of the Python convergence class was recorded on their tablets. Additional data was collected through audio recording, observation. In fact, in order to solve a given problem creatively, students not only naturally organized and formalized existing mathematical concepts, mathematical symbols, and programming instructions, but also showed divergent thinking to solve problems flexibly from various perspectives. In addition, students experienced abstraction, iterative thinking, and critical thinking through activities to remove unnecessary elements, extract key elements, analyze mathematical concepts, and decompose problems into small components, and math gifted students showed a sense of achievement and challenge.

Implementation and Evaluation of Harmful-Media Filtering Techniques using Multimodal-Information Extraction

  • Yeon-Ji, Lee;Ye-Sol, Oh;Na-Eun, Park;Il-Gu, Lee
    • Journal of information and communication convergence engineering
    • /
    • v.21 no.1
    • /
    • pp.75-81
    • /
    • 2023
  • Video platforms, including YouTube, have a structure in which the number of video views is directly related to the publisher's profits. Therefore, video publishers induce viewers by using provocative titles and thumbnails to garner more views. The conventional technique used to limit such harmful videos has low detection accuracy and relies on follow-up measures based on user reports. To address these problems, this study proposes a technique to improve the accuracy of filtering harmful media using thumbnails, titles, and audio data from videos. This study analyzed these three pieces of multimodal information; if the number of harmful determinations was greater than the set threshold, the video was deemed to be harmful, and its upload was restricted. The experimental results showed that the proposed multimodal information extraction technique used for harmfulvideo filtering achieved a 9% better performance than YouTube's Restricted Mode with regard to detection accuracy and a 41% better performance than the YouTube automation system.

Aural-visual two-stream based infant cry recognition (Aural-visual two-stream 기반의 아기 울음소리 식별)

  • Bo, Zhao;Lee, Jonguk;Atif, Othmane;Park, Daihee;Chung, Yongwha
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.354-357
    • /
    • 2021
  • Infants communicate their feelings and needs to the outside world through non-verbal methods such as crying and displaying diverse facial expressions. However, inexperienced parents tend to decode these non-verbal messages incorrectly and take inappropriate actions, which might affect the bonding they build with their babies and the cognitive development of the newborns. In this paper, we propose an aural-visual two-stream based infant cry recognition system to help parents comprehend the feelings and needs of crying babies. The proposed system first extracts the features from the pre-processed audio and video data by using the VGGish model and 3D-CNN model respectively, fuses the extracted features using a fully connected layer, and finally applies a SoftMax function to classify the fused features and recognize the corresponding type of cry. The experimental results show that the proposed system classification exceeds 0.92 in F1-score, which is 0.08 and 0.10 higher than the single-stream aural model and single-stream visual model.

Simple Image Stenography Technology for Large Scale Text (대용량 텍스트를 위한 손실 없는 영상 은닉기술)

  • Rhee, Keun-Moo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2008.05a
    • /
    • pp.1104-1107
    • /
    • 2008
  • These people where generally the image or the document nik technique silver document image, against the digital data of audio back all type the research is advanced being used with objective and the use which are various, is a d. Needs a low-end leveling instrument security text from the research which it sees and with substitution quantity the silver nik being simple it will be able to deliver the technique which is simple it embodied. It combined the text image first and the nose which is in the collar image of 24 bit depth which will reach ting it did and it rehabilitatedded and a higher officer technique and the result it used that the loss ratio of the text image to analyze is slight it was ascertained.

Understanding the Perspectives of Paediatric Physicians on Physiotherapy in Paediatric Rehabilitation in Chennai, India: A Qualitative Approach

  • Vadivelan Kanniappan;Abishek Jayapal Rajeswari;Pearlyn Esther Padma Lawrence;Subash Sundar
    • Journal of Preventive Medicine and Public Health
    • /
    • v.57 no.2
    • /
    • pp.157-166
    • /
    • 2024
  • Objectives: Children with disabilities may exhibit a multitude of symptoms, and treatment requires a multidisciplinary approach for a satisfactory outcome. Lack of awareness among physicians, lack of referral, and lack of inter-sectoral coordination have hindered paediatric practice in Tamil Nadu, a state in India with a striking childhood disability rate that warrants a timely interdisciplinary approach. However, the perspectives of paediatricians on paediatric physiotherapy are unknown. The aim of the study was to investigate the perspectives of practicing paediatric physicians in Chennai on the role of physiotherapy in paediatrics. Methods: For an in-depth exploration, qualitative semi-structured interviews were conducted in person with 10 paediatricians. Audio from the sessions was recorded and transcribed, and data saturation was achieved through iterative analysis. Results: A grounded theory analysis of the results yielded 5 domains under which the perspectives and expectations of the physicians were described, along with the barriers experienced by patients' parents as explained by their paediatrician. The responses highlighted deficits in awareness, structural support, accessibility and direct communication between physicians and physiotherapists. Conclusions: Paediatric physicians have different opinions, and some ignorance persists concerning paediatric physiotherapy. This study warrants a proper structure of the paediatric rehabilitation unit and regular interdisciplinary meetings and focus group discussions to increase access for parents and improve patient outcomes.

Music Transcription Using Non-Negative Matrix Factorization (비음수 행렬 분해 (NMF)를 이용한 악보 전사)

  • Park, Sang-Ha;Lee, Seok-Jin;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.2
    • /
    • pp.102-110
    • /
    • 2010
  • Music transcription is extracting pitch (the height of a musical note) and rhythm (the length of a musical note) information from audio file and making a music score. In this paper, we decomposed a waveform into frequency and rhythm components using Non-Negative Matrix Factorization (NMF) and Non-Negative Sparse coding (NNSC) which are often used for source separation and data clustering. And using the subharmonic summation method, fundamental frequency is calculated from the decomposed frequency components. Therefore, the accurate pitch of each score can be estimated. The proposed method successfully performed music transcription with its results superior to those of the conventional methods which used either NMF or NNSC.

Spoken-to-written text conversion for enhancement of Korean-English readability and machine translation

  • HyunJung Choi;Muyeol Choi;Seonhui Kim;Yohan Lim;Minkyu Lee;Seung Yun;Donghyun Kim;Sang Hun Kim
    • ETRI Journal
    • /
    • v.46 no.1
    • /
    • pp.127-136
    • /
    • 2024
  • The Korean language has written (formal) and spoken (phonetic) forms that differ in their application, which can lead to confusion, especially when dealing with numbers and embedded Western words and phrases. This fact makes it difficult to automate Korean speech recognition models due to the need for a complete transcription training dataset. Because such datasets are frequently constructed using broadcast audio and their accompanying transcriptions, they do not follow a discrete rule-based matching pattern. Furthermore, these mismatches are exacerbated over time due to changing tacit policies. To mitigate this problem, we introduce a data-driven Korean spoken-to-written transcription conversion technique that enhances the automatic conversion of numbers and Western phrases to improve automatic translation model performance.