• Title/Summary/Keyword: visual-audio


Metadata Design and Machine Learning-Based Automatic Indexing for Efficient Data Management of Image Archives of Local Governments in South Korea (국내 지자체 사진 기록물의 효율적 관리를 위한 메타데이터 설계 및 기계학습 기반 자동 인덱싱 방법 연구)

  • Kim, InA;Kang, Young-Sun;Lee, Kyu-Chul
    • Journal of Korean Society of Archives and Records Management / v.20 no.2 / pp.67-83 / 2020
  • Many local governments in Korea provide online services that give people easy access to audio-visual archives of local events. However, the way local governments currently manage these archives causes problems with compatibility across organizations and with search convenience, because standard metadata is lacking and image information is underused. To solve these problems, we propose a metadata design and a machine learning-based automatic indexing technique for the efficient management of the image archives of local governments in Korea. We design metadata items specialized for these image archives to improve compatibility, and we include elements that represent the basic information and characteristics of images, enabling efficient management. In addition, the text and objects in images, which carry information that reflects events and categories, are automatically indexed using machine learning, making search more convenient for users. Lastly, we developed a program that uses the proposed method to automatically extract text and objects from image archives and store the extracted contents and basic information in the metadata items we designed.

A Diagnostic Study of safety education in elementary schools based on PRECEDE Model (PRECEDE 모형을 이용한 일부 초등학교 안전교육의 진단적 연구)

  • 백경원;이명선
    • Korean Journal of Health Education and Promotion / v.18 no.1 / pp.35-47 / 2001
  • As our environment grows more complex with advances in industry and increases in vehicle traffic, accidents that cause injury are on the rise. Consequently, there is increasing emphasis on systematic and continual safety education for injury preventive behaviors. This study investigates safety-related problems of elementary school students based on the PRECEDE model proposed by Green et al. (1980), in order to comprehensively identify the requirements of school safety education. The identified requirements were used to diagnose the current state of elementary school safety education through the analysis of multidimensional factors. A questionnaire survey was conducted on 594 sixth-grade students from four randomly selected schools in Seoul to examine their injury preventive behaviors and to determine the educational diagnosis variables that affect them. The duration of the survey was 3 weeks, from April 12, 1999 to May 8, 1999. A summary of the survey results is presented below. 1. The situations in which accidents occurred were, in order of frequency, ‘during play or sports activities within the school grounds’ (59.6%), ‘during play on local streets’ (49.5%), and ‘traffic accidents’ (41.6%). 2. Categorization of the injury preventive behaviors showed that ‘not playing at high traffic flow locations such as streets and construction sites’ had the highest level of observance, while ‘wearing of helmets and joint protection devices during play’ was least observed. 3. Considering injury preventive behaviors in relation to the educational diagnosis variables, for predisposing factors, lower ‘perception to injury accidents’ (p<0.001) combined with higher ‘concerns for injury accidents’ (p<0.001), ‘practice of preventive behavior’ (p<0.001), and ‘level of safety knowledge’ (p<0.001) resulted in significantly higher observance of injury preventive behaviors. For enabling factors, higher ‘perceived level of the school safety education’ (p<0.001) and ‘availability of safety education resources’ (p<0.01) indicated significantly higher observance of injury preventive behaviors. For reinforcing factors, frequent exposure to ‘safety education brochure’ (p<0.01) and ‘audio-visual material for safety education’ (p<0.01), combined with more ‘regional safety education’ (p<0.01), ‘home safety education’ (p<0.01), ‘school safety education’ (p<0.001), and ‘parents’ observance of preventive behaviors’ (p<0.001), showed significantly higher observance of injury preventive behaviors. 4. An analysis of the factors that affect injury preventive behaviors showed that the enabling factor ‘awareness of school safety education’ had the highest correlation with injury preventive behaviors, followed, in order of significance, by ‘practice of preventive behavior’, ‘perception to injury accidents’, ‘level of safety knowledge’, ‘parents’ observance of preventive behaviors’, and ‘concerns for injury accidents’.


Manipulation of the Compressed Video for Multimedia Networking : A Bit rate Shaping of the Compressed Video (멀티미디어 네트워킹을 위한 압축 신호상에서 동영상 처리 : 압축 동영상 비트율 변환)

  • 황대환;조규섭;황수용
    • The Journal of Korean Institute of Communications and Information Sciences / v.26 no.11A / pp.1908-1924 / 2001
  • Interoperability and interworking across network and media environments built on different technologies are very important for widening access to services and making them more competitive. The ITU-T and advanced countries are planning for a GII that lets users access advanced global communication services supporting multimedia communication applications and embracing all modes of information. In this paper, we focus on the heterogeneity of end-user applications in multimedia networking. This heterogeneity has several technical aspects, such as different medium access methods and heterogeneous coding algorithms for audio-visual data. Among these, we address a bit rate shaping algorithm for compressed moving video. Video has previously been manipulated in the uncompressed signal domain; that is, the compressed video must first be converted to a linear PCM signal. This procedure requires decoding the video, manipulating it, and then encoding it into a compressed signal once again. This traditional approach has several critical weaknesses: implementation complexity, degraded image quality, and a large processing delay. The bit rate shaping algorithm proposed in this paper manipulates moving video entirely in the compressed domain to overcome these weaknesses. With this algorithm, we achieved efficient video bit rate shaping, and software simulation results show that the method has a significant advantage over pixel-oriented algorithms.


A Design of Mobile e-Book Viewer interface for the Reading Disabled People (독서장애인용 모바일 전자책뷰어 인터페이스 설계)

  • Lee, KyungHee;Kim, TaeEun;Lee, Jongwoo;Lim, Soon-Bum
    • Journal of Korea Multimedia Society / v.16 no.1 / pp.100-107 / 2013
  • With the recent rapid growth of the eBook market, various eBook viewer solutions such as hardware viewers and software readers have come to market. However, mobile eBook interfaces are hard to find for reading-disabled people, who have difficulty reading because of visual impairment, learning disabilities, or dyslexia. An eBook viewer interface for reading-disabled people should be carefully and distinctively designed, because they cannot use standard eBook viewers. In this paper, we propose an eBook viewer interface model that helps reading-disabled people read eBooks easily. Our model provides an adaptive interface for each type of reading-disabled user: the fully blind, the nearly blind, and those with learning disabilities only. In addition, unlike existing simple audio books, we support annotation so that reading-disabled people can interact with the eBook viewer. To show the effectiveness of our model, we implemented an eBook viewer prototype on an Android-based mobile device. We are confident that our model and implementation can help reading-disabled people, who make up 10% of the domestic population, read eBooks effectively.

Dual CNN Structured Sound Event Detection Algorithm Based on Real Life Acoustic Dataset (실생활 음향 데이터 기반 이중 CNN 구조를 특징으로 하는 음향 이벤트 인식 알고리즘)

  • Suh, Sangwon;Lim, Wootaek;Jeong, Youngho;Lee, Taejin;Kim, Hui Yong
    • Journal of Broadcast Engineering / v.23 no.6 / pp.855-865 / 2018
  • Sound event detection is a research area that models human auditory cognition by recognizing events in an environment with multiple acoustic events and determining the onset and offset time of each event. DCASE, a research community on acoustic scene classification and sound event detection, holds challenges to encourage researcher participation and to stimulate sound event detection research. However, the dataset provided by the DCASE Challenge is relatively small compared to ImageNet, a representative dataset for visual object recognition, and few open acoustic datasets are available. In this study, sound events that can occur indoors and outdoors were collected on a larger scale and annotated to build a dataset. Furthermore, to improve the performance of the sound event detection task, we developed a dual CNN structured sound event detection system by adding a supplementary neural network to a convolutional neural network to determine the presence of sound events. Finally, we conducted a comparative experiment against the baseline systems of both DCASE 2016 and 2017.
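
The task described in the abstract above, deciding when each event starts and stops, can be illustrated with a minimal Python sketch. The abstract does not say how the outputs of the two networks are combined, so the gating rule here is an assumption: a frame counts as active only when both a per-class event score and a presence score pass a threshold. The function name `gate_and_segment`, the thresholds, and the hop size are hypothetical, and the score lists stand in for network outputs.

```python
def gate_and_segment(event_probs, presence_probs, hop_s=0.02,
                     event_thr=0.5, presence_thr=0.5):
    """Return (onset_s, offset_s) pairs for one event class.

    event_probs / presence_probs are per-frame scores in [0, 1].
    The AND-gating of the two scores is an illustrative assumption,
    not the combination rule used in the paper.
    """
    # A frame is active only if both scores pass their thresholds.
    active = [e >= event_thr and p >= presence_thr
              for e, p in zip(event_probs, presence_probs)]

    segments = []
    start = None
    for idx, is_on in enumerate(active):
        if is_on and start is None:
            start = idx                      # onset: first active frame
        elif not is_on and start is not None:
            segments.append((start * hop_s, idx * hop_s))  # offset
            start = None
    if start is not None:                    # event still active at the end
        segments.append((start * hop_s, len(active) * hop_s))
    return segments
```

Each returned pair is an onset and offset time in seconds, obtained by multiplying frame indices by the assumed hop size.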

A Study on Description about Archival Materials in Film Archives (영화 기록의 기술에 관한 연구)

  • Kim, Jin Sung
    • The Korean Journal of Archival Studies / no.30 / pp.89-123 / 2011
  • Archival materials in film archives are human memories and archival documents generated from human cultural activities, and they provide relevant information over the long term. However, they differ from general public audio-visual records, because their main purpose is to represent culture through content created in the private sector rather than to serve as evidence of the facts of public service activities. Description principles and rules should therefore be determined so as to reflect their specific physical and intellectual characteristics. Controlling description requires textual standards grounded in specific purposes and rules, so this study analyzed international description standards such as Dublin Core, ISAD(G), and the FIAF Cataloguing Rules for Film Archives. The analysis showed that describing archival materials in film archives more effectively requires significant modifications to the organization of the areas and elements. This study argues, first, that the concept and the reality (work/item) of archival materials in film archives should be distinguished; second, that their content, context, and structure need to be understood and indicated; and third, that areas and elements reflecting their characteristics should be established. The final proposal organizes the 6th and 8th areas and the 22nd and 25th elements separately, in two parts. This proposal was not prepared with reference to the status or policy of any particular film archive, so each film archive can set specific elements or sub-elements accordingly.

Effect of Multimodal cues on Tactile Mental Imagery and Attitude-Purchase Intention Towards the Product (다중 감각 단서가 촉각적 심상과 제품에 대한 태도-구매 의사에 미치는 영향)

  • Lee, Yea Jin;Han, Kwanghee
    • Science of Emotion and Sensibility / v.24 no.3 / pp.41-60 / 2021
  • The purpose of this research was to determine whether multimodal cues in an online shopping environment could enhance consumers' tactile mental imagery, purchase intentions, and attitudes towards an apparel product. One limitation of online retail is that consumers are unable to physically touch the items. Because tactile information plays an important role in consumer decisions, especially for apparel products, this study investigated the effects of multimodal cues on overcoming the lack of tactile stimuli. In experiment 1, participants were randomly assigned to one of four conditions for exploring the product: picture only, video without sound, video with corresponding sound, and video with discordant sound. Tactile mental imagery vividness, ease of imagination, attitude, and purchase intentions were then measured. The video with discordant sound had the lowest average scores on all dependent variables. Experiment 2 used a within-participants design in which all participants explored the same product in the four conditions in a random order. They were told that they were visiting four different brands on a price comparison web site. After the same variables as in experiment 1, along with the need for touch, were measured, the repeated measures ANCOVA results revealed that, compared to the other conditions, the video with the corresponding sound significantly enhanced tactile mental imagery vividness, attitude, and purchase intentions. However, the discordant condition produced significantly lower attitudes and purchase intentions. The dual mediation analysis also revealed that the multimodal cue conditions significantly predicted attitudes and purchase intentions through the sequential mediation of imagery vividness and ease of imagination. In sum, vivid tactile mental imagery triggered by audio-visual stimuli could have a positive effect on consumer decision making by making it easier to imagine a situation in which consumers could touch and use the product.

Understanding Purposes and Functions of Students' Drawing while on Geological Field Trips and during Modeling-Based Learning Cycle (야외지질답사 및 모델링 기반 순환 학습에서 학생들이 그린 그림의 목적과 기능에 대한 이해)

  • Choi, Yoon-Sung
    • Journal of the Korean earth science society / v.42 no.1 / pp.88-101 / 2021
  • The purpose of this study was to qualitatively examine the meaning of students' drawings in outdoor classes and modeling-based learning cycles. Ten students were observed in a gifted education center in Seoul. Under the theme of the Hantan River, three outdoor classes and three modeling activities were conducted. Data were collected to document all student activities during field trips and classroom modeling activities using simultaneous video and audio recording and observation notes made by the researcher and students. Drawing types were classified by modifying the representations in a learning context in geological field trips (Hatisaru, 2020). We used deductive content analysis to describe the drawing characteristics, including students' writing. The results suggest that students have symbolic images that consist of geologic concepts, visual images that describe topographical features, and affective images that express students' emotional domains. The characteristics were classified into explanation, generality, elaboration, evidence, coherence, and state-of-mind. The characteristics and drawing types are consecutive in the modeling-based learning cycle and reflect the students' positive attitude and cognitive scientific domain. Drawing is a useful tool for reflecting students' thoughts and opinions in both outdoor classes and classroom modeling activities. This study provides implications for emphasizing the importance of drawing activities.

A Study on the Characteristics of Non-Fungible Token(NFT) and Application Plans from the Digital Records Perspective : Focused on Transferable Records (전자기록 관점에서 본 대체 불가능한 토큰(NFT) 특성 및 활용 방안 이전 및 거래 가능한 기록을 중심으로)

  • Won, Joo-hye;So, Hyeon-Gi;Oh, Hyo-Jung
    • The Korean Journal of Archival Studies / no.73 / pp.47-79 / 2022
  • An NFT is literally a 'non-fungible token', a digital file that records a specific virtual asset on a blockchain. Events such as ownership of the asset and its transaction history are recorded on the blockchain through token transactions, so counterfeiting and falsification are impossible. NFTs are therefore used as a tool that can uniquely represent a specific virtual asset. The main purpose of this paper is to examine the characteristics of NFTs from a records management point of view and to find ways to use them, focusing on digital records that have the characteristics of assets as digital works. To this end, we first examine the basic concept of NFTs and the principle of ownership and proof of value as an asset for digital works. We then reviewed NFT use cases in various fields to confirm how the advantages of NFTs have been applied, focusing in particular on areas related to audio-visual records such as art, music, sports, and fashion. Furthermore, by comparing the characteristics of digital records with those of NFTs, factors applicable to electronic records were identified. Finally, the types of digital records for which applying NFTs is expected to be effective were identified, and the possibility of their use and discussion points for introduction in records management are presented.

Development and validation of a Korean Affective Voice Database (한국형 감정 음성 데이터베이스 구축을 위한 타당도 연구)

  • Kim, Yeji;Song, Hyesun;Jeon, Yesol;Oh, Yoorim;Lee, Youngmee
    • Phonetics and Speech Sciences / v.14 no.3 / pp.77-86 / 2022
  • In this study, we report the validation results of the Korean Affective Voice Database (KAV DB), an affective voice database available for scientific and clinical use, comprising a total of 113 validated affective voice stimuli. The KAV DB includes audio recordings of two actors (one male and one female), each uttering 10 semantically neutral sentences with the intention of conveying six different affective states (happiness, anger, fear, sadness, surprise, and neutral). To validate the KAV DB, the database was organized into three separate voice stimulus sets. Participants rated the stimuli on six rating scales corresponding to the six targeted affective states by using a 100 horizontal visual analog scale. The KAV DB showed high internal consistency for voice stimuli (Cronbach's α=.847). The database had high sensitivity (mean=82.8%) and specificity (mean=83.8%). The KAV DB is expected to be useful for both academic research and clinical purposes in the field of communication disorders. The KAV DB is available for download at https://kav-db.notion.site/KAV-DB-7539a36abe2e414ebf4a50d80436b41a.
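
For readers unfamiliar with the statistics reported in the abstract above, the following Python sketch computes Cronbach's α and per-emotion sensitivity and specificity on toy data. The formulas are the standard ones. How the study mapped listener ratings to a perceived emotion is not stated in the abstract, so the `perceived` labels, the function names, and any sample data are assumptions for illustration only.

```python
from statistics import variance


def cronbach_alpha(ratings):
    """Cronbach's alpha for a table of ratings.

    ratings: list of rows, one row per rater, one column per item.
    Uses alpha = k/(k-1) * (1 - sum(item variances) / variance(row totals)).
    Assumes at least two items, two raters, and non-constant row totals.
    """
    k = len(ratings[0])                                # number of items
    item_vars = [variance(col) for col in zip(*ratings)]
    total_var = variance([sum(row) for row in ratings])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def sensitivity_specificity(intended, perceived, target):
    """Sensitivity and specificity for one target emotion.

    intended: emotion each stimulus was meant to convey.
    perceived: emotion assigned by a listener (hypothetical labels).
    Assumes the target occurs, and fails to occur, at least once in intended.
    """
    pairs = list(zip(intended, perceived))
    tp = sum(i == target and p == target for i, p in pairs)
    fn = sum(i == target and p != target for i, p in pairs)
    fp = sum(i != target and p == target for i, p in pairs)
    tn = sum(i != target and p != target for i, p in pairs)
    return tp / (tp + fn), tn / (tn + fp)
```

Sensitivity here is the share of stimuli intended as the target emotion that were perceived as such, and specificity is the share of other stimuli that were not mistaken for it.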