• Title/Summary/Keyword: 음향결합

Search Result 371, Processing Time 0.024 seconds

A Study on Gameplay Narrative of Contemporary Animated Films (당대 애니메이션 영화의 게임플레이성 서사에 관한 연구)

  • Qin, Jianbo
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.4
    • /
    • pp.83-91
    • /
    • 2019
  • Gameplay exists not only in game activities, but also in other art forms. An animated film with strong gameplay whose gameplay narrative follows the presupposition principle, achieved through unimaginable sound effects, exaggerated visual styling, hypothetical time and space concepts, fictional storylines and character performances. The strength or the presence or absence of gameplay in animated films is reflected in the grasp and use of these gameplay elements in the process of animated film narratives. The gameplay narrative of animated films fully utilizes the characteristics of gameplay thinking. In the process of scriptwriting and narrative, it combines entertainment, action, adventure, competition and other elements to fully reflect the characteristics of "gameplay". Gameplay narratives often use the strategies of reverse setting, repetition and interaction, and rules and challenges setting, which not only enhances the fun of animated film narratives, but also attracts the attention of the audience, which then can enhance the audience's viewing experience.

Multi-Emotion Regression Model for Recognizing Inherent Emotions in Speech Data (음성 데이터의 내재된 감정인식을 위한 다중 감정 회귀 모델)

  • Moung Ho Yi;Myung Jin Lim;Ju Hyun Shin
    • Smart Media Journal
    • /
    • v.12 no.9
    • /
    • pp.81-88
    • /
    • 2023
  • Recently, communication through online is increasing due to the spread of non-face-to-face services due to COVID-19. In non-face-to-face situations, the other person's opinions and emotions are recognized through modalities such as text, speech, and images. Currently, research on multimodal emotion recognition that combines various modalities is actively underway. Among them, emotion recognition using speech data is attracting attention as a means of understanding emotions through sound and language information, but most of the time, emotions are recognized using a single speech feature value. However, because a variety of emotions exist in a complex manner in a conversation, a method for recognizing multiple emotions is needed. Therefore, in this paper, we propose a multi-emotion regression model that extracts feature vectors after preprocessing speech data to recognize complex, inherent emotions and takes into account the passage of time.

Speech Recognition Using Linear Discriminant Analysis and Common Vector Extraction (선형 판별분석과 공통벡터 추출방법을 이용한 음성인식)

  • 남명우;노승용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.4
    • /
    • pp.35-41
    • /
    • 2001
  • This paper describes Linear Discriminant Analysis and common vector extraction for speech recognition. Voice signal contains psychological and physiological properties of the speaker as well as dialect differences, acoustical environment effects, and phase differences. For these reasons, the same word spelled out by different speakers can be very different heard. This property of speech signal make it very difficult to extract common properties in the same speech class (word or phoneme). Linear algebra method like BT (Karhunen-Loeve Transformation) is generally used for common properties extraction In the speech signals, but common vector extraction which is suggested by M. Bilginer et at. is used in this paper. The method of M. Bilginer et al. extracts the optimized common vector from the speech signals used for training. And it has 100% recognition accuracy in the trained data which is used for common vector extraction. In spite of these characteristics, the method has some drawback-we cannot use numbers of speech signal for training and the discriminant information among common vectors is not defined. This paper suggests advanced method which can reduce error rate by maximizing the discriminant information among common vectors. And novel method to normalize the size of common vector also added. The result shows improved performance of algorithm and better recognition accuracy of 2% than conventional method.

  • PDF

Implementation of Integrated Player System based on Free-Viewpoint Video Service according to User Selection (사용자 선택에 따른 자유 시점 비디오 서비스 기반의 통합 플레이어 시스템 구현)

  • Yang, Ji-hee;Song, Min-ki;Park, Gooman
    • Journal of Broadcast Engineering
    • /
    • v.25 no.2
    • /
    • pp.265-274
    • /
    • 2020
  • Free-viewpoint video service is a technology that allows users to watch at any angle, location and distance through interaction. In this paper, the free-viewpoint video services are defined in four viewing modes: Inward view, outward view, 3D object view and first person view. And we developed and implemented a new integrated program that plays all the suggested views. In the contents of girl band performances and basketball games, multi-view cameras suitable for each viewing mode are installed to acquire media, and data stored on the server is streamed over the network, making it available for viewing. Users can freely choose four viewing modes, space location, angle and so on, and the media data such as images and sounds are provided to them by rendering appropriately for the selected the viewpoint. Our system is expected to be a scalable free-viewpoint video service player as well as provide users with immersion and presence by combining various viewing modes.

Earthquake Wave Propagation Using Staggered-grid Finite-difference Method in the Model of the Antarctic Region (엇격자 유한차분법을 이용한 극지해역 지진파 모델링)

  • Oh, Ju-Won;Min, Dong-Joo;Lee, Ho-Yong;Park, Min-Kyu
    • Journal of the Korean earth science society
    • /
    • v.32 no.6
    • /
    • pp.640-653
    • /
    • 2011
  • We simulate the propagation of earthquake waves in the continental margin of Antarctica using the elastic wave modeling algorithm, which is modified to be suitable for acoustic-elastic coupled media and earthquake source. To simulate the various types of earthquake source, the staggered-grid finite-difference method, which is composed of velocity-stress formulae, can be more appropriate to use than the conventional, displacement-based, finite-difference method. We simulate the elastic wave propagation generated by earthquakes combining 3D staggered-grid finite-difference algorithm composed of displacement-velocity-stress formulae with double couple mechanisms for earthquake source. Through numerical tests for left-lateral strike-slip fault, normal fault and reverse fault, we could confirm that the first arrival of P waves at the surface is in a good agreement with the theoretically-predicted results based on the focal mechanism of an earthquake. Numerical results for a model made after the subduction zone in the continental margin of Antarctica showed that earthquake waves, generated by the reverse fault and propagating through the continental crust, the oceanic crust and the ocean, are accurately described.

MPEG-H 3D Audio Decoder Structure and Complexity Analysis (MPEG-H 3D 오디오 표준 복호화기 구조 및 연산량 분석)

  • Moon, Hyeongi;Park, Young-cheol;Lee, Yong Ju;Whang, Young-soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.2
    • /
    • pp.432-443
    • /
    • 2017
  • The primary goal of the MPEG-H 3D Audio standard is to provide immersive audio environments for high-resolution broadcasting services such as UHDTV. This standard incorporates a wide range of technologies such as encoding/decoding technology for multi-channel/object/scene-based signal, rendering technology for providing 3D audio in various playback environments, and post-processing technology. The reference software decoder of this standard is a structure combining several modules and can operate in various modes. Each module is composed of independent executable files and executed sequentially, real time decoding is impossible. In this paper, we make DLL library of the core decoder, format converter, object renderer, and binaural renderer of the standard and integrate them to enable frame-based decoding. In addition, by measuring the computation complexity of each mode of the MPEG-H 3D-Audio decoder, this paper also provides a reference for selecting the appropriate decoding mode for various hardware platforms. As a result of the computational complexity measurement, the low complexity profiles included in Korean broadcasting standard has a computation complexity of 2.8 times to 12.4 times that of the QMF synthesis operation in case of rendering as a channel signals, and it has a computation complexity of 4.1 times to 15.3 times of the QMF synthesis operation in case of rendering as a binaural signals.

Feedback Phenomenon in Technology Art (예술 공학의 피드백)

  • Kim Hyung-Gi
    • Science of Emotion and Sensibility
    • /
    • v.8 no.4
    • /
    • pp.423-433
    • /
    • 2005
  • The computer hardware development has provided many chances of emergence between art and technology. In many cases today's interactive artworks cannot be completed without audience's participation. The interactive production process with technical supplementation can be celled feedback. Mr. Nam Jun Paik showed 'Participant TV' that interacts with audience's response in real time. It means artwork changes with the constantly changing value from the data set from human visual perception. Dan Graham showed another feedback related work, which delays 5second playback in mirror that implies consequence of time. Today's media art has to sublimate coincidence, time ant audience into philosophical artwork through consonance that comes with video and sound as we can see from Bill Viola. Stelarc produced artworks. That use input data that is weak signals from brain, muscles. Through a terminal display with player, body expanded meaning of media. Jeffrey Shaw's 'Legible City' provided a fabrication of the reality with the interaction of bicycle's pedal speed and steering direction that is controlled by 4river. RE:MARK used microphone as input device as Edmond Couchot's 'Je same a la vent' and Nam Jun Paik's 'Participant TV' did. There is no communication without feedback between human being. The reality makes audience involved into artworks. That is the reason why feedback has to be natural. Through the feedback process, the originality of the idea is altered by audience. The feedback is not just part of flesh of artwork rather skeleton of it. Technological showoff cannot be art itself The perfection of technological application plan helps feedback that interacts with audience naturally in order that audience hoes not feel the feedback as artificial plan. Interactive media art has to be evolved into new media form with new integration feedback technology.

  • PDF

A comparison of Korean vowel formants in conditions of chanting and reading utterances (챈트 및 읽기 발화조건에 따른 한국어 모음 포먼트 비교)

  • Park, Jihye;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.85-94
    • /
    • 2020
  • Vowel articulation in subjects related to speech disorders seems to be difficult. A chant method that properly reflects the characteristics of language could be used as an effective way of addressing the difficulties. The purpose of this study was to find out whether the chant method is effective as a means of enhancing vowel articulation. The subjects of this study were 60 normal adults (30 males and 30 females) in their 20s and 30s whose native language is Korean. Eight utterance conditions including chanting and reading conditions were recorded and their acoustic data were analyzed. The results of the analysis of the acoustic variables related to the formant confirmed that the F1 and F2 values of the vowel formants are increased and the direction of movement of the center of gravity of the vowel triangle is statistically significantly forwarded and lowered in the chant method in both the word and the phrase context. The results also proved that accent is the most influential musical factor in chant. There was no significant difference between four repeated tokens, which increased the reliability of the results. In other words, chanting is an effective way to shift the center of gravity of the vowel triangle, which suggests that it can help to improve speech intelligibility by forming a desirable place for articulation.

Design of a multimedia interface for the description of human error caused nuclear power plant trips (인적오류로 인한 원자력발전소 고장정지 사건묘사를 위한 멀티미디어 인터페이스 설계)

  • 박근옥
    • Proceedings of the ESK Conference
    • /
    • 1993.10a
    • /
    • pp.65-75
    • /
    • 1993
  • 원자력발전소에서 발생하는 고장정지 사건에 내포된 인적오류의 발생내용을 흥미있고 현장감 있게 묘사시킴과 동시에 종사원들의 인적오류를 저감시키기 위한 새로운 교육훈련방 법으로써 멀티미디어 기술의 사용을 고려하였다. 즉, 컴퓨터 환경하에서 숫자와 텍스트, 음 향 및 음성, 그래픽, 애니메이션, 화상 등의 미디어를 사용하여 인적오류로 인한 고장정지 사건발생의 내용을 전달하는 새로운 교육훈련 방법을 사용하는 것이다. 이를 위하여 고장정 지 사건과 관련한 원자력발전소 구성정보와 종사원의 활동정보를 입출력하기 위한 멀티미디어 인터페이스를 설계하였다. 설계는 멀티미디어 제공환경 구축과 인터페이스 운영논리 설정의 두단계로 구성한 절차에 따라 수행하였다. 멀티미디어 인터페이스는 원자력발전소 구성정보를 설계하여 저장시켜둔 퍼스널 컴퓨터 환경하에서 운영되며, 입력 및 출력의 두가지 인터페이스 를 갖는다. 입력 인터페이스는 인적 오류를 포함한 고장정지 사건의 발생내용을 숫자, 텍스트, 음성 미디어로 받아들여 출력 인터페이스에서 사용될 수 있도록 미디어 결합을 수행하고, 컴퓨터 저장장치에 저장하는 기능을 수행한다. 출력 인터페이스는 저장장치에 기록된 내용을 판독하여 고장정지의 진행경위와 인적오류의 발생내용을 숫자, 문자, 텍스트, 음성, 음향, 그래픽 애니메 이션, 정지화상으로 원자력발전소 종사원들에게 출력시켜 주는 기능을 수행한다. 멀티미디어 출력정보는 사용자가 CRT 화면에 제공되는 버튼 또는 Click Touch 메뉴를 사용함에 따라 이전, 이후의 출력정보로 전이할 수 있도록 하였다.상대적으로 중요한 검사 항목으로 나타났다. 또한 상대적 중요도 결과를 적용한 적성검사 성적이 적용하지 않은 적성검사 성적에 비해 비행성적에 대한 예측 능력이 좋은 것으로 평가되었다.al age)가 있다는 것을 의미하는 것이다. 한편, 생산현장에서는 자동화, 기계화가 진보되어 육체적인 노동이 경감된 결과, 중고령자라도 할 수 있는 작업이 많아지고 있다. 또, VDT (Visual Dislay Terminal) 작업과 같은 정보처리 작업의 수요가 증가하여 그 인재의 부족이 지적되고 있다. 따라서 중고령자의 기능을 조사하여 어떠한 작업에 적합한가를 판단하는 것이 중요한 과제로 되었다. 그러나 노동에는 많은 기능이 관여 하고, 그 내용에 따라서 요구되는 기능이 서로 다르기 때문에 노동적응능력의 기본적인 기능으로 보여지는 것에 좁혀서 작업능력의 연령증가 변화에 대하여다원적 평가를 하는 것이 실제적이라고 할 수 있다. 따라서 본 연구에서는 인간이 가지고 있는 다수의 기능중에서 수지교 치성과 연령증가와의 관계를 조사한다. 만약 연령증가 만으로 수지교치성을 평가 할 수 없는 경우에는 어떠한 요인이 수지기민성의 변화에 영향을 미치는가를 검토한다.t list)에서 자동적으로 사건들의 순서가 결정되도록 확장하였으며, 설비 제어방식에 있어서도 FIFO, LIFO, 우선 순위 방식등을 선택할 수 있도록 확장하였다. SIMPLE는 자료구조 및 프로그램이 공개되어 있으므로 프로그래머가 원하는 기능을 쉽게 추가할 수 있는 장점도 있다. 아울러 SMPLE에서 새로이 추가된 자료구조와 함수 및 설비제어 방식등

  • PDF

UA Tree-based Reduction of Speech DB in a Large Corpus-based Korean TTS (대용량 한국어 TTS의 결정트리기반 음성 DB 감축 방안)

  • Lee, Jung-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.7
    • /
    • pp.91-98
    • /
    • 2010
  • Large corpus-based concatenating Text-to-Speech (TTS) systems can generate natural synthetic speech without additional signal processing. Because the improvements in the natualness, personality, speaking style, emotions of synthetic speech need the increase of the size of speech DB, it is necessary to prune the redundant speech segments in a large speech segment DB. In this paper, we propose a new method to construct a segmental speech DB for the Korean TTS system based on a clustering algorithm to downsize the segmental speech DB. For the performance test, the synthetic speech was generated using the Korean TTS system which consists of the language processing module, prosody processing module, segment selection module, speech concatenation module, and segmental speech DB. And MOS test was executed with the a set of synthetic speech generated with 4 different segmental speech DBs. We constructed 4 different segmental speech DB by combining CM1(or CM2) tree clustering method and full DB (or reduced DB). Experimental results show that the proposed method can reduce the size of speech DB by 23% and get high MOS in the perception test. Therefore the proposed method can be applied to make a small sized TTS.