• Title/Summary/Keyword: Audio Data

Search Result 883, Processing Time 0.026 seconds

Adaptive MAC Protocol for Low Latency in WMSN (WMSN 에서 낮은 지연을 위한 적응적 MAC 프로토콜)

  • Kim, Seong-Hun;Lee, Sung-Keun;Jung, Chang-Ryul;Koh, Jin-Gwang
    • Journal of Internet Computing and Services
    • /
    • v.10 no.2
    • /
    • pp.161-169
    • /
    • 2009
  • The development of Wireless Multimedia Sensor Networks(WMSNs) is getting realized in accordance with the increased demands to provide multimedia data transmission service such as CCTV movie, image chapter information, audio information based on wireless sensor network. It is very important to provide the differentiated quality assurance of service. In this paper, we propose a sensor medium access control protocol based on DSMAC, which provides differentiated Quality of Service(QoS) for delay. A proposed protocol is able to reduce the delay without increasing the energy consumption by adaptively changing the duty cycle according to the buffer occupancy. Simulation results show that the new MAC protocol performs better in terms of latency than S-MAC and DSMAC.

  • PDF

Automatic Detection of Pig Wasting Diseases Using Audio and Video Data (소리와 영상 정보를 이용한 돼지 호흡기 질병 탐지)

  • Kim, Heegon;Sa, Jaewon;Lee, Jonguk;Chung, Yongwha;Park, Daihee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.10a
    • /
    • pp.1431-1434
    • /
    • 2015
  • 24시간 모니터링 환경에서 돈사 내 개별 돼지들의 상태를 자동으로 탐지하는 연구는 효율적인 돈사 관리 측면에서 중요한 이슈로 떠오르고 있다. 특히 돼지 호흡기 질병은 전염성이 매우 강하여, 막대한 경제적 손실을 최소화하기 위해서는 조기에 탐지하는 것이 매우 중요하다. 본 논문에서는 마이크를 통한 소리 정보뿐 아니라 카메라를 통한 영상 정보를 동시에 활용하여 호흡기 질병에 걸린 개별 돼지를 조기에 탐지하는 방법을 제안한다. 즉, 돈사의 천장에 설치된 마이크로부터 호흡기 질병에 걸린 소리 정보를 먼저 탐지한 후 카메라로부터 획득된 영상 정보의 MHI 분석을 수행하여 호흡기 질병에 걸린 돼지를 특정한다. 실험결과, 소리와 영상 정보를 동시에 활용하는 제안 방법을 이용하여 호흡기 질병에 걸린 돼지를 특정할 수 있음을 확인하였다.

Experiences of hospitalization among pregnant women with preterm labor in Korea: a phenomenological study

  • Lee, Joon-Young;Song, Yeoungsuk
    • Women's Health Nursing
    • /
    • v.27 no.3
    • /
    • pp.209-219
    • /
    • 2021
  • Purpose: The purpose of this study was to describe pregnant women's lived experiences of hospitalization due to preterm labor in Korea. Methods: This qualitative study adopted a phenomenological approach. Individual in-depth interviews were conducted with nine participants, over the age of 20 years, who had been hospitalized for more than 1 week after being diagnosed with preterm labor. All interviews were audio-taped and verbatim transcripts were made for analysis. The data were analyzed following Colaizzi's phenomenological method. Results: The participants' ages ranged from 26 to 36 years, and all were married women. They were hospitalized for 13.1 days on average. Five thematic clusters emerged from the analysis. 'Withstanding hospitalization for the fetus's well-being' describes women's feelings during preterm labor and their endurance during their prolonged hospitalization, rooted in their conviction that the fetus comes first. 'Endless frustration in the hospital' encompasses women's emotions while lying in bed and quietly thinking to themselves. 'Unmet physiological needs' describes participants' awareness of their inability to independently handle human physiological needs given the need for careful and limited movement. 'Gratitude for the support around oneself' reflects the support from family and medical staff. 'Shifting perceptions and accepting one's circumstances' describes accepting hospitalization and making efforts to spend their remaining time in the hospital in a meaningful way. Conclusion: The findings in this study provide a deeper understanding and insights into the experiences of Korean women with preterm labor during hospitalization, underscoring the need to develop interventions for these patients.

Implementation of Low Complexity FFT, ADC and DAC Blocks of an OFDM Transmitter Receiver Using Verilog

  • Joshi, Alok;Gupta, Dewansh Aditya;Jaipuriyar, Pravriti
    • Journal of Information Processing Systems
    • /
    • v.15 no.3
    • /
    • pp.670-681
    • /
    • 2019
  • Orthogonal frequency division multiplexing (OFDM) is a system which is used to encode data using multiple carriers instead of the traditional single carrier system. This method improves the spectral efficiency (optimum use of bandwidth). It also lessens the effect of fading and intersymbol interference (ISI). In 1995, digital audio broadcast (DAB) adopted OFDM as the first standard using OFDM. Later in 1997, it was adopted for digital video broadcast (DVB). Currently, it has been adopted for WiMAX and LTE standards. In this project, a Verilog design is employed to implement an OFDM transmitter (DAC block) and receiver (FFT and ADC block). Generally, OFDM uses FFT and IFFT for modulation and demodulation. In this paper, 16-point FFT decimation-in-frequency (DIF) with the radix-2 algorithm and direct summation method have been analyzed. ADC and DAC in OFDM are used for conversion of the signal from analog to digital or vice-versa has also been analyzed. All the designs are simulated using Verilog on ModelSim simulator. The result generated from the FFT block after Verilog simulation has also been verified with MATLAB.

The Influence of Topic Exploration and Topic Relevance On Amplitudes of Endogenous ERP Components in Real-Time Video Watching (실시간 동영상 시청시 주제탐색조건과 주제관련성이 내재적 유발전위 활성에 미치는 영향)

  • Kim, Yong Ho;Kim, Hyun Hee
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.8
    • /
    • pp.874-886
    • /
    • 2019
  • To delve into the semantic gap problem of the automatic video summarization, we focused on an endogenous ERP responses at around 400ms and 600ms after the on-set of audio-visual stimulus. Our experiment included two factors: the topic exploration of experimental conditions (Topic Given vs. Topic Exploring) as a between-subject factor and the topic relevance of the shots (Topic-Relevant vs. Topic-Irrelevant) as a within-subject factor. For the Topic Given condition of 22 subjects, 6 short historical documentaries were shown with their video titles and written summaries, while in the Topic Exploring condition of 25 subjects, they were asked instead to explore topics of the same videos with no given information. EEG data were gathered while they were watching videos in real time. It was hypothesized that the cognitive activities to explore topics of videos while watching individual shots increase the amplitude of endogenous ERP at around 600 ms after the onset of topic relevant shots. The amplitude of endogenous ERP at around 400ms after the onset of topic-irrelevant shots was hypothesized to be lower in the Topic Given condition than that in the Topic Exploring condition. The repeated measure MANOVA test revealed that two hypotheses were acceptable.

Perceived Auditory Feedback and User Experience in Mobile Game: A Mediation Analysis of Enjoyment (모바일 게임 속 청각적 피드백 인지와 사용자 경험: 재미의 매개효과 분석)

  • Ahn, Jisoo;Heo, Ji-Yeon;Noh, Ghee Young
    • Journal of Korea Game Society
    • /
    • v.19 no.2
    • /
    • pp.135-144
    • /
    • 2019
  • This study examined the role of enjoyment in the relationship between perceived auditory feedback (PAF), which provides information about the situation during game play, and user experience factors. 100 undergraduates played a mobile game, 'Classy Royale' and took a survey about their user experience. As a result of analyzing the available data from 98 participants, PROCESS MACRO showed that PAF was positively associated with enjoyment, immersion, and intention to use; enjoyment mediated the effects of PAF on immersion and intention. These results can help game research and development regarding the functional value of audio factors.

Incomplete Cholesky Decomposition based Kernel Cross Modal Factor Analysis for Audiovisual Continuous Dimensional Emotion Recognition

  • Li, Xia;Lu, Guanming;Yan, Jingjie;Li, Haibo;Zhang, Zhengyan;Sun, Ning;Xie, Shipeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.2
    • /
    • pp.810-831
    • /
    • 2019
  • Recently, continuous dimensional emotion recognition from audiovisual clues has attracted increasing attention in both theory and in practice. The large amount of data involved in the recognition processing decreases the efficiency of most bimodal information fusion algorithms. A novel algorithm, namely the incomplete Cholesky decomposition based kernel cross factor analysis (ICDKCFA), is presented and employed for continuous dimensional audiovisual emotion recognition, in this paper. After the ICDKCFA feature transformation, two basic fusion strategies, namely feature-level fusion and decision-level fusion, are explored to combine the transformed visual and audio features for emotion recognition. Finally, extensive experiments are conducted to evaluate the ICDKCFA approach on the AVEC 2016 Multimodal Affect Recognition Sub-Challenge dataset. The experimental results show that the ICDKCFA method has a higher speed than the original kernel cross factor analysis with the comparable performance. Moreover, the ICDKCFA method achieves a better performance than other common information fusion methods, such as the Canonical correlation analysis, kernel canonical correlation analysis and cross-modal factor analysis based fusion methods.

CNN-based Visual/Auditory Feature Fusion Method with Frame Selection for Classifying Video Events

  • Choe, Giseok;Lee, Seungbin;Nang, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.3
    • /
    • pp.1689-1701
    • /
    • 2019
  • In recent years, personal videos have been shared online due to the popular uses of portable devices, such as smartphones and action cameras. A recent report predicted that 80% of the Internet traffic will be video content by the year 2021. Several studies have been conducted on the detection of main video events to manage a large scale of videos. These studies show fairly good performance in certain genres. However, the methods used in previous studies have difficulty in detecting events of personal video. This is because the characteristics and genres of personal videos vary widely. In a research, we found that adding a dataset with the right perspective in the study improved performance. It has also been shown that performance improves depending on how you extract keyframes from the video. we selected frame segments that can represent video considering the characteristics of this personal video. In each frame segment, object, location, food and audio features were extracted, and representative vectors were generated through a CNN-based recurrent model and a fusion module. The proposed method showed mAP 78.4% performance through experiments using LSVC data.

Fillers in the Hong Kong Corpus of Spoken English (HKCSE)

  • Seto, Andy
    • Asia Pacific Journal of Corpus Research
    • /
    • v.2 no.1
    • /
    • pp.13-22
    • /
    • 2021
  • The present study employed an analytical framework that is characterised by a synthesis of quantitative and qualitative analyses with a specially designed computer software SpeechActConc to examine speech acts in business communication. The naturally occurring data from the audio recordings and the prosodic transcriptions of the business sub-corpora of the HKCSE (prosodic) are manually annotated with a speech act taxonomy for finding out the frequency of fillers, the co-occurring patterns of fillers with other speech acts, and the linguistic realisations of fillers. The discoursal function of fillers to sustain the discourse or to hold the floor has diverse linguistic realisations, ranging from a sound (e.g. 'uhuh') and a word (e.g. 'well') to sounds (e.g. 'um er') and words, namely phrase ('sort of') and clause (e.g. 'you know'). Some are even combinations of sound(s) and word(s) (e.g. 'and um', 'yes er um', 'sort of erm'). Among the top five frequent linguistic realisations of fillers, 'er' and 'um' are the most common ones found in all the six genres with relatively higher percentages of occurrence. The remaining more frequent realisations consist of clause ('you know'), word ('yeah') and sound ('erm'). These common forms are syntactically simpler than the less frequent realisations found in the genres. The co-occurring patterns of fillers and other speech acts are diverse. The more common co-occurring speech acts with fillers include informing and answering. The findings show that fillers are not only frequently used by speakers in spontaneous conversation but also mostly represented in sounds or non-linguistic realisations.

A study on Developmental and Constraint Factors of Sports Movie

  • MOON, Bo Ra;KIM, Hae Yu;SEO, Won Jae
    • Journal of Sport and Applied Science
    • /
    • v.5 no.2
    • /
    • pp.47-53
    • /
    • 2021
  • Purpose: The purpose of this study is to generate industrial insights for developing sports movie industry. Related studies are scare. Research design, data, and methodology: The study employed case study and selected typical case samples who seem informative about sport movie industry. The authors interviewed six experts who are experienced in sport movie and related academic sector. Interviews were audio-taped and the text were decoded by multiple reading. Through this process, the study categorized significant meanings and produced results. Interviewees reviewed again the final findings to confirm validity, which calls member check. Results: It was suggested that developmental factors of sports movie were the realization of realistic sports scenes by using technologies. Second, participants told that contents of sport movie need to reflect real stories and it should tell the stories. Regarding constraints of sport movies, movie producers feel difficulty to make scenes in that sports are inherently quickly performed. Another constraints are that production cost is expensive but audiences are barely attended. Conclusions: For promoting economic outcomes and developing sport movie industry, government needs to financially support related markets to support sport movie producers assisting them to concentrate their movie works. Future directions for related-studies were discussed.