• Title/Summary/Keyword: audio identification

A Study on Adaptive Information Hiding Technique for Copyright Protection of Digital Images (디지털 영상물의 저작권 보호를 위한 적응적 정보 은닉 기술에 관한 연구)

  • Park, Kang-Seo;Chung, Tae-Yun;Oh, Sang-Rok;Park, Sang-Hee
    • Proceedings of the KIEE Conference / 1998.07g / pp.2427-2429 / 1998
  • Digital watermarking is a technique that embeds an invisible signal into multimedia data such as audio, video, and images for copyright protection, including owner identification and copy control information. This paper proposes a new watermark embedding and extraction technique that extends the direct sequence spread spectrum technique. The proposed technique approximates the frequency components of pixels in the spatial domain using a Laplacian mask and embeds the watermark adaptively, taking the human visual system (HVS) into account to reduce image degradation. In the watermark extraction process, the proposed technique strengthens the high-frequency components of the image and extracts the watermark by demodulation. All of these processes are performed in the spatial domain to reduce processing time.

A Study on Watermark Technique for Copyright Protection of Digital Images (디지털 영상물의 저작권 보호를 위한 워터마크 기술에 관한 연구)

  • Hong, Min-Suk;Park, Kang-Seo;Chung, Tae-Yun;Shin, Joon-In;Park, Sang-Hui
    • Proceedings of the KIEE Conference / 1998.11b / pp.606-608 / 1998
  • Digital watermarking is a technique that embeds an invisible signal into multimedia data such as audio, video, and images for copyright protection, including owner identification and copy control information. This paper proposes a new watermark detection algorithm based on local masking cross-covariance between the watermarked signal and a pseudo-noise signal. The proposed algorithm increases the detection probability of the embedded information. Because it reduces detection errors for weakly embedded signals, the algorithm improves image quality and is robust both against illegal attacks that try to delete the embedded information and against data compression such as JPEG and MPEG.

Device Identification Based on Audio Source (음원을 이용한 기기판별)

  • Yi, Myeong-Hwan;Moon, Chang-Bae;Kim, Byeong-Man
    • Proceedings of the Korean Information Science Society Conference / 2012.06c / pp.224-226 / 2012
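  • With advances in IT and the rise of the information society, evidence and clues are increasingly stored on digital devices, not only in computer-related crimes but also in ordinary crimes. In this context, this paper proposes an effective method, as a digital forensics technique, for identifying the recording device from recorded data. Extracting the noise from the recorded data and exploiting the differences in that noise makes efficient device identification possible. In this paper, device noise is extracted with a Wiener filter, and features are extracted using MirToolBox. The extracted features were used with WEKA's multilayer neural network for training and identification. The identification results showed an average performance of 99.8%.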

Client-identified Significant Events and Interactional Process in Gestalt Therapy Using the Therapeutic Media (치료적 매체를 이용한 게슈탈트 심리치료에 나타난 중요사건 및 매체와의 상호작용과정)

  • Lee, Jeong-Sook
    • The Journal of the Korea Contents Association / v.20 no.11 / pp.472-491 / 2020
  • The purpose of this research is to examine the characteristics of client-identified significant events and the interactional process in Gestalt therapy using therapeutic media. The subjects of the study were 4 participants. To this end, an audio-taped session and an intensive interview were conducted. The collected data were analyzed by Comprehensive Process Analysis. The audio-taped session and interview data were first transcribed and then analyzed for client-identified significant events. A total of 74 pairs of conversations were analyzed to derive the interactional process within the significant event cases. In conclusion, the interactional process of significant events in Gestalt therapy was found to contribute to client change through interaction with the therapeutic media. Five major themes of interaction with the therapeutic media appeared: allowance, identification, recognition, acceptance, and prospective. Finally, the significance and limitations of the study are discussed, and directions for further study are suggested.

Identification of Design Attributes of the Affective Expressions for Movie Making (영화의 감성만족도 측정을 위한 시.청각적 영향 요인의 체계적 도출)

  • Kim, In-Ki;Kim, Ji-Ho;Chang, Woo-Jin;Lee, Cheol;Yun, Myung-Hwan
    • Proceedings of the HCI Society of Korea Conference (한국HCI학회:학술대회논문집) / 2007.02a / pp.143-149 / 2007
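  • Moving images elicit affective responses by combining dynamic visual imagery with sound. Films, which seek to maximize affective response through a variety of cinematic techniques, serve as a model for designing the audiovisual elements of video effectively from an affective standpoint. However, from the viewpoint of affective engineering, which models the results of affective evaluations of a product's design attributes, film is an experience good whose audiovisual stimuli are extremely varied and dynamic, which makes it difficult to model. As a preliminary step toward building an affective model of film, this study collected, organized, and screened the audiovisual factors of film through a literature survey, and sought to explore objectively and systematically which of these factors have a valid influence on the affective and cognitive responses of the audience. To this end, changes in affective and cognitive responses were measured through biosignals, while the audiovisual stimulus factors of the films used during biosignal measurement were quantified as continuous values by video/audio processing methods. By synchronizing the biosignals with the quantified audiovisual stimulus factors and analyzing them statistically, the study sought to verify the causal relationship between biosignal responses and audiovisual stimulus factors at a statistically reliable level. A frequency analysis of the statistically significant linear relationships among 896 partial linear regression models, with biosignals as dependent variables and audiovisual stimulus factors as independent variables, showed that the following factors had a statistically significant effect on changes in biosignals: among visual factors, brightness, contrast, color, motion, shot change rate, and the relative size of the main subject; among auditory factors, peak frequency, loudness at the peak frequency, average loudness, and sound-to-noise ratio. This suggests that these audiovisual stimulus factors in particular can significantly influence the affective and cognitive responses of the audience. On this basis, a statistical affective model of film is expected to be achievable, using the various combinations of these factors as explanatory variables.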

Abnormal Crowd Behavior Detection via H.264 Compression and SVDD in Video Surveillance System (H.264 압축과 SVDD를 이용한 영상 감시 시스템에서의 비정상 집단행동 탐지)

  • Oh, Seung-Geun;Lee, Jong-Uk;Chung, Yongw-Ha;Park, Dai-Hee
    • Journal of the Korea Institute of Information Security & Cryptology / v.21 no.6 / pp.183-190 / 2011
  • In this paper, we propose a prototype system for abnormal sound detection and identification that detects and recognizes abnormal situations by analyzing audio information arriving in real time from CCTV cameras in a surveillance environment. The proposed system is composed of two layers. The first layer is a one-class support vector machine, i.e., support vector data description (SVDD), that rapidly detects abnormal situations and alerts the manager. The second layer classifies the detected abnormal sound into a predefined class such as 'gun', 'scream', 'siren', 'crash', or 'bomb' via a sparse representation classifier (SRC) to cope with emergency situations. The proposed system is designed hierarchically as a combination of SVDD and SRC, which gives it the following desirable characteristics: 1) By quickly detecting abnormal sound with an SVDD trained only on normal sound, it avoids unnecessary classification of normal sound. 2) It ensures reliable system performance via an SRC, which has been applied successfully in the field of face recognition. 3) With the intrinsic incremental learning capability of SRC, it can actively adapt to changes in the sound database. The experimental results, together with a qualitative analysis, illustrate the efficiency of the proposed method.

Identification of Advantages and Disadvantages Relative to Competitors of Politicians According to the Narrative Styles by Applying Voice Analysis (음성 분석을 통한 정치인들의 화법에 따른 경쟁자들 간의 상대적인 유·불리 규명)

  • Choi, Ji Hyun;Cho, Dong Uk;Lee, Bum Joo;Kim, Chan Jung;Jeong, Yeon Man
    • The Journal of Korean Institute of Communications and Information Sciences / v.41 no.5 / pp.602-609 / 2016
  • In a smart society, politicians analyze voters' big data to build favorable political positions. In other words, the various digital footprints uploaded to SNS or the Internet are used to set election strategies and political directions. By comparison, it is difficult for voters to extract information about the intentions behind politicians' political acts. For two-way interaction between voters and politicians, voters therefore need a way to analyze politicians' intentions. To this end, this paper applies IT technologies to analyze the narrative styles of politicians who pursue relative advantages or gains over their competitors. Experiments are carried out on ordinary audio interviews of candidates expected to run in the next presidential election, to identify the relative advantages that their narrative styles give them over their competitors.

A Case Study on Closed Captions: Focusing on "Squid Game" on Netflix (넷플릭스 <오징어 게임> 폐쇄자막 연구)

  • Jeong, Sua;Lee, Jimin
    • The Journal of the Convergence on Culture Technology / v.10 no.2 / pp.279-285 / 2024
  • This study aims to evaluate the accuracy and completeness of the Korean and English closed captions for Netflix's "Squid Game" and to present implications based on the findings. To achieve this, the closed captioning guidelines of the U.S. Federal Communications Commission, DCMP, and the Korea Communications Commission were identified and analyzed. The analysis of the subtitles of the entire "Squid Game" series reveals that, while the Korean closed captions render slang and titles accurately, they present non-existent information in speaker identification. In the English closed captions, speaker identification guidelines are well followed, but omissions of slang and mistranslations of titles are observed. In terms of completeness, both the Korean and English closed captions are found to omit certain audio parts. To address these issues, the study suggests strengthening the QA process, establishing a system to communicate original text problems during translation, and utilizing general English subtitles.

The Effect of BTS Preference on Fandom Star & Fan Community Identification and Purchase Intention - Focused on Korean and Southeast Asian - (BTS의 선호요인이 팬덤 동일시욕구와 구매의도에 미치는 영향 - 한국 및 동남아 팬을 중심으로 -)

  • Kim, Yoon-Chul
    • Journal of Korea Entertainment Industry Association / v.14 no.2 / pp.1-14 / 2020
  • This study was motivated by an interest in identifying the characteristics of BTS preference in the expanded K-Pop market. A survey was conducted in Taiwan, Thailand, Vietnam, and Korea, where BTS is popular. The results show that respondents in Vietnam and Thailand have the most positive perceptions of most of the BTS preference factors, and that the factor with the greatest influence was the discriminative sense of BTS. BTS preference is an independent variable consisting of five factors: singers and music, a discriminative sense, global communication, meditative lyrics, and Korean sentiment. It was shown to have a statistically significant influence, at a high level, on identification with both the fandom star and the fan community. In particular, the desire for identification with the fandom star is strongly and positively affected by the discriminative sense and the meditative lyrics. The desire for identification with the fan community, in turn, is most strongly affected by the attraction of the singers and music. The study is meaningful in that it extended the range of survey participants to Southeast Asian and Korean fans and offers a new perspective on BTS preference. Nonetheless, its limitations include the failure to account for the various variables that may affect the fandom effect and purchase intention, and the lack of a survey of fans in the U.S. and Europe, which have more fans worldwide.

Hidden Indicator Based PIN-Entry Method Using Audio Signals

  • Seo, Hwajeong;Kim, Howon
    • Journal of information and communication convergence engineering / v.15 no.2 / pp.91-96 / 2017
  • PIN-entry interfaces are at high risk of leaking secret values if malicious attackers perform shoulder-surfing attacks with advanced monitoring and observation devices. To make PIN entry secure, many studies have considered invisible radio channels as a secure medium for delivering private information. However, these methods are also vulnerable if malicious adversaries can find a hint of the secret values in the user's naïve gestures. In this paper, we revisit the state-of-the-art radio channel based bimodal PIN-entry method and analyze the information leakage from the previous method by exploiting sight tracking attacks. The proposed sight tracking attack technique significantly reduces the original password complexities by 93.8% after post-processing. To keep the security level strong, we introduce an advanced bimodal PIN-entry technique. The new technique delivers the secret indicator information through a secure radio channel, and the smartphone screen displays only the multiple indicator options without the corresponding numbers. Afterwards, the users select the target value by following the circular layout. The method completely hides the password and is secure against advanced shoulder-surfing attacks.