• Title/Abstract/Keywords: audio database

Search results: 75 items (processing time: 0.024 s)

방송영상자료의 FRBR기반 서지구조모형에 관한 연구 (A Study on Modeling of Bibliographic Framework Based on FRBR for Television Program Materials)

  • 정진규
    • 한국문헌정보학회지 / Vol. 41, No. 1 / pp. 185-214 / 2007
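  • This study sought a way to bibliographically link television program materials that stand in particular relationship types, and to let users search effectively according to their search purpose, by restructuring the bibliographic metadata of television program materials into a multi-layer structure based on the FRBR reference model. A hierarchical metadata model for television program materials was designed on the basis of FRBR, an experimental system was built, and its practical applicability was examined through evaluation. The experimental system was benchmarked against the iMMix model of the Netherlands Institute for Sound and Vision (B&G), an FRBR-based audiovisual cataloging system, and its retrieval efficiency and system usefulness were evaluated in comparison with a conventional system built on a single-layer model.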
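
The layered structure described in this abstract can be sketched with the four FRBR Group 1 entities (Work, Expression, Manifestation, Item). This is only an illustration: the class fields, the sample program title, and the shelf identifiers are hypothetical and are not taken from the paper or from iMMix.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Item:
    shelf_id: str  # hypothetical identifier of one physical or digital copy


@dataclass
class Manifestation:
    carrier: str  # e.g. a tape format or a file format
    items: List[Item] = field(default_factory=list)


@dataclass
class Expression:
    version: str  # e.g. original broadcast, re-edited cut
    manifestations: List[Manifestation] = field(default_factory=list)


@dataclass
class Work:
    title: str
    expressions: List[Expression] = field(default_factory=list)


def items_of(work: Work) -> List[str]:
    """Walk the hierarchy top-down and collect every copy linked to one work."""
    return [
        item.shelf_id
        for expression in work.expressions
        for manifestation in expression.manifestations
        for item in manifestation.items
    ]


# Hypothetical sample record: one program with two versions.
documentary = Work(
    title="Sample Documentary",
    expressions=[
        Expression("original broadcast", [Manifestation("tape", [Item("T-001")])]),
        Expression(
            "re-edited cut",
            [Manifestation("file", [Item("F-001"), Item("F-002")])],
        ),
    ],
)
```

Because every copy hangs off a single work record, one search hit at the work level can lead to all related versions and carriers, which a single-layer record structure cannot express directly.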

의학교육에서 컴퓨터바탕검사와 문항은행 데이터베이스 구축 (Computer-Based Testing and Construction of an Item Bank Database for Medical Education in Korea)

  • 허선
    • 의학교육논단 / Vol. 16, No. 1 / pp. 11-15 / 2014
  • A number of medical schools in Korea have been using computer-based testing (CBT) to evaluate their students' scientific and/or clinical performance since the early 1990s. Introducing CBT to medical education has several advantages: first, presenting figures and audio-video files of clinical content is simple with CBT, making it possible to evaluate medical students' competency in navigating more realistic clinical situations at minimum cost; second, CBT enables automatic item analysis and score reporting. To establish CBT, an item bank with item parameters such as difficulty and discrimination must be constructed. To select more psychometrically sound items, the items must be analyzed according to item response theory. CBT has already been introduced in high-stakes tests such as the United States Medical Licensing Examination and the Medical Council of Canada Qualifying Examination. The National Health Personnel Examination Board in Korea is also planning to introduce a CBT-based version of the National Medical Examination soon. Thus all medical schools in Korea will need to introduce CBT and construct item banks to prepare their students for their licensing examinations and to measure the students' competency more accurately.
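
As a rough illustration of how difficulty and discrimination parameters are used, the sketch below assumes the two-parameter logistic (2PL) model from item response theory; the abstract does not say which model is intended. The item names and parameter values in `ITEM_BANK` are hypothetical.

```python
import math


def p_correct_2pl(theta, a, b):
    """Probability of a correct answer under the 2PL model.

    theta: examinee ability, a: discrimination, b: difficulty.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = p_correct_2pl(theta, a, b)
    return a * a * p * (1.0 - p)


# Hypothetical item bank: each item stores its estimated parameters.
ITEM_BANK = {
    "item_easy": {"a": 1.2, "b": -1.0},
    "item_medium": {"a": 0.8, "b": 0.0},
    "item_hard": {"a": 1.5, "b": 1.0},
}


def most_informative_item(theta, bank=ITEM_BANK):
    """Pick the item that measures an examinee of ability theta most precisely."""
    return max(bank, key=lambda name: item_information(theta, **bank[name]))
```

An item is most informative when its difficulty is close to the examinee's ability, and more so when its discrimination is high, which is why both parameters are stored in the bank.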

위치·음성인식된 장애인 스마트폰과 장애인 편의시설DB 연결 연구 (Research location & voice recognition disabled accessibility smartphones and database connection)

  • 양성용;박대우
    • 한국정보통신학회: Conference Proceedings / 한국정보통신학회 2013 Spring Conference / pp. 205-208 / 2013
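  • The number of registered persons with disabilities in Korea was about 2.5 million in 2011 and continues to rise. It will rise further once potential disabilities associated with population aging are taken into account. This study therefore considers how persons with disabilities can use facilities without inconvenience, and how a database of accessibility facilities can be built and used. As advances in information technology have increased smartphone use and converged technologies have matured, persons with disabilities can now obtain information easily. Accordingly, the accessibility facility DB is used to identify facilities available to users with disabilities, and location and voice recognition are used to provide the user's location information and accessibility facility information with little effort.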

XML Repository System Using DBMS and IRS

  • Kang, Hyung-Il;Yoo, Jae-Soo;Lee, Byoung-Yup
    • International Journal of Contents / Vol. 3, No. 3 / pp. 6-14 / 2007
  • In this paper, we design and implement an XML Repository System (XRS) that exploits the advantages of DBMSs and IRSs. Our scheme uses BRS to support full-text indexing and content-based queries efficiently, and ORACLE to store XML documents, multimedia data, DTDs, and structure information. We design databases to manage XML documents that include audio, video, and images as well as text. We employ the non-composition model when storing XML documents in ORACLE. We represent structural information as ETID (Element Type Id), SORD (Sibling ORDer), and SSORD (Same Sibling ORDer). ETID is a unique value assigned to each element of the DTD. SORD represents the order among sibling nodes, and SSORD represents the order among sibling nodes with the same element. To demonstrate the superiority of our XRS, we perform various experiments measuring document loading time, document extraction time, and content retrieval time. The experiments show that our XRS outperforms existing XML document management systems. We also show through performance experiments that it supports various types of queries.
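
The three structural labels defined in this abstract can be illustrated with a short sketch. It assumes ETIDs are handed out in first-seen order while walking the document, whereas the paper assigns them from the DTD; the function name `label_nodes` and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def label_nodes(xml_text):
    """Return (tag, ETID, SORD, SSORD) for every element, in document order."""
    root = ET.fromstring(xml_text)
    etids = {}  # element type -> ETID (assumption: numbered in first-seen order)
    rows = []

    def etid_for(tag):
        if tag not in etids:
            etids[tag] = len(etids) + 1
        return etids[tag]

    def visit(node, sord, ssord):
        rows.append((node.tag, etid_for(node.tag), sord, ssord))
        same_tag_count = defaultdict(int)
        for position, child in enumerate(node, start=1):
            same_tag_count[child.tag] += 1
            # SORD: position among all siblings.
            # SSORD: position among siblings that share the same tag.
            visit(child, position, same_tag_count[child.tag])

    visit(root, 1, 1)
    return rows


SAMPLE = "<doc><title/><para/><image/><para/></doc>"
```

For `SAMPLE`, the second `para` element is the fourth sibling overall (SORD 4) but the second `para` (SSORD 2), which is the distinction the two order values capture.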

Speaker-Dependent Emotion Recognition For Audio Document Indexing

  • Hung LE Xuan;QUENOT Georges;CASTELLI Eric
    • 대한전자공학회: Conference Proceedings / 대한전자공학회 2004 ICEIC The International Conference on Electronics Informations and Communications / pp. 92-96 / 2004
  • Emotion research is currently of great interest in speech processing and in human-machine interaction. In recent years, a growing body of research on emotion synthesis and emotion recognition has been developed for different purposes. Each approach uses its own methods and its own parameters measured on the speech signal. In this paper, we propose using a short-time parameter, MFCC (Mel-Frequency Cepstrum Coefficients), and a simple but efficient classification method, Vector Quantization (VQ), for speaker-dependent emotion recognition. Many other features (energy, pitch, zero crossing, phonetic rate, LPC, and their derivatives) are also tested and combined with the MFCC coefficients to find the best combination. Other models, GMM and HMM (discrete and continuous Hidden Markov Models), are studied as well, in the hope that continuous distributions and the temporal behaviour of this feature set will improve the quality of emotion recognition. The maximum accuracy in recognizing five different emotions exceeds $88\%$ using only MFCC coefficients with the VQ model. This simple but efficient approach gives a result considerably better than that obtained on the same database in a human evaluation by listening and judging, without permission to go back or to compare sentences [8], and it compares favorably with other approaches.
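
A minimal sketch of VQ-based classification follows. It assumes one codebook per emotion trained with plain k-means, which the abstract does not specify, and it uses toy 2-D vectors in place of real MFCC frames. All function names and labels are hypothetical.

```python
def sq_dist(u, v):
    """Squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def train_codebook(frames, size, iterations=10):
    """Build a codebook with plain k-means; needs len(frames) >= size.

    Codewords are initialised from evenly spaced training frames so the
    result is deterministic.
    """
    step = max(1, len(frames) // size)
    codebook = [list(frames[i * step]) for i in range(size)]
    for _ in range(iterations):
        clusters = [[] for _ in codebook]
        for frame in frames:
            idx = min(range(size), key=lambda i: sq_dist(codebook[i], frame))
            clusters[idx].append(frame)
        for i, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous codeword
                codebook[i] = [sum(col) / len(members) for col in zip(*members)]
    return codebook


def distortion(codebook, frames):
    """Average distance from each frame to its nearest codeword."""
    return sum(min(sq_dist(c, f) for c in codebook) for f in frames) / len(frames)


def classify(codebooks, frames):
    """Return the label whose codebook quantizes the utterance with least distortion."""
    return min(codebooks, key=lambda label: distortion(codebooks[label], frames))


# Toy stand-ins for per-emotion training frames.
TRAINING = {
    "calm": [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1]],
    "angry": [[5.0, 5.0], [5.1, 5.0], [5.0, 5.1], [5.1, 5.1]],
}
CODEBOOKS = {label: train_codebook(frames, size=2) for label, frames in TRAINING.items()}
```

In a speaker-dependent setting, the codebooks would be trained on recordings of a single speaker, so the distortion compares emotions rather than voices.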

악성종양환자와 정상인이 발성한 모음의 좁은대역 스펙트럼값의 상관계수와 절대차이합 비교 (A Comparative Study of Vowels Produced by Normal Subjects and Patients with Malignant Vocal Folds by Correlation Coefficient and Difference Sum of Narrow-band Spectra)

  • 양병곤;왕수건;조철우;김형순;김은지;권순복
    • 음성과학 / Vol. 10, No. 4 / pp. 189-200 / 2003
  • The objective of this study was to examine two new parameters for screening people with malignant vocal folds. The new parameters were the difference sums and Pearson correlation coefficients between adjacent pairs of intensity level matrices of narrow-band spectra. Audio files from the Korean Disordered Speech Database were analyzed with Praat, a speech analysis program, to obtain matrices of 400 intensity levels at 16 time points of each sustained vowel's spectra. We limited our study to 12 normal subjects and 20 patients with malignant vocal folds who recorded at least three Korean vowels in a soundproof booth at Busan National University Hospital. Results indicated that the average coefficients of the patients were much lower than those of the normal subjects, while the average difference sums of the patients were much higher than those of the normal subjects. We also found that the degree of malignancy of the vocal folds was related to the coefficients and sums. However, some subjects at the initial stages of cancerous vocal folds yielded coefficients and difference sums almost comparable to those of the normal speakers. Further studies on larger databases are desirable to set criteria or threshold levels for screening people with vocal fold diseases.
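
The two parameters can be sketched as follows. The sketch assumes each time point is a list of intensity levels, that "adjacent pairs" means consecutive time points, and that the difference sum is a sum of absolute differences (as the Korean title suggests). The toy spectra use 4 levels at 3 time points rather than 400 levels at 16.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length spectra.

    Undefined (division by zero) if either spectrum is constant.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def abs_difference_sum(x, y):
    """Sum of absolute intensity differences between two spectra."""
    return sum(abs(a - b) for a, b in zip(x, y))


def adjacent_pair_scores(spectra):
    """Average correlation and difference sum over consecutive time points."""
    pairs = list(zip(spectra, spectra[1:]))
    mean_r = sum(pearson(a, b) for a, b in pairs) / len(pairs)
    mean_d = sum(abs_difference_sum(a, b) for a, b in pairs) / len(pairs)
    return mean_r, mean_d


# Toy data: a steady vowel keeps the same spectrum, an unstable one flips shape.
STEADY = [[10.0, 20.0, 30.0, 20.0] for _ in range(3)]
UNSTABLE = [
    [10.0, 20.0, 30.0, 20.0],
    [30.0, 20.0, 10.0, 20.0],
    [10.0, 20.0, 30.0, 20.0],
]
```

A stable sustained vowel gives correlations near 1 and small difference sums, while an irregular voice lowers the correlation and raises the difference sum, matching the direction of the results reported above.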

Connection Management Scheme using Mobile Agent System

  • Lim, Hee-Kyoung;Bae, Sang-Hyun;Lee, Kwang-Ok
    • 통합자연과학논문집 / Vol. 11, No. 4 / pp. 192-196 / 2018
  • The mobile agent paradigm can be exploited in a variety of ways, ranging from low-level system administration tasks to middleware to user-level applications. Mobile agents can be useful in building middleware services such as active mail systems and distributed collaboration systems. An active mail message is a program that interacts with its recipient through a multimedia interface and adapts the interaction session to the recipient's responses. The mobile agent paradigm is well suited to this type of application, since an agent can carry a sender-defined session protocol along with the multimedia message. Mobile agents communicate via method invocation on virtual references. Agents can make synchronous, one-way, or future-reply invocations. Multicasting is possible, since agents can be aggregated hierarchically into groups. A simple checkpointing facility has also been implemented. Another proposed solution is to use multi-agent computer systems to access, filter, evaluate, and integrate this information. We present the overall architectural framework, our agent design commitments, and the agent architecture that enables the above characteristics. In addition, the information a mobile agent system needs, such as text, graphics, images, audio, and video, is held in a large-capacity multimedia database system. However, such systems have problems establishing connections over multiple subnetworks, such as the lack of end-to-end connections, transmission delay due to ATM address resolution, and the absence of QoS protocols. We propose a new connection management scheme to improve connection management in mobile agent systems.

Enhancing the Text Mining Process by Implementation of Average-Stochastic Gradient Descent Weight Dropped Long-Short Memory

  • Annaluri, Sreenivasa Rao;Attili, Venkata Ramana
    • International Journal of Computer Science & Network Security / Vol. 22, No. 7 / pp. 352-358 / 2022
  • Text mining is an important process for analyzing data collected from different sources such as videos, audio, and social media. Tools such as natural language processing (NLP) are used mostly in real-time applications. In earlier research, text mining approaches were implemented using long short-term memory (LSTM) networks. In this paper, text mining is performed using average-stochastic gradient descent weight-dropped (AWD)-LSTM techniques to obtain better accuracy and performance. The proposed model is demonstrated on Internet Movie Database (IMDB) reviews. The model was implemented in Python because of its adaptability and flexibility when dealing with massive data sets and databases. The results show that the proposed model (LSTM plus weight dropping plus embedding) achieved an accuracy of 88.36%, compared with 85.64% for the previous AWD-LSTM models. This is also better than the result obtained with the plain LSTM model (85.16% accuracy). Finally, the loss decreased from 0.341 to 0.299 with the proposed model.

ATM/B-ISDN 기반의 원격 의료정보 시스템을 위한 멀티미디어 데이터베이스 원격 접속기능 설계 및 구현 (A Design & Implementation of Remote Access Function for A Multimedia Database of The Tele-medical System Based on ATM/B-ISDN)

  • 김호철;김영탁
    • 한국멀티미디어학회논문지 / Vol. 1, No. 1 / pp. 98-108 / 1998
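  • A multimedia telemedicine information system must allow fast remote retrieval of patient medical information that is stored and managed in multimedia form. In addition, a multimedia DBMS must be used to manage the large volume of multimedia medical information efficiently, and remote retrieval must be implemented in a distributed processing environment. For remote DB retrieval in cases such as a multimedia telemedicine information system, where real-time transmission and connection management for each information type are required, a high-speed information network that provides individual connection setup and guarantees QoS (Quality of Service), such as Native ATM Service, is needed. If the API provided by a commercial DBMS is used for remote retrieval of the multimedia DB, the result is a DBMS-dependent system that supports only that DBMS, which makes it difficult to select a DBMS and build a DB suited to the size and characteristics of the hospital. Moreover, the TCP/IP socket-based transmission provided by commercial DBMSs makes it hard to manage separate connections and guarantee QoS for multimedia data with differing transmission characteristics. This paper therefore proposes a structure for a remote DBMS access function that uses the Native ATM API, which current commercial DBMSs do not provide, to implement remote multimedia DB access for a multimedia telemedicine information system. It then implements a remote retrieval function on top of this structure and analyzes its performance.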

H.264 압축과 SVDD를 이용한 영상 감시 시스템에서의 비정상 집단행동 탐지 (Abnormal Crowd Behavior Detection via H.264 Compression and SVDD in Video Surveillance System)

  • 오승근;이종욱;정용화;박대희
    • 정보보호학회논문지 / Vol. 21, No. 6 / pp. 183-190 / 2011
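  • Abnormal crowd behavior detection in a surveillance camera environment is the task of quickly and accurately detecting, in video arriving from surveillance cameras, situations in which multiple objects are in danger. This paper proposes a prototype system that uses motion vectors and SVDD to detect abnormal situations within a crowd in surveillance environments such as CCTV. The proposed system extracts and represents motion in the video from the motion vector information produced during H.264 compression, recasts the discrimination of abnormal crowd behavior as a practical one-class classification problem, and uses SVDD, a representative one-class SVM model, as the detector. Because it uses motion vectors obtained during H.264 compression, the system guarantees real-time operation, and thanks to SVDD's incremental update learning it can adapt actively to changes in the abnormal crowd behavior database. The performance of the proposed detection system is verified experimentally on the publicly available benchmark datasets PETS 2009 and UMN.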