• Title/Summary/Keyword: automatic voice system

Search Result 81, Processing Time 0.02 seconds

Fitness Measurement system using deep learning-based pose recognition (딥러닝 기반 포즈인식을 이용한 체력측정 시스템)

  • Kim, Hyeong-gyun;Hong, Ho-Pyo;Kim, Yong-ho
    • Journal of Digital Convergence
    • /
    • v.18 no.12
    • /
    • pp.97-103
    • /
    • 2020
  • The proposed system is composed of two parts, an AI physical fitness measurement part and an AI physical fitness management part. In the AI fitness measurement part, a guide to physical fitness measurement and accurate calculation of the measured value are performed through deep learning-based pose recognition. Based on these measurements, the AI fitness management part designs personalized exercise programs and provides them to dedicated smart applications. To guide the measurement posture, the posture of the subject to be measured is photographed through a webcam and the skeleton line is extracted. Next, the skeletal line of the learned preparation posture is compared with the extracted skeletal line to determine whether or not it is normal, and voice guidance is provided to maintain the normal posture.

Intrinsic Fundamental Frequency(Fo) of Vowels in the Esophageal Speech (식도음성의 고유기저주파수 발현 현상)

  • 홍기환;김성완;김현기
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.9 no.2
    • /
    • pp.142-146
    • /
    • 1998
  • Background : It has been established that the fundamental frequency(Fo) of the vowels varies systemically as a function of vowel height. Specifically, high vowels have a higher Fo than low vowels. Two major explanations or hypotheses dominate contemporary accounts of fired to explain the mechanisms underlying intrinsic variation in vowel Fo, source-tract coupling hypothesis and tongue-pull hypothesis. Objectives : Total laryngectomy surgery necessiates removal of all structures between the hyoid bone and the tracheal rings. Therefore, the assumption that no direct interconnection exists between the tongue and pharyngoesophageal segment that would mediate systematic variation in vowel Fo appears quite reasonable. If tongue-pull hypothesis is correct, systemic differences in Fo between high versus low vowels produced by esophageal speakers would not Or expected. We analyzed the Fo in the vowels of esophageal voice. Materials and method : The subjects were 11 cases of laryngectomee patients with fluent esophageal voice. The five essential vowels were recorded and analyzed with computer speech analysis system(Computerized Speech Lab). The Fo was measured using acoustic waveform, automatically and manually, and narrow band spectral analysis. Results : The results of this study reveal that intrinsic variation in vowel Fo is clearly evident in esophageal speech. By analysis using acoustic waveform automatically, the signals were too irregular to measure the Fo precisely. So the data from automatic analysis of acoustic waveform is not logical. But the Fo by measuring with manually calculated acoustic waveform or narrowband spectral analysis resulted in acceptable results. These results were interpreted to support neither the source-tract coupling nor the tongue-pull hypotheses and led us to offer an alternative explanation to account for intrinsic variation of Fo.

  • PDF

Efficient Design of a Disaster Broadcasting System using LTE Modem (이동 LTE모뎀을 활용한 재난방송시스템 설계)

  • Moon, Chaeyoung;Kim, Semin;Ryoo, Kwangki
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.292-294
    • /
    • 2018
  • Recently, damage caused by natural disasters such as fire, earthquake, heavy rains and heavy snow is increasing. In addition, traffic accidents due to freezing, fog and fire in tunnels and bridges are frequently occurring. In such a disaster situation, it is very important to take prompt action by the person in charge of managing the facility and area.To this end, a disaster broadcasting system is used, but in the existing system, the broadcasting room and the speaker are connected by a wired connection. Also, the person in charge has to be in the broadcasting room to broadcast, which has a problem of delaying the time. In this paper, we design a disaster broadcasting system using LTE modem. The designed system enables a broadcasting person to make a call to a broadcasting system from anywhere using a cellular phone and a public telephone. Broadcasting via telephone is possible only with the telephone number pre-registered in the system and can be registered / deleted by the administrator. The registered telephone number, incoming voice file, and announcement voice for automatic broadcasting are stored in the system internal SD memory for convenient management. This disaster broadcasting system is expected to contribute to quick and convenient disaster broadcasting.

  • PDF

Development of AVL-GIS System Using IDGPS and Wireless Communication Techniques (IDGPS 와 무선통신을 이용한 AVL-GIS 시스템개발)

  • 안충현;양종윤;최종현
    • Spatial Information Research
    • /
    • v.7 no.2
    • /
    • pp.209-221
    • /
    • 1999
  • In this research, AVL-GIS(Automatic Vehicle Location System linked with Geographic Information System) system was developed using integration of core techniques of GIS engine written by Java language, GOS(Global Positioning System) and wireless telecommunication interfacing techniques. IDGPS(Inverted differential GPS) techniques was employed to estimate accurate position of mobile vehicle and to supervise their path from AVL-GLS control center system. Between mobile vehicle and AVL-GLS control center system which has spatial data analysis function, road network and rleate ddata base were connected wireless phone to communicate for position an dmessage in real time. The developed system from this research has more enhanced GIS functions rather than previous AVL oriented system which has MDT for message display and voice communication only. This system can support build-up application system such as fleet management like bus, taxi, truck, disaster and emergency and monitoring of transportation status for customer s order via web browser in filed of EC/CALS in low cost.

  • PDF

Finding Measure Position Using Combination Rules of Musical Notes in Monophonic Song (단일 음원 노래에서 음표의 조합 규칙을 이용한 마디 위치 찾기)

  • Park, En-Jong;Shin, Song-Yi;Lee, Joon-Whoan
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.10
    • /
    • pp.1-12
    • /
    • 2009
  • There exist some regular multiple relations in the intervals of notes when they are combined within one measure. This paper presents a method to find the exact measure positions in monophonic song based on those relations. In the proposed method the individual intervals are segmented at first and the rules that state the multiple relations are used to find the measure position. The measures can be applied as the foundational information for extracting beat and tempo of a song which can be used as background knowledge of automatic music transcription system. The proposed method exactly detected the measure positions of 11 songs out of 12 songs except one song which consist of monophonic voice song of the men and women. Also one can extract the information of beat and tempo of a song using the information about extracted measure positions with music theory.

Analysis of Feature Extraction Methods for Distinguishing the Speech of Cleft Palate Patients (구개열 환자 발음 판별을 위한 특징 추출 방법 분석)

  • Kim, Sung Min;Kim, Wooil;Kwon, Tack-Kyun;Sung, Myung-Whun;Sung, Mee Young
    • Journal of KIISE
    • /
    • v.42 no.11
    • /
    • pp.1372-1379
    • /
    • 2015
  • This paper presents an analysis of feature extraction methods used for distinguishing the speech of patients with cleft palates and people with normal palates. This research is a basic study on the development of a software system for automatic recognition and restoration of speech disorders, in pursuit of improving the welfare of speech disabled persons. Monosyllable voice data for experiments were collected for three groups: normal speech, cleft palate speech, and simulated clef palate speech. The data consists of 14 basic Korean consonants, 5 complex consonants, and 7 vowels. Feature extractions are performed using three well-known methods: LPC, MFCC, and PLP. The pattern recognition process is executed using the acoustic model GMM. From our experiments, we concluded that the MFCC method is generally the most effective way to identify speech distortions. These results may contribute to the automatic detection and correction of the distorted speech of cleft palate patients, along with the development of an identification tool for levels of speech distortion.

A performance on the maritime MDT by SSB modem (SSB 모뎀에 의한 해상용 MDT의 구현)

  • 윤재준;최조천;김갑기
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.10a
    • /
    • pp.90-94
    • /
    • 2003
  • For the structure of VMS is required to the maritime MDT for realtime acquisition the position reporting of navigating ships. The first study is the recording method a navigating data by GPS, the 2th is SSB modem for data trans-receiver by use ship. And then the communication protocol of automatic traffic for in continues transmission a ship's ID, time and position data of GPS. In this paper have studied the SSB modem for transmission the voice and data at same time. Which is adapted to the communication control unit, protocol of data traffic and acquisition, displayer of character data. This maritime GPS MDT is considered to low-cost type by using microprocessor.

  • PDF

A Study of Speech Control Tags Based on Semantic Information of a Text (텍스트의 의미 정보에 기반을 둔 음성컨트롤 태그에 관한 연구)

  • Chang, Moon-Soo;Chung, Kyeong-Chae;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.187-200
    • /
    • 2006
  • The speech synthesis technology is widely used and its application area is also being broadened to an automatic response service, a learning system for handicapped person, etc. However, the sound quality of the speech synthesizer has not yet reached to the satisfactory level of users. To make a synthesized speech, the existing synthesizer generates rhythms only by the interval information such as space and comma or by several punctuation marks such as a question mark and an exclamation mark so that it is not easy to generate natural rhythms of people even though it is based on mass speech database. To make up for the problem, there is a way to select rhythms after processing language from a higher level information. This paper proposes a method for generating tags for controling rhythms by analyzing the meaning of sentence with speech situation information. We use the Systemic Functional Grammar (SFG) [4] which analyzes the meaning of sentence with speech situation information considering the sentence prior to the given one, the situation of a conversation, the relationship among people in the conversation, etc. In this study, we generate Semantic Speech Control Tag (SSCT) by the result of SFG's meaning analysis and the voice wave analysis.

  • PDF

Synchronization of the Train PIS using the reference clock and development of a subtitle authoring tool (레퍼런스 클럭을 이용한 객차 PI 시스템 동기화 및 자막 편집기 개발)

  • Kim, Jung-Hoon;Jang, Dong-Wook;Han, Kwang-Rok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.4
    • /
    • pp.1-10
    • /
    • 2007
  • This paper describes the development of a network-based passenger information system(PIS) which provides the convenience of the passenger of the train and heightens the effect of the subtitle service, the advertising and the shelter guidance broadcasting against the urgent event. The existing system uses VGA signal distributor in order to broadcast information with image and subtitle and voice guidance. In this paper we improve the existing system by applying the UDP and TCP/IP protocol and use a reference clock to solve a data loss and synchronization problem which occurs in this case. We also developed an XML-based subtitle authoring tool which can edit and play the subtitles with various 3D to improve the automatic guidance broadcasting and advertisement effect according to the operation schedule of the train. The system performance was evaluated through a simulation.

  • PDF

Analysis and Proposal of Communication System for Maritime HF Band Digital Data Exchange (해상 HF대역 디지털 데이터 교환을 위한 통신시스템 분석 및 제안)

  • Choi, Sung-Cheol;So, Ji-Eun;Park, Hyung-Chul
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.12
    • /
    • pp.2249-2260
    • /
    • 2017
  • IMO (International Maritime Organization) has been providing GMDSS (Global Maritime Distress and Safety System) and mandating to install distress and safety systems according to SOLAS. Digital-HF(High-Frequency) coast station communication system maintains interoperability between ship and coast station and digital data exchange in maritime mobile service by digitizing existing analog base voice communication. In this paper, we analyze ITU-R M. 1798-1 established by ITU for digital HF communications and propose Advanced annex2 and new Annex 5 to improve the problems of the existing Annex 2 and Annex 4. The proposed OFDM protocol basically adopts ARQ (Automatic Retransmission Request) which retransmits when an error occurs in a half-duplex manner between an information transmitting side (ISS) and an information receiving side (IRS) and we propose a digital HF communication system and its operational concept which is more reliable and superior than the existing ITU-R M. 1798 by implementing technical development on implementation and performance improvement.