• Title/Summary/Keyword: text coding


An Interaction-Based MPEG-4 Player for a PDA (PDA 환경에서의 인터렉션 기반의 MPEG-4 재생기)

  • N., Kim;S., Kim;H., Lee;S., Kim
    • Proceedings of the Korea Multimedia Society Conference / 2004.05a / pp.370-373 / 2004
  • The rapid proliferation of mobile devices such as PDAs gives users more ubiquitous access to multimedia information. User mobility gives users a uniform view of their preferred working environment, independent of their current point of attachment. Supporting user mobility requires a player capable of presenting multimedia content efficiently. MPEG-4 provides a description not only for coding audio and video (as its predecessors MPEG-1 and MPEG-2 did), but also for coding images, animations, and interactivity and for protecting content. With MPEG-4, we present interactive media using multiple objects (audio, video, image, 2D geometry, and text) in a single format. We therefore propose an MPEG-4 player for PDAs. The proposed player supports mobility, portability, and personality.


Design of Python Block and Text Co-coding Platform for Artificial Intelligence Convergence in Vocational Education (인공지능 융합 직업 교육을 위한 파이썬 블록과 텍스트 공동 코딩 플랫폼 설계)

  • Lee, Se-Hoon;Kim, Yeon-Woo;Hong, Seung-Min
    • Proceedings of the Korean Society of Computer Information Conference / 2022.01a / pp.231-232 / 2022
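  • In this paper, we design a platform for simultaneous Python block and text coding, aimed at artificial intelligence convergence education in the vocational education field. The platform uses Python as its coding language because Python supports a wide range of libraries for data analysis and machine learning, and its architecture lets domain experts in vocational education easily create, add, and customize Python block modules for job functions. To demonstrate the usefulness and efficiency of the proposed platform, we show the process of adding block modules and building practice examples for two AI convergence vocational fields, bio and mechanical engineering.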


The Korean Text-to-speech Using Syllable Units (음절 단위를 이용한 한국어 음성 합성)

  • 김병수;윤기선;박성한
    • Journal of the Korean Institute of Telematics and Electronics / v.27 no.1 / pp.143-150 / 1990
  • In this paper, a rule-based method for improving the intelligibility of synthetic speech is proposed. A 12-pole linear predictive coding (LPC) method is used to model syllable speech signals. A syllable concatenation rule for pauses and frame rejection between syllables is developed to improve the naturalness of the synthetic speech. In addition, a phonological structure transformation rule and a prosody rule are applied to the LPC-synthesized speech. Illustrative results demonstrate that the synthetic speech obtained by applying these rules is more natural than speech synthesized by LPC alone.

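
As a rough illustration of what fitting the 12-pole LPC model mentioned in this abstract involves, the sketch below estimates predictor coefficients with the autocorrelation method and the Levinson-Durbin recursion. The abstract does not say which estimation method the authors used, so this choice is an assumption, and the function names `autocorr` and `levinson_durbin` are hypothetical.

```python
def autocorr(x, max_lag):
    """Autocorrelation r[0..max_lag] of one frame of samples x."""
    return [sum(x[i] * x[i + k] for i in range(len(x) - k))
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations for the given model order.

    Returns (a, err), where a[0] == 1.0 and the prediction error is
    e[n] = x[n] + a[1]*x[n-1] + ... + a[order]*x[n-order].
    Assumes r[0] > 0 and that the error energy stays positive.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        # Correlation of the current residual with the lag-m sample.
        acc = r[m] + sum(a[j] * r[m - j] for j in range(1, m))
        k = -acc / err                      # reflection coefficient
        prev = a[:]
        for j in range(1, m):
            a[j] = prev[j] + k * prev[m - j]
        a[m] = k
        err *= (1.0 - k * k)                # remaining prediction error energy
    return a, err
```

For a 12-pole model, one would call `levinson_durbin(autocorr(frame, 12), 12)` on a windowed speech frame; windowing and frame length are left out of this sketch.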

On a Speech Coding Algorithm for Low Cost Implementation of Voice Telegram System (보이스 전보 시스템 구현을 위한 저가형 음성파형 부호화 알고리즘)

  • 나덕수;민소연;배명진
    • The Journal of the Acoustical Society of Korea / v.19 no.2 / pp.101-105 / 2000
  • The telegram has long been used to transmit urgent news or congratulatory messages, and it has been an important medium in daily life. Although telegram processing has become more and more convenient, the telegram service carries only a text message. A voice telegram delivers the user's voice along with the text message, so it can convey the sender's emotions and feelings. However, because voice information contains a large amount of data, a large memory and a high-cost processor are needed to deliver it. In this paper, we propose a new speech waveform coding method with low complexity and low implementation cost for the voice telegram system. First, we fix one basic speech waveform per pitch period and measure the waveform similarity between the basic waveform and the neighboring speech waveform. Second, if the similarity satisfies the threshold values, we compress the neighboring waveform into a pitch value and a magnitude value per pitch period; if not, we store the speech waveform itself. At a compression of about 45%, we obtained a MOS of about 4.

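
The two-step scheme in this abstract can be sketched roughly as follows. This is not the authors' implementation: the similarity measure (normalized correlation), the magnitude value (ratio of peak amplitudes), the 0.9 threshold, and the names `similarity`, `encode_periods`, and `decode_periods` are all assumptions made for illustration. The input is assumed to be already segmented into pitch periods.

```python
import math


def similarity(a, b):
    """Normalized correlation of two pitch-period waveforms (assumed measure)."""
    n = min(len(a), len(b))
    dot = sum(x * y for x, y in zip(a[:n], b[:n]))
    na = math.sqrt(sum(x * x for x in a[:n]))
    nb = math.sqrt(sum(y * y for y in b[:n]))
    return dot / (na * nb) if na and nb else 0.0


def encode_periods(periods, threshold=0.9):
    """Encode pitch periods as raw waveforms or (length, gain) references.

    threshold must be positive so that a matched basic waveform is never silent.
    """
    encoded, basic = [], None
    for p in periods:
        if basic is not None and similarity(basic, p) >= threshold:
            # Similar enough: keep only the period length and a magnitude value.
            gain = max(abs(x) for x in p) / max(abs(x) for x in basic)
            encoded.append(("ref", len(p), gain))
        else:
            # Too different: store the waveform and use it as the new basic one.
            basic = list(p)
            encoded.append(("raw", basic))
    return encoded


def decode_periods(encoded):
    """Rebuild pitch periods; references repeat the scaled basic waveform."""
    out, basic = [], []
    for item in encoded:
        if item[0] == "raw":
            basic = item[1]
            out.append(list(basic))
        else:
            _, length, gain = item
            out.append([gain * basic[i % len(basic)] for i in range(length)])
    return out
```

Each `("ref", length, gain)` entry replaces a full period of samples with two numbers, which is where the compression in such a scheme would come from; the reconstruction is lossy whenever the neighboring period is not an exact scaled copy of the basic one.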

A New Vocoder based on AMR 7.4Kbit/s Mode for Speaker Dependent System (화자 의존 환경의 AMR 7.4Kbit/s모드에 기반한 보코더)

  • Min, Byung-Jae;Park, Dong-Chul
    • The Journal of Korean Institute of Communications and Information Sciences / v.33 no.9C / pp.691-696 / 2008
  • A new Code Excited Linear Predictive (CELP) vocoder based on the Adaptive Multi-Rate (AMR) 7.4 kbit/s mode is proposed in this paper. The proposed vocoder achieves a better compression rate in a Speaker Dependent Coding System (SDSC) environment and can be used efficiently in systems such as OGM (outgoing message) and TTS (text-to-speech), which need only one person's speech. To enhance the compression rate of the coder, a new Line Spectral Pairs (LSP) codebook, designed using the Centroid Neural Network (CNN) algorithm, is employed. Compared with the original (traditional) AMR 7.4 kbit/s coder, the new coder shows a 27% higher compression rate while preserving synthesized speech quality in terms of Mean Opinion Score (MOS).

The Effect of e-Learning Contents' Information Presentation Method on Teaching Presence and Academic Achievement (e-러닝 콘텐츠의 정보제시방식이 교수실재감 및 학업성취도에 미치는 효과)

  • Kim, Jinha;Kim, Kyunghee;Lee, Seongju
    • The Journal of Korean Association of Computer Education / v.22 no.3 / pp.79-87 / 2019
  • This study examined the effect on learning of e-learning contents that differ in their degree of dual coding, media richness, and cognitive load. To do so, the summary and explanation presentation methods in e-learning contents were divided according to the quantity and kind of information, and their effects on teaching presence and academic achievement were examined. The summary presentation method was produced in a text type and a text+illustration type, and the explanation presentation method in an audio type and an audio+video type. The results of this study are as follows. First, in the summary method, the text+illustration type yielded significantly higher teaching presence than the text type. Second, in the explanation method, the audio type was found to be significantly higher than the audio+video type. Third, the interaction between the summary method and the explanation method was found to be significant for teaching presence and academic achievement.

A study on narrative text analysis from the perspective of information processing - focusing on four computational methodologies (정보처리 관점에서의 서사 텍스트 분석에 관한 연구 - 네 가지 전산적 방법론을 중심으로)

  • Kwon, Hochang
    • Trans- / v.13 / pp.141-169 / 2022
  • The analysis of narrative texts has been regarded as academically and practically important, and has been carried out from various perspectives and with various methods. This paper examines computational narrative analysis methodology from the perspective of information processing. From this point of view, the creation and acceptance of narrative is a bidirectional coding process mediated by the narrative text, and the narrative text can be said to be a multi-layered structured code. Four methodologies that share this point of view (character network analysis, text mining and sentiment analysis, continuity analysis of event composition, and knowledge analysis of narrative agents) were examined together with cases. Through this, the mechanism and possibility of computational methodology in narrative analysis were confirmed. In conclusion, the significance and side effects of computational narrative analysis were examined, and the necessity of designing a human-computer collaboration model based on the consilience of the humanities and science/technology was discussed. Based on this model, it was argued that aesthetically creative, ethically good, politically progressive, and cognitively sophisticated narratives could be made more effectively.

Huffman Code Design and PSIP Structure of Hangul Data for Digital Broadcasting (디지털 방송용 한글 허프만 부호 설계 및 PSIP 구조)

  • 황재정;진경식;한학수;최준영;이진환
    • Journal of Broadcast Engineering / v.6 no.1 / pp.98-107 / 2001
  • In this paper we derive an optimal Huffman code set with escape coding that maximizes coding efficiency for Hangul text data. Hangul can be represented in the standard Wansung or Unicode format, and we can generate a set of Huffman codes for both. The current Korean DT standard has not defined a Hangul compression algorithm, which may lead to a serious data rate problem for the digital data broadcasting system. Generating the optimal Huffman code set is intended to solve this data transmission problem. A relevant PSIP structure for the DTB standard is also proposed. As a result, characters with a probability of less than 0.0043 are escape coded, giving the optimum compression efficiency of 46%.

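
To make the idea of Huffman coding with escape coding concrete, here is a minimal sketch under stated assumptions: characters whose relative frequency falls below a threshold share a single escape codeword and are then sent as raw 16-bit code points. The `ESC` marker, the function names, and the 16-bit raw format are hypothetical choices for illustration, not the code set or PSIP structure derived in the paper.

```python
import heapq
from collections import Counter
from itertools import count

ESC = "<ESC>"  # hypothetical escape symbol shared by all rare characters


def build_escape_huffman(text, threshold):
    """Return {symbol: bitstring}; characters rarer than threshold map to ESC."""
    freq = Counter(text)
    total = len(text)
    weights = {}
    for ch, n in freq.items():
        key = ch if n / total >= threshold else ESC
        weights[key] = weights.get(key, 0) + n
    tiebreak = count()  # unique counter so the heap never compares dicts
    heap = [(w, next(tiebreak), {sym: ""}) for sym, w in weights.items()]
    heapq.heapify(heap)
    if len(heap) == 1:
        return {sym: "0" for sym in heap[0][2]}
    while len(heap) > 1:
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + b for s, b in c1.items()}
        merged.update({s: "1" + b for s, b in c2.items()})
        heapq.heappush(heap, (w1 + w2, next(tiebreak), merged))
    return heap[0][2]


def encode(text, codes, raw_bits=16):
    """Encode text; escaped characters are ESC plus a raw code point.

    Assumes every code point fits in raw_bits (true for Hangul syllables).
    """
    out = []
    for ch in text:
        if ch in codes:
            out.append(codes[ch])
        else:
            out.append(codes[ESC] + format(ord(ch), f"0{raw_bits}b"))
    return "".join(out)


def decode(bits, codes, raw_bits=16):
    """Invert encode() by walking the bitstring one prefix code at a time."""
    inverse = {b: s for s, b in codes.items()}
    out, buf, i = [], "", 0
    while i < len(bits):
        buf += bits[i]
        i += 1
        sym = inverse.get(buf)
        if sym is None:
            continue
        if sym == ESC:
            out.append(chr(int(bits[i:i + raw_bits], 2)))
            i += raw_bits
        else:
            out.append(sym)
        buf = ""
    return "".join(out)
```

The threshold trades table size against cost per rare character: a higher threshold shrinks the code table but makes each escaped character cost the escape codeword plus 16 raw bits. The abstract reports 0.0043 as the cutoff that worked best for its Hangul data.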

PDOCM : Fast Text Compression on MasPar Machine (PDOCM : MasPar머쉰상의 새로운 압축기법과 빠른 텍스트 축약)

  • Min, Yong-Sik
    • The Journal of the Acoustical Society of Korea / v.14 no.1 / pp.40-47 / 1995
  • Due to rapid progress in data communications, we are able to acquire the information we need with ease. One means of achieving this is a parallel machine such as the MasPar. Although the parallel machine makes it possible to receive/transmit enormous quantities of data, because of the increasing volume of information that must be processed, it is necessary to transmit only a minimal amount of data bits. This paper suggests a new coding method for the parallel machine, which compresses the data by reducing redundancy. Parallel Dynamic Octal Compact Mapping (PDOCM) compresses at least 1 byte per word, compared with other coding techniques, and achieves a 54.188-fold speedup with 64 processors to transmit 10 million characters.
