• Title/Summary/Keyword: Standard Vocabulary

Search Result 66, Processing Time 0.026 seconds

Classification of Vocabulary for Evaluation on Machine Noise at High Noisy Workshop (고소음 작업장 기계소음 평가를 위한 어휘의 유형화)

  • Yun, Jae-Hyun;Kim, Jae-Soo
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.33 no.10
    • /
    • pp.748-755
    • /
    • 2011
  • After the Industrialization of 1960s, while it has greatly contributed to the industrial development owing to acceleration of mechanization, but it is real situation that the countermeasure to noise damage generating at the loud noise workshop is scarcely made. Especially, the machine noise made at factory and workplace is so shocking and repeatedly reiterating terrible noise that most of the spot workers are forcedly imposing such dangers as the severe unpleasant feeling and hearing impairments. On such point of view, this research has attempted to extract the proper rating vocabulary in order for evaluation on machine noise made at the high noisy workshop, therefore it is considering that those extracted vocabularies could be utilized as the useful psycho-acoustic experiment for evaluation on machine noise, also for establishment of regulation standard in domestic high noisy workshop.

Vocabularies for Defining Business Process of International Logistics (국제 물류 비즈니스 프로세스 정의를 위한 어휘 사전)

  • Lee, Jong-Yun;Lee, Sang-Ho;Bae, Woo-Sik;Lee, Sun-Young;Song, Byung-Jun
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2007.10c
    • /
    • pp.179-184
    • /
    • 2007
  • EPCglobal 네트워크란 EPC(Electronic Product Code)와 RFID(Radio Frequency Identification) 기술을 바탕으로 제품에 식별번호를 부여하고 정보를 저장할 수 있는 공간을 네트워크로 연동하여 공급자, 수요자, 소비가 제품에 관련된 정보를 알 수 있게 해주는 시스템을 말한다. 또한 EPCglobal 네트워크 아키텍처 프레임워크(EPCglobal network architecture framework)는 EPC를 사용하여 공급과 유통망 강화라는 업체간의 공동 목표를 위한 서비스하는 것이다. EPCglobal 네트워크 아키텍처 중 EPCIS(EPC Information System) 수집 애플리케이션, EPCIS 접근 애플리케이션에서 이벤트 정의 시 공통적인 의사소통을 위한 표준 어휘 정의가 필수적이며, 그 어휘 종류에는 표준 어휘(standard vocabulary)와 사용자어휘(user vocabulary)가 있다. 이 논문은 이 가운데 EPCglobal 네트워크 아키텍처의 표준 어휘 정의에 초점을 두고, 사용자 어휘는 부분적으로 정의하고자 한다. 따라서 이 논문은 EPCglobal 네트워크 프레임 아키텍처의 이벤트 정의 시 사용될 표준 어휘를 제안하고, XML 구현을 통해 실제 사용 가능한 환경을 제공할 것이다.

  • PDF

Stochastic Pronunciation Lexicon Modeling for Large Vocabulary Continous Speech Recognition (확률 발음사전을 이용한 대어휘 연속음성인식)

  • Yun, Seong-Jin;Choi, Hwan-Jin;Oh, Yung-Hwan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.49-57
    • /
    • 1997
  • In this paper, we propose the stochastic pronunciation lexicon model for large vocabulary continuous speech recognition system. We can regard stochastic lexicon as HMM. This HMM is a stochastic finite state automata consisting of a Markov chain of subword states and each subword state in the baseform has a probability distribution of subword units. In this method, an acoustic representation of a word can be derived automatically from sample sentence utterances and subword unit models. Additionally, the stochastic lexicon is further optimized to the subword model and recognizer. From the experimental result on 3000 word continuous speech recognition, the proposed method reduces word error rate by 23.6% and sentence error rate by 10% compare to methods based on standard phonetic representations of words.

  • PDF

Emergency dispatching based on automatic speech recognition (음성인식 기반 응급상황관제)

  • Lee, Kyuwhan;Chung, Jio;Shin, Daejin;Chung, Minhwa;Kang, Kyunghee;Jang, Yunhee;Jang, Kyungho
    • Phonetics and Speech Sciences
    • /
    • v.8 no.2
    • /
    • pp.31-39
    • /
    • 2016
  • In emergency dispatching at 119 Command & Dispatch Center, some inconsistencies between the 'standard emergency aid system' and 'dispatch protocol,' which are both mandatory to follow, cause inefficiency in the dispatcher's performance. If an emergency dispatch system uses automatic speech recognition (ASR) to process the dispatcher's protocol speech during the case registration, it instantly extracts and provides the required information specified in the 'standard emergency aid system,' making the rescue command more efficient. For this purpose, we have developed a Korean large vocabulary continuous speech recognition system for 400,000 words to be used for the emergency dispatch system. The 400,000 words include vocabulary from news, SNS, blogs and emergency rescue domains. Acoustic model is constructed by using 1,300 hours of telephone call (8 kHz) speech, whereas language model is constructed by using 13 GB text corpus. From the transcribed corpus of 6,600 real telephone calls, call logs with emergency rescue command class and identified major symptom are extracted in connection with the rescue activity log and National Emergency Department Information System (NEDIS). ASR is applied to emergency dispatcher's repetition utterances about the patient information. Based on the Levenshtein distance between the ASR result and the template information, the emergency patient information is extracted. Experimental results show that 9.15% Word Error Rate of the speech recognition performance and 95.8% of emergency response detection performance are obtained for the emergency dispatch system.

Development of Online Fashion Thesaurus and Taxonomy for Text Mining (텍스트마이닝을 위한 패션 속성 분류체계 및 말뭉치 웹사전 구축)

  • Seyoon Jang;Ha Youn Kim;Songmee Kim;Woojin Choi;Jin Jeong;Yuri Lee
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.46 no.6
    • /
    • pp.1142-1160
    • /
    • 2022
  • Text data plays a significant role in understanding and analyzing trends in consumer, business, and social sectors. For text analysis, there must be a corpus that reflects specific domain knowledge. However, in the field of fashion, the professional corpus is insufficient. This study aims to develop a taxonomy and thesaurus that considers the specialty of fashion products. To this end, about 100,000 fashion vocabulary terms were collected by crawling text data from WSGN, Pantone, and online platforms; text subsequently was extracted through preprocessing with Python. The taxonomy was composed of items, silhouettes, details, styles, colors, textiles, and patterns/prints, which are seven attributes of clothes. The corpus was completed through processing synonyms of terms from fashion books such as dictionaries. Finally, 10,294 vocabulary words, including 1,956 standard Korean words, were classified in the taxonomy. All data was then developed into a web dictionary system. Quantitative and qualitative performance tests of the results were conducted through expert reviews. The performance of the thesaurus also was verified by comparing the results of text mining analysis through the previously developed corpus. This study contributes to achieving a text data standard and enables meaningful results of text mining analysis in the fashion field.

Sentence Rejection using Word Spotting Ratio in the Phoneme-based Recognition Network (음소기반 인식 네트워크에서의 단어 검출률을 이용한 문장거부)

  • Kim, Hyung-Tai;Ha, Jin-Young
    • Proceedings of the KSPS conference
    • /
    • 2005.04a
    • /
    • pp.99-102
    • /
    • 2005
  • Research efforts have been made for out-of-vocabulary word rejection to improve the confidence of speech recognition systems. However, little attention has been paid to non-recognition sentence rejection. According to the appearance of pronunciation correction systems using speech recognition technology, it is needed to reject non-recognition sentences to provide users with more accurate and robust results. In this paper, we introduce standard phoneme based sentence rejection system with no need of special filler models. Instead we used word spotting ratio to determine whether input sentences would be accepted or rejected. Experimental results show that we can achieve comparable performance using only standard phoneme based recognition network in terms of the average of FRR and FAR.

  • PDF

Analyzing Vocabulary Characteristics of Colloquial Style Corpus and Automatic Construction of Sentiment Lexicon (구어체 말뭉치의 어휘 사용 특징 분석 및 감정 어휘 사전의 자동 구축)

  • Kang, Seung-Shik;Won, HyeJin;Lee, Minhaeng
    • Smart Media Journal
    • /
    • v.9 no.4
    • /
    • pp.144-151
    • /
    • 2020
  • In a mobile environment, communication takes place via SMS text messages. Vocabularies used in SMS texts can be expected to use vocabularies of different classes from those used in general Korean literary style sentence. For example, in the case of a typical literary style, the sentence is correctly initiated or terminated and the sentence is well constructed, while SMS text corpus often replaces the component with an omission and a brief representation. To analyze these vocabulary usage characteristics, the existing colloquial style corpus and the literary style corpus are used. The experiment compares and analyzes the vocabulary use characteristics of the colloquial corpus SMS text corpus and the Naver Sentiment Movie Corpus, and the written Korean written corpus. For the comparison and analysis of vocabulary for each corpus, the part of speech tag adjective (VA) was used as a standard, and a distinctive collexeme analysis method was used to measure collostructural strength. As a result, it was confirmed that adjectives related to emotional expression such as'good-','sorry-', and'joy-' were preferred in the SMS text corpus, while adjectives related to evaluation expressions were preferred in the Naver Sentiment Movie Corpus. The word embedding was used to automatically construct a sentiment lexicon based on the extracted adjectives with high collostructural strength, and a total of 343,603 sentiment representations were automatically built.

Investigation on the Scrum-based Standard Management for Efficient Data Quality Control of Small-sized Companies : A Case Study on Distribution Service of Company 'I' (중소기업의 효율적 데이터 품질관리를 위한 스크럼 기반 표준관리 방안 : 'I'사 물류서비스 적용 사례)

  • Kim, Tai-Yun;Kim, Nam-Gyu;Sohn, Yong-Lak
    • Journal of Information Technology Applications and Management
    • /
    • v.17 no.1
    • /
    • pp.83-105
    • /
    • 2010
  • The competence of enterprise for managing information is evaluated not by the amount of information but by the quality of information such as response time, data consistency, and data correctness. The degradation of data quality is usually caused by the inappropriate process of managing the structure and value of stored data. According to the recent survey on the actual condition of data quality management, the correctness and consistency of data appeared to be the most problematic area among the six criteria of data quality management such as correctness, consistency, availability, timeliness, accessibility, and security. Moreover, the problem was more serious in case of small and medium-sized companies than large enterprises. In this paper, therefore, we attempt to propose a new data quality control methodology for small and medium-sized companies that can improve the correctness and consistency of data without consuming too much time and cost. To adopt the proposed methodology to real application immediately, we provided some scripts for as-is analysis and devised automation tools for managing naming rules of vocabulary, terminology, and data code. Additionally, we performed case study on the distribution service of a small-sized company to estimate the applicability of our tool and methodology.

  • PDF

An Analysis of ISO 30300 Principles and ISO 30301 Requirements (ISO 30300 원리와 ISO 30301 요구사항 분석)

  • Ryu, Ga Hyun;Kim, Heesop
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.12 no.1
    • /
    • pp.95-115
    • /
    • 2012
  • The purpose of this study is to analyze the 7 principles of ISO 30300 and the requirements of ISO 30301 to provide the conceptual frame of MSR for various organizations providing a comprehensive understanding of the standard series. To achieve the aim of this study, we analyzed the relationship between 'ISO/FDIS 30300 MSR-Fundamentals and Vocabulary' and 'ISO/FDIS 30301 MSR- Requirements' using the Matrix Diagram approach. The results show that there exist some level of relationships between principles and requirement either directly or indirectly, however, 'Process approach' principle seems the closest relationship among them.

Development of the ISO 15926-based Classification Structure for Nuclear Plant Equipment (ISO 15926 국제 표준을 이용한 원자력 플랜트 기자재 분류체계)

  • Yun, J.;Mun, D.;Han, S.;Cho, K.
    • Korean Journal of Computational Design and Engineering
    • /
    • v.12 no.3
    • /
    • pp.191-199
    • /
    • 2007
  • In order to construct a data warehouse of process plant equipment, a classification structure should be defined first, identifying not only the equipment categories but also attributes of an each equipment to represent the specifications of equipment. ISO 15926 Process Plants is an international standard dealing with the life-cycle data of process plant facilities. From the viewpoints of defining classification structure, Part 2 data model and Reference Data Library (RDL) of ISO 15926 are seen to respectively provide standard syntactic structure and semantic vocabulary, facilitating the exchange and sharing of plant equipment's life-cycle data. Therefore, the equipment data warehouse with an ISO 15926-based classification structure has the advantage of easy integration among different engineering systems. This paper introduces ISO 15926 and then discusses how to define a classification structure with ISO 15926 Part 2 data model and RDL. Finally, we describe the development result of an ISO 15926-based classification structure for a variety of equipment consisting in the reactor coolant system (RCS) of APR 1400 nuclear plant.