Search | Korea Science

Telephone Speech Recognition with Data-Driven Selective Temporal Filtering based on Principal Component Analysis

Jung Sun Gyun;Son Jong Mok;Bae Keun Sung
- Proceedings of the IEEK Conference
- /
- 2004.08c
- /
- pp.764-767
- /
- 2004
The performance of a speech recognition system is generally degraded in telephone environment because of distortions caused by background noise and various channel characteristics. In this paper, data-driven temporal filters are investigated to improve the performance of a specific recognition task such as telephone speech. Three different temporal filtering methods are presented with recognition results for Korean connected-digit telephone speech. Filter coefficients are derived from the cepstral domain feature vectors using the principal component analysis.
PDF

Three-Stage Framework for Unsupervised Acoustic Modeling Using Untranscribed Spoken Content

Zgank, Andrej
- ETRI Journal
- /
- v.32 no.5
- /
- pp.810-818
- /
- 2010
This paper presents a new framework for integrating untranscribed spoken content into the acoustic training of an automatic speech recognition system. Untranscribed spoken content plays a very important role for under-resourced languages because the production of manually transcribed speech databases still represents a very expensive and time-consuming task. We proposed two new methods as part of the training framework. The first method focuses on combining initial acoustic models using a data-driven metric. The second method proposes an improved acoustic training procedure based on unsupervised transcriptions, in which word endings were modified by broad phonetic classes. The training framework was applied to baseline acoustic models using untranscribed spoken content from parliamentary debates. We include three types of acoustic models in the evaluation: baseline, reference content, and framework content models. The best overall result of 18.02% word error rate was achieved with the third type. This result demonstrates statistically significant improvement over the baseline and reference acoustic models.
https://doi.org/10.4218/etrij.10.1510.0092 인용 PDF KSCI

Identification of 18 flutter derivatives by covariance driven stochastic subspace method

Mishra, Shambhu Sharan;Kumar, Krishen;Krishna, Prem
- Wind and Structures
- /
- v.9 no.2
- /
- pp.159-178
- /
- 2006
For the slender and flexible cable supported bridges, identification of all the flutter derivatives for the vertical, lateral and torsional motions is essential for its stability investigation. In all, eighteen flutter derivatives may have to be considered, the identification of which using a three degree-of-freedom elastic suspension system has been a challenging task. In this paper, a system identification technique, known as covariance-driven stochastic subspace identification (COV-SSI) technique, has been utilized to extract the flutter derivatives for a typical bridge deck. This method identifies the stochastic state-space model from the covariances of the output-only (stochastic) data. All the eighteen flutter derivatives have been simultaneously extracted from the output response data obtained from wind tunnel test on a 3-DOF elastically suspended bridge deck section-model. Simplicity in model suspension and measurements of only output responses are additional motivating factors for adopting COV-SSI technique. The identified discrete values of flutter derivatives have been approximated by rational functions.
https://doi.org/10.12989/was.2006.9.2.159 인용

Semantic Interoperability Framework for IAAS Resources in Multi-Cloud Environment

Benhssayen, Karima;Ettalbi, Ahmed
- International Journal of Computer Science & Network Security
- /
- v.21 no.2
- /
- pp.1-8
- /
- 2021
Cloud computing has proven its efficiency, especially after the increasing number of cloud services offered by a wide range of cloud providers, from different domains. Despite, these cloud services are mostly heterogeneous. Consequently, and due to the rising interest of cloud consumers to adhere to a multi-cloud environment instead of being locked-in to one cloud provider, the need for semantically interconnecting different cloud services from different cloud providers is a crucial and important task to ensure. In addition, considerable research efforts proposed interoperability solutions leading to different representation models of cloud services. In this work, we present our solution to overcome this limitation, precisely in the IAAS service model. This solution is a framework permitting the semantic interoperability of different IAAS resources in a multi-cloud environment, in order to assist cloud consumers to retrieve the cloud resource that meets specific requirements.
https://doi.org/10.22937/IJCSNS.2021.21.2.1 인용 PDF KSCI

Theoretical Background for Data-driven Integration of Raster-based Geological Information (격자형 지질정보의 자료유도 통합을 위한 이론적 배경)

Lee, Ki-Won;Chi, Kwang-Hoon
- Journal of Korean Society for Geospatial Information Science
- /
- v.3 no.1 s.5
- /
- pp.115-121
- /
- 1995
Recently, spatial integration for mineral exploration is regarded as an important task of various geological applications of GIS. Therefore, theoretical bases of data representation and reasoning concerned with Dempster-Shafer theory and Fuzzy theory were systematically as the data-driven integration methodologies for raster-based geoinformation; they are distinguished from target-driven methodology based on statistical background. According to previous actual applications of these methods to mineral exploration, they have been proven to provide useful information related to hidden target mineral deposits, and it is thought that some suggestions in this study are helpful to further real applications including representation, reasoning, and interpretation stages in order to obtain a decision-supporting layer.
PDF

A Deadline_driven CPU Power Consumption Management Scheme of the TMO-eCos Real-Time Embedded OS (실시간 임베디드 운영체제 TMO-eCos의 데드라인 기반 CPU 소비 전력 관리)

Park, Jeong-Hwa;Kim, Jung-Guk
- Journal of KIISE:Computing Practices and Letters
- /
- v.15 no.4
- /
- pp.304-308
- /
- 2009
This paper presents the deadline driven CPU-Power management scheme for the Real-Time Embedded OS: named TMO-eCos. It used the scheduling scenarios generated by a task serialization technique for hard real- time TMO system. The serializer does a off-line analysis at design time with period, deadline and WCET of periodic tasks. Finally, TMO-eCos kernel controls the CPU speed to save the power consumption under the condition that periodic tasks do not violate deadlines. As a result, the system shows a reasonable amount of power saving. This paper presents all of these processes and test results.
PDF KSCI

Effective Acoustic Model Clustering via Decision Tree with Supervised Decision Tree Learning

Park, Jun-Ho;Ko, Han-Seok
- Speech Sciences
- /
- v.10 no.1
- /
- pp.71-84
- /
- 2003
In the acoustic modeling for large vocabulary speech recognition, a sparse data problem caused by a huge number of context-dependent (CD) models usually leads the estimated models to being unreliable. In this paper, we develop a new clustering method based on the C45 decision-tree learning algorithm that effectively encapsulates the CD modeling. The proposed scheme essentially constructs a supervised decision rule and applies over the pre-clustered triphones using the C45 algorithm, which is known to effectively search through the attributes of the training instances and extract the attribute that best separates the given examples. In particular, the data driven method is used as a clustering algorithm while its result is used as the learning target of the C45 algorithm. This scheme has been shown to be effective particularly over the database of low unknown-context ratio in terms of recognition performance. For speaker-independent, task-independent continuous speech recognition task, the proposed method reduced the percent accuracy WER by 3.93% compared to the existing rule-based methods.
PDF

Technical Trends in Artificial Intelligence for Robotics Based on Large Language Models (거대언어모델 기반 로봇 인공지능 기술 동향 )

J. Lee;S. Park;N.W. Kim;E. Kim;S.K. Ko
- Electronics and Telecommunications Trends
- /
- v.39 no.1
- /
- pp.95-105
- /
- 2024
In natural language processing, large language models such as GPT-4 have recently been in the spotlight. The performance of natural language processing has advanced dramatically driven by an increase in the number of model parameters related to the number of acceptable input tokens and model size. Research on multimodal models that can simultaneously process natural language and image data is being actively conducted. Moreover, natural-language and image-based reasoning capabilities of large language models is being explored in robot artificial intelligence technology. We discuss research and related patent trends in robot task planning and code generation for robot control using large language models.
https://doi.org/10.22648/ETRI.2024.J.390109 인용 PDF

Compliance Control of a 3-Link Electro-Hydraulic Manipulator (3축 전기유압 매니퓰레이터의 컴플라이언스 제어)

안경관;표성만
- Journal of the Korean Society for Precision Engineering
- /
- v.21 no.1
- /
- pp.101-108
- /
- 2004
An electro-hydraulic manipulator using hydraulic actuators has many nonlinear elements, and its parameter fluctuations are greater than those of an electrically driven manipulator. So it is relatively difficult to obtain stable control performance. In this report, we applied disturbance estimation and compensation type robust control to all axes in a 3-link electro-hydraulic manipulator. From the results of experiment, it was confirmed that the performance of trajectory tracking and attitude regulating is greatly improved by the disturbance observer, which model is the same for each axis. On the other hand, for the autonomous assembly tasks, it is said that compliance control is one of the most available methods. Therefore we proposed compliance control which is based on the position control by disturbance observer for our manipulator system. To realize more stable contact work, the states in the compliance loop are feedback, where not only displacement but also velocity and acceleration are considered. And we applied this compliance control to Peg-in-Hole insertion task and analyzed mechanical relation between peg and hole. Also we proposed new method of shifting the position of end-effector periodically for the purpose of smooth insertion. As a result of using this method, it is experimentally confirmed that Peg-in-Hole insertion task with a clearance of 0.05［mm］can be achieved.
PDF KSCI

Design of Real Time Task Scheduling for Line Controller of Continuous Manufacturing Process Automation (연속 공정 자동화를 위한 라인 제어기에서의 실시간 작업 스케쥴링에 관한 연구)

Lee, Joon-Soo;Cho, Young-Jo;Lim, Mee-Seub;Park, Jung-Min;Choy, Ick;Lim, Jun-Hong;Kim, Kwang-Bae
- Proceedings of the KIEE Conference
- /
- 1992.07a
- /
- pp.365-368
- /
- 1992
This paper presents an approach to the design of real time task scheduling for a line controller of continuous manufacturing process automation. The line controller has multiprocessor-based architecture with shared memory and is operated by firmware. This firmware contains menu-driven software supporting real-time database management and fuction-block control language. The multitasking line control processor performs the following three functions: 1) interprets the function block control language by virtue of shared memory in the database; 2) invokes an interupt service routine as required by external hardware; 3) detects errors and notifies the user. We propose real time task scheduling method.
PDF

Search Result 143, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)