Search | Korea Science

A Study on the Segmentation of Speech Signal into Phonemic Units (음성 신호의 음소 단위 구분화에 관한 연구)

Lee, Yeui-Cheon;Lee, Gang-Sung;Kim, Soon-Hyon
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.4
- /
- pp.5-11
- /
- 1991
This paper suggests a segmentation method of speech signal into phonemic units. The suggested segmentation system is speaker-independent and performed without anyprior information of speech signal. In segmentation process, we first divide input speech signal into purevoiced region and not pure voiced speech regions. After then we apply the second algorithm which segments each region into the detailed phonemic units by using the voiced detection parameters, i.e., the time variation of 0th LPC cepstrum coefficient parameter and the ZCR parameter. Types of speech, used to prove the availability of segmentation algorithm suggested in this paper, are the vocabulary composed of isolated words and continuous words. According to the experiments, the successful segmentation rate for 507 phonemic units involved in the total vocabulary is 91.7%.
PDF

Fault Pattern Extraction Via Adjustable Time Segmentation Considering Inflection Points of Sensor Signals for Aircraft Engine Monitoring (센서 데이터 변곡점에 따른 Time Segmentation 기반 항공기 엔진의 고장 패턴 추출)

Baek, Sujeong
- Journal of Korean Society of Industrial and Systems Engineering
- /
- v.44 no.3
- /
- pp.86-97
- /
- 2021
As mechatronic systems have various, complex functions and require high performance, automatic fault detection is necessary for secure operation in manufacturing processes. For conducting automatic and real-time fault detection in modern mechatronic systems, multiple sensor signals are collected by internet of things technologies. Since traditional statistical control charts or machine learning approaches show significant results with unified and solid density models under normal operating states but they have limitations with scattered signal models under normal states, many pattern extraction and matching approaches have been paid attention. Signal discretization-based pattern extraction methods are one of popular signal analyses, which reduce the size of the given datasets as much as possible as well as highlight significant and inherent signal behaviors. Since general pattern extraction methods are usually conducted with a fixed size of time segmentation, they can easily cut off significant behaviors, and consequently the performance of the extracted fault patterns will be reduced. In this regard, adjustable time segmentation is proposed to extract much meaningful fault patterns in multiple sensor signals. By considering inflection points of signals, we determine the optimal cut-points of time segments in each sensor signal. In addition, to clarify the inflection points, we apply Savitzky-golay filter to the original datasets. To validate and verify the performance of the proposed segmentation, the dataset collected from an aircraft engine (provided by NASA prognostics center) is used to fault pattern extraction. As a result, the proposed adjustable time segmentation shows better performance in fault pattern extraction.
https://doi.org/10.11627/jkise.2021.44.3.086 인용 PDF KSCI

A New Method for Segmenting Speech Signal by Frame Averaging Algorithm

Byambajav D.;Kang Chul-Ho
- The Journal of the Acoustical Society of Korea
- /
- v.24 no.4E
- /
- pp.128-131
- /
- 2005
A new algorithm for speech signal segmentation is proposed. This algorithm is based on finding successive similar frames belonging to a segment and represents it by an average spectrum. The speech signal is a slowly time varying signal in the sense that, when examined over a sufficiently short period of time (between 10 and 100 ms), its characteristics are fairly stationary. Generally this approach is based on finding these fairly stationary periods. Advantages of the. algorithm are accurate border decision of segments and simple computation. The automatic segmentations using frame averaging show as much as $82.20\%$ coincided with manually verified segmentation of CMU ARCTIC corpus within time range 16 ms. More than $90\%$ segment boundaries are coincided within a range of 32 ms. Also it can be combined with many types of automatic segmentations (HMM based, acoustic cues or feature based etc.).
PDF KSCI

Performance Comparison Between the Envelope Peak Detection Method and the HMM Based Method for Heart Sound Segmentation

Jang, Hyun-Baek;Chung, Young-Joo
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.2E
- /
- pp.72-78
- /
- 2009
Heart sound segmentation into its components, S1, systole, S2 and diastole is the first step of analysis and the most important part in the automatic diagnosis of heart sounds. Conventionally, the Shannon energy envelope peak detection method has been popularly used due to its superior performance in locating S1 and S2. Recently, the HMM has been shown to be quite suitable in modeling the heart sound signal and its use in segmenting the heart sound signal has been suggested with some success. In this paper, we compared the two methods for heart sound segmentation using a common database. Experimental tests carried out on the 4 different types of heart sound signals showed that the segmentation accuracy relative to the manual segmentation was 97.4% in the HMM based method which was larger than 91.5% in the peak detection method.
PDF KSCI

Effects of Segmentation Size on the Stationarity of Electromyographic Signal in Runs Test (런 검정을 사용한 근전도 신호의 안정성 평가 시 분할 크기가 신호의 안정성에 미치는 영향)

Cho, Young-Jin;Kim, Jung-Yong
- Journal of the Ergonomics Society of Korea
- /
- v.29 no.4
- /
- pp.667-671
- /
- 2010
Runs test is a mathematical tool to test the stationarity of electromyographic (EMG) signals. The purpose of this study is to investigate the effects of segmentation size on the stationarity of EMG signals in runs test. Six subjects participated in this experiment and performed isometric trunk exertions for twenty seconds at the load level of 25% and 50% MVC. The signals extracted from the erector spinae muscles were divided into the intervals of 1000ms and the stationarity of the signal in each interval was tested by the runs test. In this test, seven segmentation sizes such as 1.0, 2.0, 3.9, 7.8, 15.6, 31.3 and 62.5ms were applied. Additionally, two stationarity tests of reverse arrangements test and modified reverse arrangements test were used to verify the results of the runs test. In results, the segmentation size of 62.5ms showed the similar results with the other stationarity tests. However, the stationarity values among there tests were different each other when segmentation sizes other than 62.5ms were used. These results indicated the effect of segmentation size in runs test that needs to be considered to have consistent and sensitive result in stationarity test.
https://doi.org/10.5143/JESK.2010.29.4.667 인용 PDF KSCI

Application of Speech Recognition with Closed Caption for Content-Based Video Segmentations

Son, Jong-Mok;Bae, Keun-Sung
- Speech Sciences
- /
- v.12 no.1
- /
- pp.135-142
- /
- 2005
An important aspect of video indexing is the ability to segment video into meaningful segments, i.e., content-based video segmentation. Since the audio signal in the sound track is synchronized with image sequences in the video program, a speech signal in the sound track can be used to segment video into meaningful segments. In this paper, we propose a new approach to content-based video segmentation. This approach uses closed caption to construct a recognition network for speech recognition. Accurate time information for video segmentation is then obtained from the speech recognition process. For the video segmentation experiment for TV news programs, we made 56 video summaries successfully from 57 TV news stories. It demonstrates that the proposed scheme is very promising for content-based video segmentation.
PDF

Segmentation-based Signal Processing Algorithm for Vehicle Detection (차량검지를 위한 세그먼트에 기반을 둔 신호처리 알고리즘)

Ko, Ki-Won;Woo, Kwang-Joon
- Proceedings of the KIEE Conference
- /
- 2005.10b
- /
- pp.306-308
- /
- 2005
The vehicle detection method using pulse radar has the advantage of maintenance in comparison with loop detection method. We have the information about the vehicle being and position by dividing the signals into sectors in accordance with SSC method, and by applying the discriminant function based on stochastical data. We also reduce the signal processing time.
PDF

A Study on Endpoint Detection and Syllable Segmentation System Using Ramp Edge Detection (Ramp Edge Detection을 이용한 끝점 검출과 음절 분할에 관한 연구)

유일수;홍광석
- Proceedings of the IEEK Conference
- /
- 2003.07e
- /
- pp.2216-2219
- /
- 2003
Accurate speech region detection and automatic syllable segmentation is important part of speech recognition system. In automatic speech recognition system, they are needed for the purpose of accurate recognition and less computational complexity, In this paper, we Propose improved syllable segmentation method using ramp edge detection method and residual signal Peak energy. These methods were used to ensure accuracy and robustness for endpoint detection and syllable segmentation system. They have almost invariant response to various background noise levels. As experimental results, we obtained the rate of 90.7％ accuracy in syllable segmentation in a condition of accurate endpoint detection environments.
PDF

A Moving Picture Coding Method Based on Region Segmentation Using Genetic Algorithm (유전적 알고리즘을 이용한 동화상의 영역분할 부호화 방법)

Jung, Nam-Chae
- Journal of the Institute of Convergence Signal Processing
- /
- v.10 no.1
- /
- pp.32-39
- /
- 2009
In this paper, the method of region segmentation using genetic algorithm is proposed for an improvement of efficiency in moving picture coding. A genetic algorithm is the method that searches a large probing space using only a function value for a optimal combination consecutively. By progressing both motion presumption and region segmentation at once, we can assign the motion vector in a image to a small block or a pixel respectively, and transform the capacity of coding and a signal to noise rate into a problem of optimization. That is to say, there is close correlation between region segmentation and motion presumption in motion-compensated prediction coding. This is to optimize the capacity of coding and a S/N ratio. This is to arrange the motion vector in each block of picture according to the state of optimization. Therefore, we examined both the data type of genetic algorithm and the method of data processing to obtain the results of optimal region segmentation in this paper. And we confirmed the validity of a proposed method using the test pictures by means of computer simulation.
PDF

Sinusoidal Modeling of Polyphonic Audio Signals Using Dynamic Segmentation Method (동적 세그멘테이션을 이용한 폴리포닉 오디오 신호의 정현파 모델링)

장호근;박주성
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.4
- /
- pp.58-68
- /
- 2000
This paper proposes a sinusoidal modeling of polyphonic audio signals. Sinusoidal modeling which has been applied well to speech and monophonic signals cannot be applied directly to polyphonic signals because a window size for sinusoidal analysis cannot be determined over the entire signal. In addition, for high quality synthesized signal transient parts like attacks should be preserved which determines timbre of musical instrument. In this paper, a multiresolution filter bank is designed which splits the input signal into six octave-spaced subbands without aliasing and sinusoidal modeling is applied to each subband signal. To alleviate smearing of transients in sinusoidal modeling a dynamic segmentation method is applied to subbands which determines the analysis-synthesis frame size adaptively to fit time-frequency characteristics of the subband signal. The improved dynamic segmentation is proposed which shows better performance about transients and reduced computation. For various polyphonic audio signals the result of simulation shows the suggested sinusoidal modeling can model polyphonic audio signals without loss of perceptual quality.
PDF

Search Result 134, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)