• Title/Summary/Keyword: Data segmentation

Search Result 1,234, Processing Time 0.025 seconds

Consonant/Vowel Segmentation in Monosyllabic Speech Data Using the Fractal Dimension (프랙탈 차원을 이용한 단음절 음성의 자$\cdot$모음 분리)

  • Choi, Chul-Young;Kim, Hyung-Soon;Kim, Jae-Ho;Son, Kyung-Sik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.3
    • /
    • pp.51-62
    • /
    • 1994
  • In this paper, we performed a class of experiments on segmenting consonant and vowel from Korean consonant-vowel (CV) monosyllable data, using the fractal dimension of the speech signals. We chose the Minkowski-Bouligand dimension as the fractal dimension, and computed it using the morphological covering method. In order to examine the usefulness of the fractal dimension in speech segmentation we carried out speech segmentation experiments using the fractal dimension alone, using the short-time energy alone, and using both the fractal dimension and the short-time energy, and compared the results. From the experiments, segmentation accuracy of $96.1\%$ was achieved for the case with using the multiplication of the slope of the fractal dimension and that of the energy, while the segmentation accuracies for the cases with using the slope of either the fractal dimension or energy alone were slightly lower $(93.6\%)$ or much lower $(88.0\%)$ than the above case, respectively. These results indicate that the fractal dimension can be used as a good parameter for speech segmentation.

  • PDF

A Moving Picture Coding Method Based on Region Segmentation Using Genetic Algorithm (유전적 알고리즘을 이용한 동화상의 영역분할 부호화 방법)

  • Jung, Nam-Chae
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.10 no.1
    • /
    • pp.32-39
    • /
    • 2009
  • In this paper, the method of region segmentation using genetic algorithm is proposed for an improvement of efficiency in moving picture coding. A genetic algorithm is the method that searches a large probing space using only a function value for a optimal combination consecutively. By progressing both motion presumption and region segmentation at once, we can assign the motion vector in a image to a small block or a pixel respectively, and transform the capacity of coding and a signal to noise rate into a problem of optimization. That is to say, there is close correlation between region segmentation and motion presumption in motion-compensated prediction coding. This is to optimize the capacity of coding and a S/N ratio. This is to arrange the motion vector in each block of picture according to the state of optimization. Therefore, we examined both the data type of genetic algorithm and the method of data processing to obtain the results of optimal region segmentation in this paper. And we confirmed the validity of a proposed method using the test pictures by means of computer simulation.

  • PDF

Object detection in financial reporting documents for subsequent recognition

  • Sokerin, Petr;Volkova, Alla;Kushnarev, Kirill
    • International journal of advanced smart convergence
    • /
    • v.10 no.1
    • /
    • pp.1-11
    • /
    • 2021
  • Document page segmentation is an important step in building a quality optical character recognition module. The study examined already existing work on the topic of page segmentation and focused on the development of a segmentation model that has greater functional significance for application in an organization, as well as broad capabilities for managing the quality of the model. The main problems of document segmentation were highlighted, which include a complex background of intersecting objects. As classes for detection, not only classic text, table and figure were selected, but also additional types, such as signature, logo and table without borders (or with partially missing borders). This made it possible to pose a non-trivial task of detecting non-standard document elements. The authors compared existing neural network architectures for object detection based on published research data. The most suitable architecture was RetinaNet. To ensure the possibility of quality control of the model, a method based on neural network modeling using the RetinaNet architecture is proposed. During the study, several models were built, the quality of which was assessed on the test sample using the Mean average Precision metric. The best result among the constructed algorithms was shown by a model that includes four neural networks: the focus of the first neural network on detecting tables and tables without borders, the second - seals and signatures, the third - pictures and logos, and the fourth - text. As a result of the analysis, it was revealed that the approach based on four neural networks showed the best results in accordance with the objectives of the study on the test sample in the context of most classes of detection. The method proposed in the article can be used to recognize other objects. A promising direction in which the analysis can be continued is the segmentation of tables; the areas of the table that differ in function will act as classes: heading, cell with a name, cell with data, empty cell.

The Service Operation Strategy of Internet Shopping Mall by User Segmentation Market Typology

  • Jeong, Won-Kil
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.1
    • /
    • pp.11-20
    • /
    • 2004
  • The purpose of this paper was to reveal Service Operation Strategy for the Internet shopping mall based on the types of internet users' market segmentation focus on the internet shopping behavior and e-service quality. In this paper, we examined internet shopping behavior and internet service quality factor depend on the types of internet users' group empirically. The empirical study result identifies the main decision factor depend on the types of internet users' group. On the basis of these result, Service Operation Strategy for the internet shopping mall has been suggested.

  • PDF

Market Segmentation of Patient-Utilization in Oriental Medical Care and Western Medical Care (양.한방 의료서비스 이용환자의 시장 세분화에 관한 연구)

  • 이선희;조희숙;최은영;최귀선;채유미
    • Health Policy and Management
    • /
    • v.12 no.1
    • /
    • pp.125-143
    • /
    • 2002
  • The objectives of this study were analysis of patient\`s characteristics and market segmentation in oriental medical care and western medical care. This study focused on medical utilization using Anderson's health utilization model. The source of data was 1998 National Health and Nutrition Survey which Korean Institute For Health and Social Affairs carried out. A stratified multistage probability sampling design was used in this survey. The analysis was conducted using the statistical software package SPSS version 10.0 and Answer Tree 2.1 which is one of data mining methodology. The results were as follows ; 1) 44.9% of respondents reported visiting oriental medical center within recent two weeks. 3.4% of them used oriental medical care. The group of age, kind of disease and medical expenditure are associated with the difference western and oriental medical utilization rate. 2) There were several factors related to utilization of oriental medical care according to decision tree. Especially, important factors that patient chose his medical center were kinds of disease, kinds of common medical use, and expenditure. 3) in the results of CART analysis, market of oriental medical care were classified by seven categories. The major groups who have a preference for oriental medicine were those musculo-skeletal, cerebra-vascular disease, or chronic headache patients, and they had a preference fur oriental medical care in common use. These results show that oriental and western medical market were divided into various areas by market segmentation.

A Two-Stage Document Page Segmentation Method using Morphological Distance Map and RBF Network (거리 사상 함수 및 RBF 네트워크의 2단계 알고리즘을 적용한 서류 레이아웃 분할 방법)

  • Shin, Hyun-Kyung
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.9
    • /
    • pp.547-553
    • /
    • 2008
  • We propose a two-stage document layout segmentation method. At the first stage, as top-down segmentation, morphological distance map algorithm extracts a collection of rectangular regions from a given input image. This preliminary result from the first stage is employed as input parameters for the process of next stage. At the second stage, a machine-learning algorithm is adopted RBF network, one of neural networks based on statistical model, is selected. In order for constructing the hidden layer of RBF network, a data clustering technique bared on the self-organizing property of Kohonen network is utilized. We present a result showing that the supervised neural network, trained by 300 number of sample data, improves the preliminary results of the first stage.

Segmentation of the Compensation Packages for Doctors by Mixture Regression Model (혼합회귀모델을 이용한 의사의 선호보상체계 분석)

  • Paik, Soo-Kyung;Kwak, Young-Sik
    • Korea Journal of Hospital Management
    • /
    • v.10 no.4
    • /
    • pp.75-97
    • /
    • 2005
  • The research objective is to empirically investigate the compensation packages maximizing the utilities of internal customers by applying the market segmentation theory. Data was collected from four Korean hospitals in Seoul, Busan and Gyunggi-do. The research is designed to seek the compensation package maximizing the utility of doctors by mixture regression model, which has been applied as latent structure and other type of finite mixture models from various academic fields since early 1980s. The mixture regression model shows the optimal segments number and fuzzy classification for each observation by EM(expectation-maximization algorism). The finite mixture regression model is to unmix the sample, to identify the groups, and to estimate the parameters of the density function underlying the observed data within each group. The doctors were segmented into 5 groups by their preference for the compensation package. The results of this study imply that the utility of doctors increases with differentiated compensation package segmented by their preference.

  • PDF

Review on Probabilistic Seismic Hazard Analysis of Capable Faults (단층지진원 확률론적 지진재해도 분석에 관한 고찰)

  • 최원학;연관희;장천중
    • Proceedings of the Earthquake Engineering Society of Korea Conference
    • /
    • 2002.03a
    • /
    • pp.28-35
    • /
    • 2002
  • The probabilistic seismic hazard analysis for engineering needs several active fault parameters as input data. Fault slip rates, the segmentation model for each fault, and the date of the most recent large earthquake in seismic hazard analysis are the critical pieces of information required to characterize behavior of the faults. Slip rates provide a basis for calculating earthquake recurrence intervals. Segmentation models define potential rupture lengths and are inputs to earthquake magnitude. The most recent event is used in time-dependent probability calculations. These data were assembled by expert source-characterization groups consisting of geologists, geophysicists, and seismologists evaluating the information available for earth fault. The procedures to prepare inputs for seismic hazard are illustrated with possible segmentation scenarios of capable fault models and the seismic hazards are evaluated to see the implication of considering capable faults models.

  • PDF

Moving Object Tracking Method in Video Data Using Color Segmentation (칼라 분할 방식을 이용한 비디오 영상에서의 움직이는 물체의 검출과 추적)

  • 이재호;조수현;김회율
    • Proceedings of the IEEK Conference
    • /
    • 2001.06d
    • /
    • pp.219-222
    • /
    • 2001
  • Moving objects in video data are main elements for video analysis and retrieval. In this paper, we propose a new algorithm for tracking and segmenting moving objects in color image sequences that include complex camera motion such as zoom, pan and rotating. The Proposed algorithm is based on the Mean-shift color segmentation and stochastic region matching method. For segmenting moving objects, each sequence is divided into a set of similar color regions using Mean-shift color segmentation algorithm. Each segmented region is matched to the corresponding region in the subsequent frame. The motion vector of each matched region is then estimated and these motion vectors are summed to estimate global motion. Once motion vectors are estimated for all frame of video sequences, independently moving regions can be segmented by comparing their trajectories with that of global motion. Finally, segmented regions are merged into the independently moving object by comparing the similarities of trajectories, positions and emerging period. The experimental results show that the proposed algorithm is capable of segmenting independently moving objects in the video sequences including complex camera motion.

  • PDF

Market Segmentation and Purchase Behavior for Consumers Purchasing Korean Cultural Fashion Items - Focused on Inbound Japanese Tourists - (한국패션문화상품 소비자에 대한 시장세분화와 구매행동연구 - 방한 일본관광객을 중심으로 -)

  • Lee, Jin-Hwa
    • Fashion & Textile Research Journal
    • /
    • v.8 no.4
    • /
    • pp.427-432
    • /
    • 2006
  • The purpose of this study was 1) to segment the market of inbound Japanese tourists based on the importance of tour activity that tourists perceived and 2) to examine the behavior of each segmentation purchasing cultural fashion items in Korea. Data were collected using a self-administered questionnaire survey in Seoul. Clustering analysis, Chisquare, and ANOVA test were used to conduct the data analysis on 288 out of 400 questionnaires. The inbound Japanese tourists market was segmented into 3 groups; culture oriented group, shopping oriented group, and multi-activity group. Three groups were significantly different in terms of age, income, purchase amount, purchase criteria, and degree of shopping satisfaction. Marketing strategies for segmented markets were discussed.