• Title/Summary/Keyword: Non-Negative Matrix Factorization

Search Result 103, Processing Time 0.033 seconds

Font Classification using NMF and EMD (NMF와 EMD를 이용한 영문자 활자체 폰트분류)

  • Lee, Chang-Woo;Kang, Hyun;Jung, Kee-Chul;Kim, Hang-Joon
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.04b
    • /
    • pp.688-690
    • /
    • 2004
  • 최근 전자화된 문서 영상을 효율적으로 관리하고 검색하기 위한 문서구조분석 방법과 문서의 자동 분류에 관한 많은 연구가 발표되고 있다. 본 논문에서는 NMF(non-negative matrix factorization) 알고리즘을 사용하여 폰트를 자동으로 분류하는 방법을 제안한다. 제안된 방법은 폰트의 구분 특징들이 공간적으로 국부성을 가지는 부분으로 표현될 수 있다는 가정을 바탕으로, 전체의 폰트 이미지들로부터 각 폰트들의 구분 특징인 부분을 학습하고, 학습된 부분들을 특징으로 사용하여 폰트를 분류하는 방법이다. 학습된 폰트의 특징들은 계층적 군집화 알고리즘을 이용하여 템플릿을 생성하고, 테스트 패턴을 분류하기 위하여 템플릿 패턴과의 EMD(earth mover's distance)를 사용한다. 실험결과에서 폰트 이미지들의 공간적으로 국부적인 특징들이 조사되고, 그 특징들의 폰트 식별을 위한 적절성을 보였다. 제안된 방법이 기존의 문자인식. 문서 검색 시스템들의 전처리기로 사용되면. 그 시스템들의 성능을 향상시킬 것으로 기대된다.

  • PDF

Imaging and analysis of genetically encoded calcium indicators linking neural circuits and behaviors

  • Oh, Jihae;Lee, Chiwoo;Kaang, Bong-Kiun
    • The Korean Journal of Physiology and Pharmacology
    • /
    • v.23 no.4
    • /
    • pp.237-249
    • /
    • 2019
  • Confirming the direct link between neural circuit activity and animal behavior has been a principal aim of neuroscience. The genetically encoded calcium indicator (GECI), which binds to calcium ions and emits fluorescence visualizing intracellular calcium concentration, enables detection of in vivo neuronal firing activity. Various GECIs have been developed and can be chosen for diverse purposes. These GECI-based signals can be acquired by several tools including two-photon microscopy and microendoscopy for precise or wide imaging at cellular to synaptic levels. In addition, the images from GECI signals can be analyzed with open source codes including constrained non-negative matrix factorization for endoscopy data (CNMF_E) and miniscope 1-photon-based calcium imaging signal extraction pipeline (MIN1PIPE), and considering parameters of the imaged brain regions (e.g., diameter or shape of soma or the resolution of recorded images), the real-time activity of each cell can be acquired and linked with animal behaviors. As a result, GECI signal analysis can be a powerful tool for revealing the functions of neuronal circuits related to specific behaviors.

Personalized Size Recommender System for Online Apparel Shopping: A Collaborative Filtering Approach

  • Dongwon Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.8
    • /
    • pp.39-48
    • /
    • 2023
  • This study was conducted to provide a solution to the problem of sizing errors occurring in online purchases due to discrepancies and non-standardization in clothing sizes. This paper discusses an implementation approach for a machine learning-based recommender system capable of providing personalized sizes to online consumers. We trained multiple validated collaborative filtering algorithms including Non-Negative Matrix Factorization (NMF), Singular Value Decomposition (SVD), k-Nearest Neighbors (KNN), and Co-Clustering using purchasing data derived from online commerce and compared their performance. As a result of the study, we were able to confirm that the NMF algorithm showed superior performance compared to other algorithms. Despite the characteristic of purchase data that includes multiple buyers using the same account, the proposed model demonstrated sufficient accuracy. The findings of this study are expected to contribute to reducing the return rate due to sizing errors and improving the customer experience on e-commerce platforms.

Target Speaker Speech Restoration via Spectral bases Learning (주파수 특성 기저벡터 학습을 통한 특정화자 음성 복원)

  • Park, Sun-Ho;Yoo, Ji-Ho;Choi, Seung-Jin
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.3
    • /
    • pp.179-186
    • /
    • 2009
  • This paper proposes a target speech extraction which restores speech signal of a target speaker form noisy convolutive mixture of speech and an interference source. We assume that the target speaker is known and his/her utterances are available in the training time. Incorporating the additional information extracted from the training utterances into the separation, we combine convolutive blind source separation(CBSS) and non-negative decomposition techniques, e.g., probabilistic latent variable model. The nonnegative decomposition is used to learn a set of bases from the spectrogram of the training utterances, where the bases represent the spectral information corresponding to the target speaker. Based on the learned spectral bases, our method provides two postprocessing steps for CBSS. Channel selection step finds a desirable output channel from CBSS, which dominantly contains the target speech. Reconstruct step recovers the original spectrogram of the target speech from the selected output channel so that the remained interference source and background noise are suppressed. Experimental results show that our method substantially improves the separation results of CBSS and, as a result, successfully recovers the target speech.

Robustness of Face Recognition to Variations of Illumination on Mobile Devices Based on SVM

  • Nam, Gi-Pyo;Kang, Byung-Jun;Park, Kang-Ryoung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.4 no.1
    • /
    • pp.25-44
    • /
    • 2010
  • With the increasing popularity of mobile devices, it has become necessary to protect private information and content in these devices. Face recognition has been favored over conventional passwords or security keys, because it can be easily implemented using a built-in camera, while providing user convenience. However, because mobile devices can be used both indoors and outdoors, there can be many illumination changes, which can reduce the accuracy of face recognition. Therefore, we propose a new face recognition method on a mobile device robust to illumination variations. This research makes the following four original contributions. First, we compared the performance of face recognition with illumination variations on mobile devices for several illumination normalization procedures suitable for mobile devices with low processing power. These include the Retinex filter, histogram equalization and histogram stretching. Second, we compared the performance for global and local methods of face recognition such as PCA (Principal Component Analysis), LNMF (Local Non-negative Matrix Factorization) and LBP (Local Binary Pattern) using an integer-based kernel suitable for mobile devices having low processing power. Third, the characteristics of each method according to the illumination va iations are analyzed. Fourth, we use two matching scores for several methods of illumination normalization, Retinex and histogram stretching, which show the best and $2^{nd}$ best performances, respectively. These are used as the inputs of an SVM (Support Vector Machine) classifier, which can increase the accuracy of face recognition. Experimental results with two databases (data collected by a mobile device and the AR database) showed that the accuracy of face recognition achieved by the proposed method was superior to that of other methods.

Enhancing Document Clustering Method using Synonym of Cluster Topic and Similarity (군집 주제의 유의어와 유사도를 이용한 문서군집 향상 방법)

  • Park, Sun;Kim, Kyung-Jun;Lee, Jin-Seok;Lee, Seong-Ro
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.48 no.5
    • /
    • pp.30-38
    • /
    • 2011
  • This paper proposes a new enhancing document clustering method using a synonym of cluster topic and the similarity. The proposed method can well represent the inherent structure of document cluster set by means of selecting terms of cluster topic based on the semantic features by NMF. It can solve the problem of "bags of words" by using of expanding the terms of cluster topics which uses the synonyms of WordNet. Also, it can improve the quality of document clustering which uses the cosine similarity between the expanded cluster topic terms and document set to well cluster document with respect to the appropriation cluster. The experimental results demonstrate that the proposed method achieves better performance than other document clustering methods.

Enhancing Red Tide Image Recognition using NMF and Image Revision (NMF와 이미지 보정을 이용한 적조 이미지 인식 향상)

  • Park, Sun;Lee, Seong-Ro
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.2
    • /
    • pp.331-336
    • /
    • 2012
  • Red tide is a temporary natural phenomenon involving harmful algal blooms (HABs) in company with a changing sea color from normal to red or reddish brown, and which has a bad influence on coast environments and sea ecosystems. The HABs have inflicted massive mortality on fin fish and shellfish, damaging the economies of fisheries for almost every year from 1990 in South Korea. There have been many studies on red tide due to increasing damage from red tide on fishing and aquaculture industry. However, internal study of automatic red tide image classification is not enough. Especially, extraction of matching center features for recognizing algae image object is difficult because over 200 species of algae in the world have a different size and features. Previously studies used a few type of red tide algae for image classification. In this paper, we proposed the red tide image recognition method using NMF and revison of rotation angle for enhancing of recognition of red tide algae image.

Characterization of Five Shu Acupoint Pattern in Saam Acupuncture Using Text Mininig (텍스트마이닝을 통한 사암침법 오수혈 사용 패턴 분석)

  • Park, In-Soo;Jung, Won-Mo;Lee, Ye-Seul;Hahm, Dae-Hyun;Park, Hi-Joon;Chae, Younbyoung
    • Korean Journal of Acupuncture
    • /
    • v.32 no.2
    • /
    • pp.66-74
    • /
    • 2015
  • Background : Saam acupuncture were composed by applying the elemental concepts from the Five Phase theory - the relationships between the cycles such as Saeng(Sheng, 'nourishing' or 'creating') and Geuk(Ke, 'suppressing' or 'controlling') - onto the Five Phase points and 12 channels to compensate for the imbalance in each of the 12 main energy traits. Objective : The present study is aimed to find out the characteristics of Five Phase points pattern in Saam acupuncture. Methods : We analysed the characteristics of five elements of the Five Phase points in Korean medical texts such as Saamdoinchimguyogyeol, Dongeuibogam and Chimgugyeongheombang in mid Chosun Dynasty. Using non-negative factorization(NNMF) methods, we extracted the feature matrix of five elements of Five Phase points in each classic medical text. Results : In Saam acupuncture, two characteristics were most prominent: (1) "Self" component of Five elements, (2) "Mother" and "Grandmother" component of Five elements. Conclusions : Saam acupuncture used the combination of Five-Shu acupoint based on ZangFu pattern identification. Our findings suggest that grasping the characteristics of Five Phase points combinations can improve the understanding the selection of the relevant acupoints based on the ZangFu pattern identifications.

Enhancing Document Clustering using Important Term of Cluster and Wikipedia (군집의 중요 용어와 위키피디아를 이용한 문서군집 향상)

  • Park, Sun;Lee, Yeon-Woo;Jeong, Min-A;Lee, Seong-Ro
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.49 no.2
    • /
    • pp.45-52
    • /
    • 2012
  • This paper proposes a new enhancing document clustering method using the important terms of cluster and the wikipedia. The proposed method can well represent the concept of cluster topics by means of selecting the important terms in cluster by the semantic features of NMF. It can solve the problem of "bags of words" to be not considered the meaningful relationships between documents and clusters, which expands the important terms of cluster by using of the synonyms of wikipedia. Also, it can improve the quality of document clustering which uses the expanded cluster important terms to refine the initial cluster by re-clustering. The experimental results demonstrate that the proposed method achieves better performance than other document clustering methods.

Medical Data Based Clinical Pathway Analysis and Automatic Ganeration System (임상데이터기반 표준진료지침 자동 생성 시스템 분석 및 연구)

  • Park, Hanna;Bae, In Ho;Kim, Yong Oock
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.39C no.6
    • /
    • pp.497-502
    • /
    • 2014
  • In general, all physicians have some standardized diagnosis and treatment methods. However, there are differences in the precise order and examination depending on the hospital size, system, medical equipment, etc. To reduce this difference, the interest about standardized guidelines recently increased and a variety of research is being conducted. The uniform guideline cannot reflect the differences of each situation and environment to meet the hospitals. Therefore, standardized medical guidelines(=clinical pathway) should provide customized guidelines based on the relevant medical data to ensure the quality of the medical service and the doctor's autonomy. In this paper, we will analyze medical data made by two thyroid specialists in the same hospitals. Moreover, this paper mentions the implement of automatic generating clinical pathway system which consider its real hospital situation and result.