• 제목/요약/키워드: Matrix Factorization

검색결과 308건 처리시간 0.027초

Identification of pollutant sources and evaluation of water quality improvement alternatives of the Geum river

  • shiferaw, Natnael;Kim, Jaeyoung;Seo, Dongil
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2022년도 학술발표회
    • /
    • pp.475-475
    • /
    • 2022
  • The aim of this study is to identify the significant pollutant sources from the tributaries that are affecting the water quality of the study site, the Geum River and provide a solution to enhance the water quality. Multivariate statistical analysis modles such as cluster analysis, Principal component analysis (PCA) and positive matrix factorization (PMF) were applied to identify and prioritize the major pollutant sources of the two major tributaries, Gab-cheon and Miho-cheon, of the Geum River. PCA identifies three major pollutant sources for Gab-cheon and Miho-cheon, respectively. For Gab-cheon, wastewater treatment plant (WWTP), urban, and agricultural pollutions are identified as major pollutant sources. For Miho-cheon, agricultural, urban, and forest land are identified as major pollutant sources. On the contrary, PMF identifies three pollutant sources in Gab-cheon, same as PCA result and two pollutant sources in Miho-cheon. Water quality control scenarios are formulated and improvement of water quality in the river locations are simulated and analyzed with the Environmental Fluid Dynamic Code (EFDC) model. Scenario results were evaluated using a water quality index developed by Canadian Council of Ministers of the Environment. PCA and PMF appears to be effective to identify water pollution sources for the Geum river and also its tributaries in detail and thus can be used for the development of water quality improvement alternative of the above water bodies.

  • PDF

영화 추천 시스템을 위한 연구: 한계점 및 해결 방법 (Survey for Movie Recommendation System: Challenge and Problem Solution)

  • 초느에진랏;마리즈아길랄;무함마드 필다우스;강성원;이경현
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2022년도 춘계학술발표대회
    • /
    • pp.594-597
    • /
    • 2022
  • Recommendation systems are a prominent approach for users to make informed automated judgments. In terms of movie recommendation systems, there are two methods used; Collaborative filtering, which is based on user similarities; and Content-based filtering which takes into account specific user's activity. However, there are still issues with these two existing methods, and to address those, a combination of collaborative and content-based filtering is employed to produce a more effective system. In addition, various similarity methodologies are used to identify parallels among users. This paper focuses on a survey of the various tactics and methods to find solutions based on the problems of the current recommendation system.

일반화 신경망 협업필터링 (Generalized neural collaborative filtering)

  • 황인준;김희주;김유진;이윤동
    • 응용통계연구
    • /
    • 제37권3호
    • /
    • pp.311-322
    • /
    • 2024
  • 본 연구에서는, 추천시스템 연구에 자주 활용되는 무비렌즈 데이터에 대한 탐색적 분석을 통하여 무비렌즈 데이터의 자세한 특성을 살펴보고, 추천시스템에서 심층신경망을 이용한 협업필터링 (NCF) 방법으로 잘 알려진 신경망행렬분해법을 개선하기 위한 대안을 모색한다. 본 연구에서, 제안한 일반화 NCF (G-NCF) 방법은 기존의 NCF 방법에 비하여 주요 평가 지표에서 평균적으로 우수한 특성을 보이지만, 평가지표의 산포가 다소 커지는 단점도 함께 가진다. 성능 비교를 위한 평가 지표로 MAP와 nDCG 등을 이용하였다.

일반화 적응 심층 잠재요인 추천모형 (A Generalized Adaptive Deep Latent Factor Recommendation Model)

  • 김정하;이지평;장성현;조윤호
    • 지능정보연구
    • /
    • 제29권1호
    • /
    • pp.249-263
    • /
    • 2023
  • 대표적인 추천 시스템 방법론인 협업 필터링(Collaborative Filtering)에는 이웃기반 방법(Neighbor Methods)과 잠재 요인 모델(Latent Factor model)이라는 두 가지 접근법이 있다. 이중 행렬 분해(Matrix Factorization)를 이용하는 잠재 요인 모델은 사용자-아이템 상호작용 행렬을 두 개의 보다 낮은 차원의 직사각형 행렬로 분해하고 이들의 행렬 곱으로 아이템의 평점(Rating)을 예측한다. 평점 패턴으로부터 추출된 요인 벡터들을 통해 사용자와 아이템 속성을 포착할 수 있기 때문에 확장성, 정확도, 유연성 측면에서 이웃기반 방법보다 우수하다고 알려져 있다. 하지만 평점이 지정되지 않은 아이템에 대해서는 선호도가 다른 개개인의 다양성을 반영하지 못하는 근본적인 한계가 있고 이는 반복적이고 부정확한 추천을 초래하게 된다. 이러한 잠재요인 모델의 한계를 개선하고자 각각의 아이템 별로 사용자의 선호도를 적응적으로 학습하는 적응 심층 잠재요인 모형(Adaptive Deep Latent Factor Model; ADLFM)이 등장하였다. ADLFM은 아이템의 특징을 설명하는 텍스트인 아이템 설명(Item Description)을 입력으로 받아 사용자와 아이템의 잠재 벡터를 구하고 어텐션 스코어(Attention Score)를 활용하여 개인의 다양성을 반영할 수 있는 방법을 제시한다. 하지만 아이템 설명을 포함하는 데이터 셋을 요구하기 때문에 이 방법을 적용할 수 있는 대상이 많지 않은 즉 일반화에 있어 한계가 있다. 본 연구에서는 아이템 설명 대신 추천시스템에서 보편적으로 사용하는 아이템 ID를 입력으로 하고 Self-Attention, Multi-head attention, Multi-Conv1d 등 보다 개선된 딥러닝 모델 구조를 적용함으로써 ADLFM의 한계를 개선할 수 있는 일반화된 적응 심층 잠재요인 추천모형 G-ADLFRM을 제안한다. 다양한 도메인의 데이터셋을 가지고 입력과 모델 구조 변경에 대한 실험을 진행한 결과, 입력만 변경했을 경우 동반되는 정보손실로 인해 ADLFM 대비 MAE(Mean Absolute Error)가 소폭 높아지며 추천성능이 하락했지만, 처리할 정보량이 적어지면서 epoch 당 평균 학습속도는 대폭 향상되었다. 입력 뿐만 아니라 모델 구조까지 바꿨을 경우에는 가장 성능이 우수한 Multi-Conv1d 구조가 ADLFM과 유사한 성능을 나타내며 입력변경으로 인한 정보손실을 충분히 상쇄시킬 수 있음을 보여주었다. 결론적으로 본 논문에서 제시한 모형은 기존 ADLFM의 성능은 최대한 유지하면서 빠른 학습과 추론이 가능하고(경량화) 다양한 도메인에 적용할 수 있는(일반화) 새로운 모형임을 알 수 있다.

PMF 모델을 이용한 용인-수원경계지역에서의 부유분진의 크기별 오염원 확인 (Source Identification of Ambient Size-by-Size Particulate Using the Positive Matrix Factorization Model on the Border of Yongin and Suwon)

  • 오미석;이태정;김동술
    • 한국대기환경학회지
    • /
    • 제25권2호
    • /
    • pp.108-121
    • /
    • 2009
  • The suspended particulate matters have been collected on membrane filters and glass fiber filters by an 8-stage cascade impactor for 2 years (Sep. 2005${\sim}$Sep. 2007) in Kyung Hee University-Global Campus located on the border of Yongin and Suwon. The 20 chemical species (Al, Mn, Si, Fe, Cu, Pb, Cr, Ni, V, Cd, Ba, $Na^+$, ${NH_4}^+$, $K^+$, $Mg^{2+}$, $Ca^{2+}$, $Cl^-$, ${NO_3}^-$, and ${SO_4}^{2-}$) were analyzed by an ICP-AES and an IC after performing proper pre-treatments of each sample filter. Based on these chemical information, the PMF receptor model was applied to identify the source of ambient size-by-size particulate matters. The receptor modeling is the one of the statistical methods to achieve resonable air pollution management strategies. A total of 10 sources was identified in 9 size-ranges such as long-range transport, secondary aerosol, $NH_{4}NO_{3}$ related source, coal combustion, sea-salt, soil, oil combustion, auto emission, incineration, and biomass burning. Especially, the secondary aerosol source assorted in fine and coarse modes was intensively studied.

CHEMTAX 활용한 가막만 식물플랑크톤 군집조성 (Composition of Phytoplankton in Gamak Bay by CHEMTAX Analyses)

  • 오현택;김다정;이원찬;정래홍;홍석진;강양순;이용우
    • 한국환경과학회지
    • /
    • 제17권10호
    • /
    • pp.1155-1167
    • /
    • 2008
  • Chlorophyll a (chl a) has been used as an indicator for phytoplankton biomass in pelagic ecosystems due to the relative ease of measurement and selectivity for autotrophs in mixed plankton assemblages. However, the use of chi a as an indicator for phytoplankton biomass is restricted due to its inability to resolve taxonomic differences of phytoplankton and the highly variable relationship of chi a with phytoplankton. Here, we describe the analysis of High-Performance Liquid Chromatography (HPLC) photosynthetic pigment data using CHEMTAX, which is a matrix factorization program that uses chemical taxonomic indices (phytoplankton carotenoids) to quantify the abundance of phytoplankton groups. Compared to direct microscopic counting that can distinguish species within broad groups, the resolution of taxonomic groups by CHEMTAX is generally coarse. It can only distinguish between diatoms, dinoflagellates, cryptophytes, cyanobacteria, chlorophytes, prasinophytes, and haptophytes. However, CHEMTAX analysis is much faster and less expensive than microscopic counting methods. HPLC pigment observations were taken in the spring, summer, fall, and winter in$ 2005\sim2006$ within Gamak Bay, South Korea. CHEMTAX results revealed that diatoms were the dominant taxonomic group in Gamak Bay. In inner Gamak Bay, the ratio between diatoms and cryptophytes was $75\sim80%$, and the ratio between dinoflagellates and cryptophytes was $10\sim15%$. In outer Gamak Bay, the ratio between diatoms and cryptophytes was $85\sim90%$, and the ratio between dinflagellates and cryptophytes was only $1\sim5%$. The population structure was seasonal. Relative diatom populations were less in the summer than the winter season.

Robustness of Face Recognition to Variations of Illumination on Mobile Devices Based on SVM

  • Nam, Gi-Pyo;Kang, Byung-Jun;Park, Kang-Ryoung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제4권1호
    • /
    • pp.25-44
    • /
    • 2010
  • With the increasing popularity of mobile devices, it has become necessary to protect private information and content in these devices. Face recognition has been favored over conventional passwords or security keys, because it can be easily implemented using a built-in camera, while providing user convenience. However, because mobile devices can be used both indoors and outdoors, there can be many illumination changes, which can reduce the accuracy of face recognition. Therefore, we propose a new face recognition method on a mobile device robust to illumination variations. This research makes the following four original contributions. First, we compared the performance of face recognition with illumination variations on mobile devices for several illumination normalization procedures suitable for mobile devices with low processing power. These include the Retinex filter, histogram equalization and histogram stretching. Second, we compared the performance for global and local methods of face recognition such as PCA (Principal Component Analysis), LNMF (Local Non-negative Matrix Factorization) and LBP (Local Binary Pattern) using an integer-based kernel suitable for mobile devices having low processing power. Third, the characteristics of each method according to the illumination va iations are analyzed. Fourth, we use two matching scores for several methods of illumination normalization, Retinex and histogram stretching, which show the best and $2^{nd}$ best performances, respectively. These are used as the inputs of an SVM (Support Vector Machine) classifier, which can increase the accuracy of face recognition. Experimental results with two databases (data collected by a mobile device and the AR database) showed that the accuracy of face recognition achieved by the proposed method was superior to that of other methods.

2차원 객체 영상의 3차원 모델링을 위한 손실 특징점 보정 (Correction of Missing Feature Points for 3D Modeling from 2D object images)

  • 고성식
    • 한국정보통신학회논문지
    • /
    • 제19권12호
    • /
    • pp.2844-2851
    • /
    • 2015
  • 다수의 2차원 객체 영상으로부터 3차원 형상을 복원하는 방법은 컴퓨터 비젼 분야에서 널리 연구되고 있다. 복원된 3차원 형상의 정확도 개선을 위해서는 잡음 영향을 줄이거나 영상 프레임 수를 확보하는 것이 무엇보다 중요하다. 그렇지만 특징점 추정 시 잡음은 잠재적으로 내포되고, 관측행렬을 구성하는 영상 프레임 수는 특징점 추적 실패, 장애요소 또는 낮은 해상력 등에 의해 일반적으로 감소하게 된다. 그래서 잠음 환경 하에 손실된 특징점을 보다 정확히 보정하여 사용 가능한 영상 프레임 수를 확보하는 것이 필수적이다. 따라서 우리는 잡음 분포 하에서 기하학적 특성을 이용해 손실 특징점의 오차 거리와 방향을 직접 제어할 수 있는 분석적 접근방법을 제안한다. 제안한 방법의 우수성은 합성과 실제 객체에 대한 실험 결과를 통해서 검증한다.

Comparison of Source Apportionment of PM2.5 Using PMF2 and EPA PMF Version 2

  • Hwang, In-Jo;Hopke, Philip K.
    • Asian Journal of Atmospheric Environment
    • /
    • 제5권2호
    • /
    • pp.86-96
    • /
    • 2011
  • The positive matrix factorization (PMF2) and multilinear engine (ME2) models have been shown to be powerful environmental analysis techniques and have been successfully applied to the assessment of ambient particulate matter (PM) source contributions. Because these models are difficult to apply practically, the US EPA developed a more user-friendly version of the PMF. The initial version of the EPA PMF model does not provide any rotational capabilities; for this reason, the model was upgraded to include rotational functions in the EPA PMF ver. 2.0. In this study, PMF and EPA PMF modeling identified ten particulate matter sources including secondary sulfate I, vehicle gasoline, secondary sulfate II, secondary nitrate, secondary sulfate III, incinerators, aged sea salt, airborne soil particles, oil combustion, and diesel emissions. All of the source profiles determined by the two models showed excellent agreement. The calculated average concentrations of $PM_{2.5}$ were consistent between the PMF2 and EPA PMF ($17.94{\pm}0.30{\mu}g/m^3$ and $17.94{\pm}0.30\;{\mu}g/m^3$, respectively). Also, each set of estimated source contributions of the PMF2 and EPA PMF showed good agreement. The results from the new EPA PMF version applying rotational functions were consistent with those of PMF2. Therefore, the updated version of EPA PMF with rotational capabilities will provide more reasonable solutions compared with those of PMF2 and can be more widely applied to air quality management.

PMF 모델을 이용한 미세분진의 오염원 확인과 기여도 추정 : 탄소성분을 이용한 휘발유 및 경유차량 오염원의 분리 (Identifying Ambient PM2.5 Sources and Estimating their Contributions by Using PMF : Separation of Gasoline and Diesel Automobile Sources by Analyzing ECs and OCs)

  • 이형우;이태정;김동술
    • 한국대기환경학회지
    • /
    • 제25권1호
    • /
    • pp.75-89
    • /
    • 2009
  • The purpose of this study was to identify $PM_{2.5}$ sources and to estimate their contributions to the border of Yongin-Suwon area, based on the analysis of the $PM_{2.5}$ mass concentration and the associated inorganic elements, ions and carbon components. The contribution of $PM_{2.5}$ sources were estimated by using a positive matrix factorization (PMF) model to identify air emission sources. For this study, $PM_{2.5}$ samples were collected from May, 2007 to April, 2008. The inorganic elements were analyzed by an ICP-AES. The ionic components in $PM_{2.5}$ were analyzed by an Ie. The carbon components were also analyzed by DRI/OGC analyzer. After performing PMF modeling, a total of 12 sources were identified and their contributions were quantitatively estimated. The contributions from each emission source were as follows: 11.3% from oil combustion source, 3.4% from bus/highway source, 5.8% from diesel vehicle source, 4.7% from gasoline vehicle source, 8.8% from biomass burning source, 15.1 % from secondary sulfate, 5.2% from secondary nitrate source, 13.4% from industrial related source, 4.1% from Cl-rich source, 19.6% from soil related source, 1.0% from aged sea salt, and 7.4% from coal combustion source, respectively. This study provides basic information on the major sources affecting air quality, and then it will help to effectively control $PM_{2.5}$ in this study area.