• Title/Summary/Keyword: Matrix Factorization

Estimation of Source Apportionment of Ambient PM2.5 at Western Coastal IMPROVE Site in USA (미국 서부 해안 IMPROVE 측정소에 대한 대기 중 PM2.5의 오염원 기여도 추정)

  • Hwang, In-Jo;Kim, Dong-Sool;Hopke, Philip K.
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.24 no.1
    • /
    • pp.30-42
    • /
    • 2008
  • In this study, the chemical compositions of $PM_{2.5}$ samples collected at the Redwood National Park IMPROVE site in California from March 1988 to May 2004 were analyzed to identify sources and apportion their contributions. A total of 1,640 samples were collected, and 33 chemical species were analyzed by particle induced X-ray emission, proton elastic scattering analysis, photon induced X-ray fluorescence, ion chromatography, and thermal optical reflectance methods. Positive matrix factorization (PMF) was used to develop source profiles and to estimate their mass contributions. The PMF modeling identified five sources, and the average mass was apportioned to motor vehicle (35.8%, $1.58\;{\mu}g/m^3$), aged sea salt (23.2%, $1.02\;{\mu}g/m^3$), fresh sea salt (21.4%, $0.94\;{\mu}g/m^3$), wood/field burning (16.1%, $0.71\;{\mu}g/m^3$), and airborne soil (3.5%, $0.15\;{\mu}g/m^3$). To analyze local source impacts from various wind directions, conditional probability function (CPF) and nonparametric regression (NPR) analyses were performed using the source contribution results together with the wind direction values measured at the site. These results suggest that sources of $PM_{2.5}$ are also sources of visibility degradation, so source apportionment studies for $PM_{2.5}$ can also be used to understand visibility problems.
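
For readers unfamiliar with PMF, the model is usually written as the weighted least-squares problem below. This is the standard textbook formulation, not notation taken from the abstract: $x_{ij}$ is the concentration of species $j$ in sample $i$, $u_{ij}$ its uncertainty, $g_{ik}$ the contribution of source $k$ to sample $i$, $f_{kj}$ the fraction of species $j$ in the profile of source $k$, and $p$ the number of sources (five in this study, with $n = 1{,}640$ samples and $m = 33$ species).

```latex
X = G F + E, \qquad
Q = \sum_{i=1}^{n} \sum_{j=1}^{m}
    \left( \frac{x_{ij} - \sum_{k=1}^{p} g_{ik} f_{kj}}{u_{ij}} \right)^{2},
\qquad g_{ik} \ge 0, \; f_{kj} \ge 0
```

The non-negativity constraints are what distinguish PMF from an unconstrained factor analysis: a source can neither contribute negative mass nor have a negative species fraction.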

Estimation of Contribution by Pollutant Source of VOCs in Industrial Complexes of Gwangju Using Receptor Model (PMF) (수용모델(PMF)을 이용한 광주산업단지 VOCs의 오염원별 기여도 추정)

  • Park, Jin-Hwan;Park, Byoung-Hoon;Kim, Seung-Ho;Yang, Yoon-Cheol;Lee, Ki-Won;Bae, Seok-Jin;Song, Hyeong-Myeong
    • Journal of Environmental Science International
    • /
    • v.30 no.3
    • /
    • pp.219-234
    • /
    • 2021
  • Industrial emissions, mainly from industrial complexes, are important sources of ambient Volatile Organic Compounds (VOCs). Identification of the significant VOC sources from industrial complexes has practical significance for emission reduction. VOC samples were collected from July 2019 to June 2020. A Positive Matrix Factorization (PMF) receptor model was used to evaluate the VOC sources in the area. Four sources were identified by PMF analysis, including coating-1, coating-2, printing, and vehicle exhaust. The coating-1 source was revealed to have the highest contribution (41.5%), followed by coating-2 (23.9%), printing (23.1%), and vehicle exhaust (11.6%). The source showing the highest contribution was coating emissions, originating from the northwest to southwest of the sample site. It also relates to facilities that produce auto parts. The major components of VOC emissions from the coating facilities were toluene, m,p-xylene, ethylbenzene, o-xylene, and butyl acetate. Industrial emissions should be the top priority to meet the relevant control criteria, followed by vehicular emissions. This study provides a strategy for VOC source apportionment from an industrial complex, which is helpful in the development of targeted control strategies.

Personalized Size Recommender System for Online Apparel Shopping: A Collaborative Filtering Approach

  • Dongwon Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.8
    • /
    • pp.39-48
    • /
    • 2023
  • This study was conducted to provide a solution to the problem of sizing errors occurring in online purchases due to discrepancies and non-standardization in clothing sizes. This paper discusses an implementation approach for a machine learning-based recommender system capable of providing personalized sizes to online consumers. We trained multiple validated collaborative filtering algorithms including Non-Negative Matrix Factorization (NMF), Singular Value Decomposition (SVD), k-Nearest Neighbors (KNN), and Co-Clustering using purchasing data derived from online commerce and compared their performance. As a result of the study, we were able to confirm that the NMF algorithm showed superior performance compared to other algorithms. Despite the characteristic of purchase data that includes multiple buyers using the same account, the proposed model demonstrated sufficient accuracy. The findings of this study are expected to contribute to reducing the return rate due to sizing errors and improving the customer experience on e-commerce platforms.
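
The abstract does not describe how NMF was implemented, so the following is only a minimal sketch of the general idea, assuming multiplicative updates restricted to the observed entries of a customer-by-size score matrix. The function names, the toy matrix, and the convention that 0 means "not observed" are all hypothetical.

```python
import numpy as np

def masked_nmf(R, mask, rank=2, n_iter=200, eps=1e-9, seed=0):
    """Factor R ~= W @ H with W, H >= 0, fitting only entries where mask == 1."""
    rng = np.random.default_rng(seed)
    W = rng.random((R.shape[0], rank)) + 0.1  # customer factors (assumed layout)
    H = rng.random((rank, R.shape[1])) + 0.1  # size factors (assumed layout)
    for _ in range(n_iter):
        # Multiplicative updates keep every entry non-negative.
        W *= ((mask * R) @ H.T) / ((mask * (W @ H)) @ H.T + eps)
        H *= (W.T @ (mask * R)) / (W.T @ (mask * (W @ H)) + eps)
    return W, H

def masked_rmse(R, mask, W, H):
    """Root-mean-square error over the observed entries only."""
    diff = mask * (R - W @ H)
    return float(np.sqrt((diff ** 2).sum() / mask.sum()))

# Hypothetical toy data: rows are customers, columns are sizes,
# values are fit scores, and 0 marks a pair with no purchase record.
R = np.array([[5, 4, 1, 0],
              [4, 5, 0, 1],
              [1, 0, 5, 4],
              [0, 1, 4, 5]], dtype=float)
mask = (R > 0).astype(float)

W, H = masked_nmf(R, mask)
predicted = W @ H  # includes estimates for the unobserved pairs
```

The entries of `predicted` at positions where `mask` is 0 play the role of recommendations; a production system would tune `rank` and validate on held-out purchases.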

Evaluation of Endothelium-dependent Myocardial Perfusion Reserve in Healthy Smokers; Cold Pressor Test using $H_2^{15}O\;PET$ (흡연자에서 관상동맥 내피세포 의존성 심근 혈류 예비능: $H_2^{15}O\;PET$ 찬물자극 검사에 의한 평가)

  • Hwang, Kyung-Hoon;Lee, Dong-Soo;Lee, Byeong-Il;Lee, Jae-Sung;Lee, Ho-Young;Chung, June-Key;Lee, Myung-Chul
    • The Korean Journal of Nuclear Medicine
    • /
    • v.38 no.1
    • /
    • pp.21-29
    • /
    • 2004
  • Purpose: Much evidence suggests that long-term cigarette smoking alters the coronary vascular endothelial response. In this study, we applied nonnegative matrix factorization (NMF), an unsupervised learning algorithm, to CO-less $H_2^{15}O-PET$ to noninvasively investigate coronary endothelial dysfunction caused by smoking. Materials and methods: This study enrolled eighteen young male volunteers consisting of 9 smokers ($23.8{\pm}1.1$ yr; $6.5{\pm}2.5$ pack-years) and 9 non-smokers ($23.8{\pm}2.9$ yr). None had any cardiovascular risk factor or disease history. Myocardial $H_2^{15}O-PET$ was performed at rest, during cold ($5^{\circ}C$) pressor stimulation, and during adenosine infusion. The left ventricular blood pool and myocardium were segmented on dynamic PET data by the NMF method. Myocardial blood flow (MBF) was calculated from input and tissue functions by a single compartmental model with correction of partial volume and spillover effects. Results: There was no significant difference in resting MBF between the two groups (smokers: $1.43{\pm}0.41$ ml/g/min; non-smokers: $1.37{\pm}0.41$ ml/g/min; p=NS). During cold pressor stimulation, MBF in smokers was significantly lower than that in non-smokers ($1.25{\pm}0.34$ ml/g/min vs $1.59{\pm}0.29$ ml/g/min; p=0.019). The difference in the ratio of cold pressor MBF to resting MBF between the two groups was also significant (p=0.024; $90{\pm}24\%$ in smokers and $122{\pm}28\%$ in non-smokers). During adenosine infusion, however, hyperemic MBF did not differ significantly between smokers and non-smokers ($5.81{\pm}1.99$ ml/g/min vs $5.11{\pm}1.31$ ml/g/min; p=NS). Conclusion: In smokers, MBF during cold pressor stimulation was significantly lower than in non-smokers, reflecting smoking-induced endothelial dysfunction. However, there was no significant difference in MBF during adenosine-induced hyperemia between the two groups.

Hierrachical manner of motion parameters for sports video mosaicking (스포츠 동영상의 모자익을 위한 이동계수의 계층적 향상)

  • Lee, Jae-Cheol;Lee, Soo-Jong;Ko, Young-Hoon;Noh, Heung-Sik;Lee Wan-Ju
    • The Journal of Information Technology
    • /
    • v.7 no.2
    • /
    • pp.93-104
    • /
    • 2004
  • Sports scenes are characterized by a large amount of global motion due to camera pan and zoom, and they include many small objects moving independently. Some short periods of sports games are thrilling to viewers and important to producers. At the same time, these kinds of scenes exhibit exceptionally dynamic motion, which is very difficult to analyze with conventional algorithms. In this thesis, several algorithms are proposed for global motion analysis of these dynamic scenes. The proposed algorithms are shown to work well for motion compensation and panorama synthesis. When inter-frame motions are cascaded, accumulated errors are unavoidable. To minimize these errors, an interpolation method for motion vectors is introduced. An affine transform or a perspective projection transform is regarded as a square matrix, which can be factorized into a small number of motion vectors. To solve the factorization problem, we propose an adaptation of the Newton-Raphson method to vector and matrix form, which is also computationally efficient. By combining multi-frame motion estimation and the corresponding interpolation in a hierarchical manner, an enhancement algorithm for motion parameters is proposed that is suitable for motion compensation and panorama synthesis. The proposed algorithms are suitable for special-effect rendering in broadcast systems, video indexing, tracking in complex scenes, and other fields requiring global motion estimation.

Offline Friend Recommendation using Mobile Context and Online Friend Network Information based on Tensor Factorization (모바일 상황정보와 온라인 친구네트워크정보 기반 텐서 분해를 통한 오프라인 친구 추천 기법)

  • Kim, Kyungmin;Kim, Taehun;Hyun, Soon. J
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.8
    • /
    • pp.375-380
    • /
    • 2016
  • The proliferation of online social networking services (OSNSs) and smartphones has enabled people to easily make friends with a large number of users in online communities and to interact with each other. This has led to an increase in the usage rate of OSNSs. However, individuals who have immersed themselves in their digital lives, prioritizing the virtual world over the real one, become more and more isolated in the physical world. Thus, their socialization processes, which are undertaken only through many face-to-face interactions and much trial and error, are apt to be neglected because of 'Add Friend'-style functions in OSNSs. In this paper, we present a friend recommendation system based on online and offline contextual information so that OSNS users can have more serendipitous offline interactions. To accomplish this, we modeled offline information (i.e., place visit history) collected from a user's smartphone as a 3D tensor, and online social data (i.e., friend relationships) from Facebook as a matrix. We then recommended like-minded people and encouraged their offline interactions. We evaluated the users' satisfaction based on a real-world dataset collected from 43 users (12 on-campus users and 31 users randomly selected from Facebook friends of on-campus users).

An Intelligence Support System Research on KTX Rolling Stock Failure Using Case-based Reasoning and Text Mining (사례기반추론과 텍스트마이닝 기법을 활용한 KTX 차량고장 지능형 조치지원시스템 연구)

  • Lee, Hyung Il;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.47-73
    • /
    • 2020
  • KTX rolling stock is a system consisting of many machines, electrical devices, and components. Its maintenance requires considerable expertise and experience on the part of maintenance workers. In the event of a rolling stock failure, the knowledge and experience of the maintainer determine how quickly and how well the problem is solved, and thus the resulting availability of the vehicle varies. Although problem solving is generally based on fault manuals, experienced and skilled professionals can quickly diagnose faults and take action by applying personal know-how. Since this knowledge exists in tacit form, it is difficult to pass on completely to a successor, and previous studies have developed case-based rolling stock expert systems to turn it into data-driven knowledge. Nonetheless, research on KTX rolling stock, the most commonly used on the main line, and on systems that extract the meaning of text and search for similar cases is still lacking. Therefore, this study proposes an intelligent support system that provides an action guide for new failures by using the know-how of rolling stock maintenance experts as examples of problem solving. For this purpose, a case base was constructed by collecting rolling stock failure data generated from 2015 to 2017, and an integrated dictionary was constructed separately from the case base to include the essential terminology and failure codes, in consideration of the specialized nature of the railway rolling stock sector. Based on the deployed case base, a new failure was matched against past cases, and the three most similar failure cases were extracted so that the actual actions taken in those cases could be proposed as a diagnostic guide.
To compensate for the limitations of keyword-matching case search in previous case-based rolling stock failure expert system studies, this study applied various dimensionality reduction techniques to calculate similarity in a way that takes into account the semantic relationships among failure details, and verified their usefulness through experiments. Specifically, similar cases were retrieved by applying three algorithms, Non-negative Matrix Factorization (NMF), Latent Semantic Analysis (LSA), and Doc2Vec, to extract the characteristics of each failure and then measuring the cosine distance between the resulting vectors. Precision, recall, and F-measure were used to assess the performance of the proposed actions. To compare the dimensionality reduction techniques, they were evaluated alongside two baselines, an algorithm that randomly extracts failure cases with identical failure codes and an algorithm that applies cosine similarity directly to words, and analysis of variance confirmed that the performance differences among the five algorithms were statistically significant. In addition, optimal techniques for practical application were derived by verifying differences in performance depending on the number of dimensions used for dimensionality reduction. The analysis showed that direct word-based cosine similarity performed better than dimensionality reduction using NMF and LSA, and that the algorithm using Doc2Vec performed best. Furthermore, for the dimensionality reduction techniques, performance improved as the number of dimensions increased, up to an appropriate level.
Through this study, we confirmed the usefulness of effective methods for extracting the characteristics of data and converting unstructured data when applying case-based reasoning in the specialized field of KTX rolling stock, where most attributes are recorded as text. Text mining is increasingly studied for use in many areas, but studies using such text data are still lacking in environments with many specialized terms and limited access to data, such as the one addressed in this study. In this regard, it is significant that this study is the first to present an intelligent diagnostic system that suggests actions by searching for cases using text mining techniques that extract the characteristics of a failure, complementing keyword-based case search. It is expected that this will serve as a basic study for developing diagnostic systems that can be used immediately on site.
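
The retrieval step described above (embed each failure record as a vector, then return the three most similar past cases) can be sketched as follows. This is an illustration only: the vectors are hypothetical stand-ins for NMF, LSA, or Doc2Vec outputs, and `top_k_similar` is a made-up helper name, not part of the system described in the abstract.

```python
import numpy as np

def top_k_similar(case_vectors, query_vector, k=3):
    """Return (index, cosine similarity) for the k cases most similar to the query."""
    cases = np.asarray(case_vectors, dtype=float)
    query = np.asarray(query_vector, dtype=float)
    norms = np.linalg.norm(cases, axis=1) * np.linalg.norm(query)
    # A zero-length vector has a zero dot product, so dividing by 1 yields similarity 0.
    sims = (cases @ query) / np.where(norms == 0, 1.0, norms)
    order = np.argsort(-sims, kind="stable")[:k]
    return [(int(i), float(sims[i])) for i in order]

# Hypothetical 3-dimensional embeddings of five past failure cases.
past_cases = [[1.0, 0.0, 0.0],
              [0.9, 0.1, 0.0],
              [0.0, 1.0, 0.0],
              [0.5, 0.5, 0.0],
              [0.0, 0.0, 1.0]]
new_failure = [1.0, 0.0, 0.0]

matches = top_k_similar(past_cases, new_failure, k=3)
```

Each returned index would point back to a stored case whose recorded maintenance action can be shown to the maintainer as a suggested response.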

Estimate of Regional and Broad-based Sources for PM2.5 Collected in an Industrial Area of Japan

  • Nakatsubo, Ryouhei;Tsunetomo, Daisuke;Horie, Yosuke;Hiraki, Takatoshi;Saitoh, Katsumi;Yoda, Yoshiko;Shima, Masayuki
    • Asian Journal of Atmospheric Environment
    • /
    • v.8 no.3
    • /
    • pp.126-139
    • /
    • 2014
  • To estimate the influence of sources on $PM_{2.5}$ in an industrial area of Japan, we carried out a source analysis using chemical component data of $PM_{2.5}$. $PM_{2.5}$ samples were collected intermittently at an industrial area in Japan from July 2010 to November 2012. Water soluble ions ($Cl^-$, $NO_3{^-}$, $SO{_4}^{2-}$, $Na^+$, $NH_4{^+}$, $K^+$, $Mg^{2+}$, $Ca^{2+}$), elements (Al, K, Ca, Ti, V, Cr, Mn, Fe, Ni, Cu, Zn, As, Cd, Sb, Pb), and carbonaceous species (OC, EC) of the $PM_{2.5}$ (a total of 198 samples) were analyzed. A Positive Matrix Factorization (PMF) model was applied to the data of those chemical components to identify the sources of $PM_{2.5}$. At this observation site, nine factors were extracted. The major contributors of $PM_{2.5}$ were secondary sulfate 1, in which loading factors of $SO{_4}^{2-}$ and $NH_4{^+}$ were large (percentage source contribution: 20.9%), traffic, in which loading factors of OC (organic carbon) and EC (elemental carbon) were large (20.8%), secondary sulfate 2, in which loading factors of K and $SO{_4}^{2-}$ were large (8.0%), steel mills (7.8%), secondary chloride and nitrate (7.0%), soil (5.0%), heavy oil combustion (3.8%), sea salt (3.8%), and coal combustion (2.3%). Conditional probability function (CPF) and potential source contribution function (PSCF) analyses were carried out to examine the influence of a regional source and a broad-based source, respectively. CPF results supported local source influences such as steel mills, sea salt, traffic, coal combustion, and heavy oil combustion. PSCF results suggested that ships in the East China Sea, an industrial area of the east coastal region of China, and an active volcano in the Kyushu region of Japan were potential regional sources of secondary sulfate 1. Secondary sulfate 2 was affected by the burning of biomass fields and by coal combustion in Chinese urban areas such as Beijing, Hebei, and western Inner Mongolia.
Source characterization using continuous data from one site showed that a potential source representing fossil fuel combustion is affected by both regional and broad-based sources.
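
The conditional probability function used above is commonly computed per wind sector as the fraction of samples in that sector whose source contribution exceeds a threshold. The sketch below assumes the 75th percentile as the threshold and equal-width sectors; the abstract states neither choice, and the sample values are invented for illustration.

```python
import numpy as np

def cpf(contrib, wind_dir_deg, n_sectors=16, percentile=75):
    """CPF per sector: (# samples above threshold in sector) / (# samples in sector)."""
    contrib = np.asarray(contrib, dtype=float)
    wd = np.asarray(wind_dir_deg, dtype=float) % 360
    threshold = np.percentile(contrib, percentile)  # assumed criterion
    sector = (wd // (360 / n_sectors)).astype(int)
    n = np.bincount(sector, minlength=n_sectors)
    m = np.bincount(sector, weights=(contrib > threshold).astype(float),
                    minlength=n_sectors)
    # Sectors with no samples are reported as NaN rather than 0.
    return np.where(n > 0, m / np.maximum(n, 1), np.nan)

# Hypothetical source contributions and wind directions (degrees) for 8 samples.
contributions = [1, 1, 1, 1, 10, 10, 1, 1]
wind_dirs = [10, 20, 100, 110, 200, 210, 300, 310]

cpf_by_sector = cpf(contributions, wind_dirs, n_sectors=4)
```

A sector with a CPF value near 1 indicates a wind direction from which high contributions of that source tend to arrive, which is how local sources such as steel mills are located relative to the sampling site.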

Identification of Atmospheric PM10 Sources and Estimating Their Contributions to the Yongin-Suwon Bordering Area by Using PMF (PMF모델을 이용한 용인.수원 경계지역에서 PM10 오염원의 확인과 상대적 기여도의 추정)

  • Lee, Hyung-Woo;Lee, Tae-Jung;Yang, Sung-Su;Kim, Dong-Sool
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.24 no.4
    • /
    • pp.439-454
    • /
    • 2008
  • The purpose of this study was to extensively identify $PM_{10}$ sources and to estimate their contributions to the study area, based on the analysis of the $PM_{10}$ mass concentration and the associated inorganic elements, ions, and total carbon. The contribution of $PM_{10}$ sources was estimated by applying a receptor method, because identifying air emission sources is an effective way to control ambient air quality. $PM_{10}$ particles were collected from May to November 2007 in the Yongin-Suwon bordering area. $PM_{10}$ samples were collected on quartz filters by a $PM_{10}$ high-volume air sampler. The inorganic elements (Al, Mn, V, Cr, Fe, Ni, Cu, Zn, Cd, Pb, Si, Ba, Ti and Ag) were analyzed by ICP-AES after proper pretreatment of each sample. The ionic components of these $PM_{10}$ samples ($Cl^-$, $NO_3^-$, $SO_4^{2-}$, $Na^+$, $NH_4^+$, $K^+$, $Ca^{2+}$, and $Mg^{2+}$) were analyzed by IC. The carbon components (OC1, OC2, OC3, OC4, OP, EC1, EC2 and EC3) were also analyzed by a DRI/OGC analyzer. Source apportionment of $PM_{10}$ was performed using a positive matrix factorization (PMF) model. After PMF modeling, a total of 8 sources were identified and their contributions were estimated. Contributions from each emission source were as follows: 13.8% from oil combustion and industrial related source, 25.4% from soil source, 22.1% from secondary sulfate, 12.3% from secondary nitrate, 17.7% from auto emission including diesel (12.1%) and gasoline (5.6%), 3.1% from waste incineration, and 5.6% from Na-rich source. This study provides information on the major sources affecting air quality at the receptor site, and it will therefore help maintain and manage the ambient air quality in the Yongin-Suwon bordering area by establishing reliable control strategies for the related sources.

Identification of PM10 Chemical Characteristics and Sources and Estimation of their Contributions in a Seoul Metropolitan Subway Station (서울시 지하역사에서 PM10의 화학적 특성과 오염원의 확인 및 기여도 추정)

  • Park, Seul-Ba-Sen-Na;Lee, Tae-Jung;Ko, Hyun-Ki;Bae, Sung-Joon;Kim, Shin-Do;Park, Duckshin;Sohn, Jong-Ryeul;Kim, Dong-Sool
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.29 no.1
    • /
    • pp.74-85
    • /
    • 2013
  • Since the underground transportation system is a closed environment, indoor air quality problems may seriously affect many passengers' health. The purpose of this study was to understand $PM_{10}$ characteristics in the underground air environment and to quantitatively estimate $PM_{10}$ source contributions in a Seoul Metropolitan subway station. $PM_{10}$ was intensively collected on various filters with $PM_{10}$ aerosol samplers to obtain sufficient samples for chemical analysis. Sampling was carried out in the M station on Line 4 from April 21 to 28, July 13 to 21, and October 11 to 19, 2010, and from January 11 to 17, 2011. The aerosol filter samples were then analyzed for metals, water-soluble ions, and carbon components. The 29 chemical species (OC1, OC2, OC3, OC4, CC, PC, EC, Ag, Al, Ba, Cd, Cr, Cu, Fe, Mn, Ni, Pb, Si, Ti, V, Zn, $Cl^-$, $NO_3{^-}$, $SO_4{^{2-}}$, $Na^+$, $NH_4{^+}$, $K^+$, $Mg^{2+}$, $Ca^{2+}$) were analyzed using ICP-AES, IC, and TOR after proper pretreatment of each sample filter. Based on this chemical information, a positive matrix factorization (PMF) model was applied to identify the $PM_{10}$ sources, and six sources, such as biomass burning, outdoor, vehicle, soil and road dust, secondary aerosol, ferrous, and brakewear related source, were classified. The contribution rates of these sources in the tunnel were 4.0%, 5.8%, 1.6%, 17.9%, 13.8%, and 56.9%, in that order.