• Title/Summary/Keyword: Sparse data

Search Result 413, Processing Time 0.027 seconds

PARAFAC Tensor Reconstruction for Recommender System based on Apache Spark (아파치 스파크에서의 PARAFAC 분해 기반 텐서 재구성을 이용한 추천 시스템)

  • Im, Eo-Jin;Yong, Hwan-Seung
    • Journal of Korea Multimedia Society / v.22 no.4 / pp.443-454 / 2019
  • In recent years, there has been active research on recommender systems that consider three or more inputs in addition to users and goods, which turns the data into a multi-dimensional array, also known as a tensor. The main issue with using a tensor is that it has many missing values, making it sparse. To address this, a tensor decomposition algorithm can shrink the tensor into lower-dimensional arrays called factor matrices. The tensor is then reconstructed from the factor matrices so that the originally empty cells are filled with predicted values. This is called tensor reconstruction. In this paper, we propose a user-based Top-K recommender system built on normalized PARAFAC tensor reconstruction. The method factorizes a tensor into factor matrices and then reconstructs the tensor from them. Before decomposition, the original tensor is normalized along each dimension to reduce overfitting. Using a real-world dataset, this paper demonstrates the processing of a large amount of data and implements a recommender system based on Apache Spark. In addition, this study confirms that recommendation performance improves when the tensor is normalized.
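The reconstruction step described in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' Spark implementation: it assumes the factor matrices have already been computed, omits the normalization step, and uses hypothetical names (`reconstruct_cp`, `top_k_items`, a third mode called `context`) and made-up toy values.

```python
import numpy as np

def reconstruct_cp(A, B, C):
    """Rebuild a 3-way tensor from CP/PARAFAC factor matrices.

    A, B, C have shapes (I, R), (J, R), (K, R); entry (i, j, k) of the
    result is sum_r A[i, r] * B[j, r] * C[k, r].
    """
    return np.einsum("ir,jr,kr->ijk", A, B, C)

def top_k_items(X_hat, observed, user, context, k):
    """Return the k item indices with the highest predicted score for one
    (user, context) pair, skipping cells that were already observed."""
    scores = X_hat[user, :, context].astype(float).copy()
    scores[observed[user, :, context]] = -np.inf  # never re-recommend
    order = np.argsort(-scores, kind="stable")    # descending by score
    return order[:k].tolist()

# Toy factors (hypothetical values, rank R = 2): 2 users, 3 items, 2 contexts.
A = np.array([[1.0, 0.0], [0.0, 1.0]])
B = np.array([[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]])
C = np.array([[1.0, 1.0], [0.5, 2.0]])

X_hat = reconstruct_cp(A, B, C)
observed = np.zeros(X_hat.shape, dtype=bool)
observed[0, 0, 0] = True  # user 0 already rated item 0 in context 0

print(top_k_items(X_hat, observed, user=0, context=0, k=2))
```

Every cell of `X_hat` holds a predicted value, so the originally empty cells can be ranked directly; masking observed cells keeps known items out of the Top-K list.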

A Network Analysis of Authors and Keywords from North Korean Traditional Medicine Journal, Koryo Medicine (북한 고려의학 학술 저널에 대한 저자 및 키워드 네트워크 분석)

  • Oh, Junho;Yi, Eunhee;Lee, Juyeon;Kim, Dongsu
    • Journal of Society of Preventive Korean Medicine / v.25 no.2 / pp.33-43 / 2021
  • Objectives: This study seeks to grasp the current status of Koryo medical research in North Korea by focusing on researchers and research topics. Methods: A network analysis was conducted on co-authors and keywords extracted from Koryo Medicine, a North Korean traditional medicine journal. Results: The author network was sparse because of the low correlation between authors. The domain-wide network density of co-authors was 0.001, with a diameter of 14, an average distance between nodes of 4.029, and an average binding coefficient of 0.029. In the keyword network analysis, the keyword "traditional medicine" had the strongest correlation weight, 228. Other keywords with high correlation weights were common acupuncture (84) and intradermal acupuncture (80). Conclusions: Although the co-authors of Koryo Medicine did not have a high correlation with each other, key researchers considered important for each major sub-network could be identified. In addition, the keywords of the Koryo Medicine journal had a very high linkage to herbal medicines.
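As a rough illustration of the density figure reported above, the sketch below builds co-author edges from per-paper author lists and computes the standard density of a simple undirected graph, 2E / (N(N - 1)). Whether the study used exactly this definition is an assumption, and the function names and toy author lists are hypothetical.

```python
from itertools import combinations

def co_author_edges(papers):
    """Yield one undirected edge per pair of authors who share a paper."""
    for authors in papers:
        # sorted(set(...)) drops duplicate names within one paper
        for a, b in combinations(sorted(set(authors)), 2):
            yield (a, b)

def network_density(num_nodes, edges):
    """Density of a simple undirected graph: 2E / (N (N - 1))."""
    if num_nodes < 2:
        return 0.0
    # frozenset makes (a, b) and (b, a) the same edge; self-loops are skipped
    unique = {frozenset(e) for e in edges if e[0] != e[1]}
    return 2 * len(unique) / (num_nodes * (num_nodes - 1))

# Toy data (hypothetical): three papers, five distinct authors.
papers = [["A", "B", "C"], ["C", "D"], ["E"]]
authors = {name for paper in papers for name in paper}
density = network_density(len(authors), co_author_edges(papers))
print(round(density, 3))
```

A density near 0.001, as in the abstract, means that only about one in a thousand possible author pairs ever published together.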

Case-Related News Filtering via Topic-Enhanced Positive-Unlabeled Learning

  • Wang, Guanwen;Yu, Zhengtao;Xian, Yantuan;Zhang, Yu
    • Journal of Information Processing Systems / v.17 no.6 / pp.1057-1070 / 2021
  • Case-related news filtering is crucial in legal text mining and divides news into case-related and case-unrelated categories. Because case-related news originates from various fields and has different writing styles, it is difficult to establish complete filtering rules or keywords for data collection. In addition, the labeled corpus for case-related news is sparse; therefore, to train a high-performance classification model, it is necessary to annotate the corpus. To address this challenge, we propose topic-enhanced positive-unlabeled learning, which selects positive and negative samples guided by topics. Specifically, a topic model based on a variational autoencoder (VAE) is trained to extract topics from unlabeled samples. By using these topics in the iterative process of positive-unlabeled (PU) learning, the accuracy of identifying case-related news can be improved. From the experimental results, it can be observed that the F1 value of our method on the test set is 1.8% higher than that of the PU learning baseline model. In addition, our method is more robust with low initial samples and high iterations, and compared with advanced PU learning baselines such as nnPU and I-PU, we obtain a 1.1% higher F1 value, which indicates that our method can effectively identify case-related news.

Therapeutics in the Treatment of COVID-19 for Children and Adolescents (소아청소년 코로나바이러스감염증-19의 치료: 치료 약제를 중심으로)

  • Choi, Soo-Han;Choi, Jae Hong;Yun, Ki Wook
    • Pediatric Infection and Vaccine / v.29 no.1 / pp.1-15 / 2022
  • Coronavirus disease 2019 (COVID-19) presents as a mild-to-moderate respiratory illness in most children. However, a small proportion of children with COVID-19 develop severe or critical illnesses. Although pediatric clinical trials for the treatment of COVID-19 are sparse, some drugs are available for children and adolescents with severe COVID-19. This review summarizes clinical data focusing on antiviral agents and immunomodulators for use in treating COVID-19. In addition, current recommendations for therapeutics for children and adolescents with COVID-19 are discussed.

Nexus Between Brand Transgression and Brand Forgiveness Among Islamic Banking Customers in Malaysia

  • ABD RASHID, Muhammad Hafiz;HAMZAH, Muhammad Iskandar;MUHAMAT, Amirul Afif;MANSOR, Aida Azlina;HASANORDIN, Rahayu
    • The Journal of Asian Finance, Economics and Business / v.9 no.4 / pp.381-389 / 2022
  • Studies examining the interplay between brand transgression and brand forgiveness are notably sparse, especially in the context of Southeast Asian banking customers. The purpose of this research is to add to the existing literature by examining the impact of brand transgression, represented by negative past experience, image incongruence, and corporate wrongdoing, on brand forgiveness among Islamic banking customers in Malaysia. The surge of interest in unfavorable brand relationships has sparked concerns about its impact on brand forgiveness. As a result, this theoretical argument, which lacks empirical proof, has to be statistically tested. The study used a non-probability purposive sampling technique among clients in the Klang Valley who had poor experiences with Islamic banking services. Data analysis included descriptive statistics, exploratory factor analysis, and multiple regression on a total of 211 valid responses. The findings show that two elements of brand transgression, image incongruence and corporate wrongdoing, have a major impact on brand forgiveness. However, the other dimension, negative past experience, was found to be non-significant for brand forgiveness. Research implications and directions for future studies are also discussed in this paper.

Shear resistance of steel-concrete-steel deep beams with bidirectional webs

  • Guo, Yu-Tao;Nie, Xin;Fan, Jian-Sheng;Tao, Mu-Xuan
    • Steel and Composite Structures / v.42 no.3 / pp.299-313 / 2022
  • Steel-concrete-steel composite structures with bidirectional webs (SCSBWs) are used in large-scale projects and exhibit good mechanical performance and constructional efficiency. The shear behavior of SCSBW deep beam members in key joints or in locations subjected to concentrated forces is of concern in design. To address this issue, an experimental program is conducted to examine the deep-beam shear behavior of SCSBWs, in which the cracking process and force transfer mechanism are revealed. Compared with the previously proposed truss model, a strut-and-tie model is found to be more suitable for describing the shear mechanism of SCSBW deep beams with a short span and sparse transverse webs. Based on the experimental analyses, a new model is proposed to predict the shear capacities of SCSBW deep beams. This model uses the strut-and-tie concept and introduces web shear and dowel action to account for the multiple coupled mechanisms. A stress decomposition method is used to distinguish the contributions of different shear-transferring paths. Based on case studies, a simplified model is further developed, and an explicit solution is derived for design efficiency. The proposed models are verified against experimental data and are shown to have good accuracy and efficiency and to be suitable for practical application.

Gaussian models for bond strength evaluation of ribbed steel bars in concrete

  • Prabhat R., Prem;Branko, Savija
    • Structural Engineering and Mechanics / v.84 no.5 / pp.651-664 / 2022
  • A precise prediction of the ultimate bond strength between rebar and surrounding concrete plays a major role in structural design, as it significantly affects the load-carrying capacity and serviceability of a member. In the present study, Gaussian models are employed for modelling the bond strength of ribbed steel bars embedded in concrete. Gaussian models offer a non-parametric method based on a Bayesian framework which is powerful, versatile, robust and accurate. Five different Gaussian models are explored in this paper: Gaussian Process (GP), Variational Heteroscedastic Gaussian Process (VHGP), Warped Gaussian Process (WGP), Sparse Spectrum Gaussian Process (SSGP), and Twin Gaussian Process (TGP). The effectiveness of the models is also evaluated in comparison to the numerous design formulae provided by the codes. The predictions from the Gaussian models are found to be closer to the experiments than those obtained using the design equations provided in various codes. The sensitivity of the models to various parameters, input feature space and sampling is also presented. It is found that GP, VHGP and SSGP are effective in predicting the bond strength. For large data sets, GP, VHGP, WGP and TGP can be computationally expensive. In such cases, SSGP can be utilized.

REAL-TIME 3D MODELING FOR ACCELERATED AND SAFER CONSTRUCTION USING EMERGING TECHNOLOGY

  • Jochen Teizer;Changwan Kim;Frederic Bosche;Carlos H. Caldas;Carl T. Haas
    • International conference on construction engineering and project management / 2005.10a / pp.539-543 / 2005
  • The research presented in this paper enables real-time 3D modeling to help make construction processes ultimately faster, more predictable and safer. Initial research efforts used an emerging sensor technology and proved its usefulness in acquiring range information for the detection and efficient representation of static and moving objects. Based on the time-of-flight principle, the sensor acquires range and intensity information for each image pixel within its entire field of view in real time at frequencies of up to 30 Hz. However, real-time range data processing algorithms still need to be developed to rapidly turn range information into meaningful 3D computer models. This research ultimately focuses on the application of safer heavy equipment operation. The paper compares (a) a previous research effort in convex hull modeling using sparse range point clouds from a single laser beam range finder with (b) high-frame-rate Flash LADAR (Laser Detection and Ranging) scanning for complete scene modeling. The presented research will demonstrate whether Flash LADAR technology can play an important role in real-time modeling of infrastructure assets in the near future.


The Role of Local Government in Improving Resilience and Performance of Small and Medium-Sized Enterprises in Indonesia

  • TANEO, Stefanus Yufra M.;NOYA, Sunday;MELANY, Melany;SETIYATI, Etsa Astridya
    • The Journal of Asian Finance, Economics and Business / v.9 no.3 / pp.245-256 / 2022
  • During the COVID-19 pandemic, several studies focused on financial programs and SMEs' performance, but research on the relationship between non-financial programs, resilience, and SMEs' performance is still sparse. This study fills the gap by analyzing the role of local government in increasing SME resilience and performance by purchasing products from SMEs (through civil servants) and by facilitating online training for SMEs. This study also investigates the role of local government in strengthening the relationship between resilience and SME performance. Data were collected using an online questionnaire distributed to SMEs in Malang Regency. A total of 410 questionnaires were received and were eligible for statistical analysis using WarpPLS. The results show that resilience is positively and significantly related to the performance of SMEs. The local government programs improve SME performance both directly and indirectly through resilience. However, the programs were not shown to strengthen the relationship between resilience and SME performance, indicating that the appropriate role of government in developing countries such as Indonesia is "rowing rather than steering", not "steering rather than rowing".

A development of grid-based spatial downscaling for climate change assessment in regions with sparse ground data networks (미계측 지역 기후변화 평가를 위한 격자 기반 통계적 상세화 기법 개발)

  • Kim, Yong-Tak;Jung, Min-Kyu;Kim, Min-Ji;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference / 2021.06a / pp.41-41 / 2021
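  • Numerous studies have confirmed that, under the rapidly growing influence of climate change worldwide, natural disasters caused by abnormal weather are increasing in both intensity and frequency, and research on measures to prepare for and respond to them has emerged as one of the most important topics globally. In Korea, the seriousness of climate change is increasingly recognized, but consistent and integrated climate information that can be used directly to build a national response framework and to support water resources policy decisions is lacking. Climate models, which represent future weather variability, are known to describe global large-scale climate patterns relatively accurately, but statistical downscaling is essential because of the spatial-temporal bias and uncertainty inherent in the models. This bias is generally corrected by interpolating ground observations onto a grid, but discontinuities in the observation records and the uneven distribution of stations make the result spatially unreliable. This study therefore develops Bayesian Kriging-QDM, a new grid-based statistical downscaling model that couples spatial bias correction through Bayesian Kriging with quantile delta mapping (QDM). The results are expected to serve as baseline data that can be used directly in decisions that reduce the risks of the existing conservative water resources management system, which relies on historical data.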
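The quantile delta mapping (QDM) component named in this abstract can be sketched for a single grid cell as follows. This is a generic additive QDM using empirical quantiles, not the authors' Bayesian Kriging-QDM model: the Kriging step is omitted, and the function name, argument names, and synthetic data are assumptions made for illustration.

```python
import numpy as np

def quantile_delta_mapping(obs_hist, mod_hist, mod_proj):
    """Additive QDM sketch for one grid cell.

    obs_hist: observed values in the historical period
    mod_hist: model values in the same historical period
    mod_proj: model values in the projection period (to be corrected)
    """
    obs_hist = np.asarray(obs_hist, dtype=float)
    mod_hist = np.asarray(mod_hist, dtype=float)
    mod_proj = np.asarray(mod_proj, dtype=float)

    # Empirical non-exceedance probability of each projected value,
    # taken from its rank within the projection period itself.
    ranks = np.argsort(np.argsort(mod_proj, kind="stable"), kind="stable")
    tau = (ranks + 0.5) / mod_proj.size

    # Model-projected change at that quantile (the "delta").
    delta = mod_proj - np.quantile(mod_hist, tau)

    # Bias-corrected value: observed quantile plus the preserved delta.
    return np.quantile(obs_hist, tau) + delta

# Synthetic example (made-up numbers): the model runs 2 units too cold.
rng = np.random.default_rng(0)
mod_hist = rng.normal(10.0, 3.0, size=200)
obs_hist = mod_hist + 2.0
mod_proj = rng.normal(12.0, 3.0, size=200)

corrected = quantile_delta_mapping(obs_hist, mod_hist, mod_proj)
print(float(np.mean(corrected - mod_proj)))
```

The additive form shown here is the one usually applied to variables such as temperature; a multiplicative variant, in which the delta is a ratio rather than a difference, is commonly used for precipitation.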
