• Title/Summary/Keyword: Public Dataset

Search Result 241, Processing Time 0.026 seconds

Automatic Matching of Building Polygon Dataset from Digital Maps Using Hierarchical Matching Algorithm (계층적 매칭 기법을 이용한 수치지도 건물 폴리곤 데이터의 자동 정합에 관한 연구)

  • Yeom, Junho;Kim, Yongil;Lee, Jeabin
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.33 no.1
    • /
    • pp.45-52
    • /
    • 2015
  • The interoperability of multi-source data has become more important due to various digital maps, produced from public institutions and enterprises. In this study, the automatic matching algorithm of multi-source building data using hierarchical matching was proposed. At first, we divide digital maps into blocks and perform the primary geometric registration of buildings with the ICP algorithm. Then, corresponding building pairs were determined by evaluating the similarity of overlap area, and the matching threshold value of similarity was automatically derived by the Otsu binary thresholding. After the first matching, we extracted error matching candidates buildings which are similar with threshold value to conduct the secondary ICP matching and to make a matching decision using turning angle function analysis. For the evaluation, the proposed method was applied to representative public digital maps, road name address map and digital topographic map 2.0. As a result, the F measures of matching and non-matching buildings increased by 2% and 17%, respectively. Therefore, the proposed method is efficient for the matching of building polygons from multi-source digital maps.

Effect of Coverage Expansion Policy for an Ultrasonography in the Upper Abdomen on Its Utilization: A Difference-in-Difference Mixed-Effects Model Analysis (상복부초음파검사 급여확대에 따른 의료이용의 변화: 이중차이 혼합효과모형 추정방법을 이용하여)

  • Son, Yena;Lee, Yongjae;Nam, Chung-Mo;Kim, Gyu Ri;Chung, Woojin
    • Health Policy and Management
    • /
    • v.30 no.3
    • /
    • pp.326-334
    • /
    • 2020
  • Background: Korea has gradually expanded the coverage of medical care services in its national health insurance system. On April 1, 2018, it implemented a policy that expanded the coverage for an ultrasonography in the upper abdomen. In this study, we aimed to investigate the effect of the policy on the utilization of the ultrasonography in the upper abdomen in tertiary care hospitals. Methods: Using the dataset of the Health Insurance Review and Assessment Service, we explored changes in the utilization of the ultrasonography in the upper abdomen in tertiary care hospitals from July 1, 2017 to November 30, 2018 through the difference-in-difference (DID) mixed-effects-model method. Facility factor, equipment factor and personnel factors, type of hospital, the total amount of medical care expenses, and geographic region were considered as control variables. Results: On average, the utilization of the ultrasonography in the upper abdomen increased by 228% after the coverage expansion policy. However, the results of DID mixed-effects-model method analysis showed that the utilization increased by 73%. As for the number of beds, the utilization was higher with a group of 844-930, 931-1,217, and 1,218 or greater compared with a group of 843 or fewer, while the utilization of the number of ultrasonic devices was lower with a group of 45-49 compared with a group of 44 or fewer. The utilization decreased with the number of interns and the number of nurse assistants. Besides, relative to Seoul, the utilization was lower in the other metro-cities and provinces. Conclusion: The coverage expansion policy in the national health insurance system increased service utilization among people. Future research needs to investigate the degree to which such coverage expansion policy reduces the unmet medical care needs among the deprived in Korea.

The Effect of Publicness on the Service Quality in Long-Term Care Facilities (공공성이 노인장기요양시설의 서비스 질에 미치는 효과)

  • Kwon, Hyunjung;Hong, Kyungzoon
    • Korean Journal of Social Welfare
    • /
    • v.67 no.3
    • /
    • pp.253-280
    • /
    • 2015
  • This study reconsiders the concept of publicness by raising a question about the problems which the recent marketization of social services in South Korea. The existing perspective on publicness, however, is insufficient to account for the entire Long-term care market because only public organizations have publicness. Accordingly, this study presents 'integrated publicness', particularly 'dimensional publicness' and 'normative publicness'. A disproportional stratified sampling procedure was used to consider ownership. A merged dataset combining surveys from 248 Long-term Care facilities and on-line resources was used and analyzed by multiple regression, negative binomial regression and multiple imputation analysis. The analysis results suggest as follows. First, ownership publicness appeared more effective in the overall. Second, the regulations on the government funding did not show effective, and the regulation on evaluation system showed the effect. Third, professionalization of normative publicness showed a negative effect on service structure and showed a positive effect on service process. Lastly, user of free services whose public accountability was identified to be effective on service structure and outcome. These findings suggest that not only existing ownership but also dimensional publicness and normative publicness showed an effect on service quality. In this respect, this is important as the performance produced by empirical models of integrated publicness, in this situation that the outcome of marketization is insignificant.

  • PDF

Artificial Intelligence Techniques for Predicting Online Peer-to-Peer(P2P) Loan Default (인공지능기법을 이용한 온라인 P2P 대출거래의 채무불이행 예측에 관한 실증연구)

  • Bae, Jae Kwon;Lee, Seung Yeon;Seo, Hee Jin
    • The Journal of Society for e-Business Studies
    • /
    • v.23 no.3
    • /
    • pp.207-224
    • /
    • 2018
  • In this article, an empirical study was conducted by using public dataset from Lending Club Corporation, the largest online peer-to-peer (P2P) lending in the world. We explore significant predictor variables related to P2P lending default that housing situation, length of employment, average current balance, debt-to-income ratio, loan amount, loan purpose, interest rate, public records, number of finance trades, total credit/credit limit, number of delinquent accounts, number of mortgage accounts, and number of bank card accounts are significant factors to loan funded successful on Lending Club platform. We developed online P2P lending default prediction models using discriminant analysis, logistic regression, neural networks, and decision trees (i.e., CART and C5.0) in order to predict P2P loan default. To verify the feasibility and effectiveness of P2P lending default prediction models, borrower loan data and credit data used in this study. Empirical results indicated that neural networks outperforms other classifiers such as discriminant analysis, logistic regression, CART, and C5.0. Neural networks always outperforms other classifiers in P2P loan default prediction.

A Comparison of Analysis Methods for Work Environment Measurement Databases Including Left-censored Data (불검출 자료를 포함한 작업환경측정 자료의 분석 방법 비교)

  • Park, Ju-Hyun;Choi, Sangjun;Koh, Dong-Hee;Park, Donguk;Sung, Yeji
    • Journal of Korean Society of Occupational and Environmental Hygiene
    • /
    • v.32 no.1
    • /
    • pp.21-30
    • /
    • 2022
  • Objectives: The purpose of this study is to suggest an optimal method by comparing the analysis methods of work environment measurement datasets including left-censored data where one or more measurements are below the limit of detection (LOD). Methods: A computer program was used to generate left-censored datasets for various combinations of censoring rate (1% to 90%) and sample size (30 to 300). For the analysis of the censored data, the simple substitution method (LOD/2), β-substitution method, maximum likelihood estimation (MLE) method, Bayesian method, and regression on order statistics (ROS)were all compared. Each method was used to estimate four parameters of the log-normal distribution: (1) geometric mean (GM), (2) geometric standard deviation (GSD), (3) 95th percentile (X95), and (4) arithmetic mean (AM) for the censored dataset. The performance of each method was evaluated using relative bias and relative root mean squared error (rMSE). Results: In the case of the largest sample size (n=300), when the censoring rate was less than 40%, the relative bias and rMSE were small for all five methods. When the censoring rate was large (70%, 90%), the simple substitution method was inappropriate because the relative bias was the largest, regardless of the sample size. When the sample size was small and the censoring rate was large, the Bayesian method, the β-substitution method, and the MLE method showed the smallest relative bias. Conclusions: The accuracy and precision of all methods tended to increase as the sample size was larger and the censoring rate was smaller. The simple substitution method was inappropriate when the censoring rate was high, and the β-substitution method, MLE method, and Bayesian method can be widely applied.

Artificial Intelligence for Assistance of Facial Expression Practice Using Emotion Classification (감정 분류를 이용한 표정 연습 보조 인공지능)

  • Dong-Kyu, Kim;So Hwa, Lee;Jae Hwan, Bong
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.6
    • /
    • pp.1137-1144
    • /
    • 2022
  • In this study, an artificial intelligence(AI) was developed to help with facial expression practice in order to express emotions. The developed AI used multimodal inputs consisting of sentences and facial images for deep neural networks (DNNs). The DNNs calculated similarities between the emotions predicted by the sentences and the emotions predicted by facial images. The user practiced facial expressions based on the situation given by sentences, and the AI provided the user with numerical feedback based on the similarity between the emotion predicted by sentence and the emotion predicted by facial expression. ResNet34 structure was trained on FER2013 public data to predict emotions from facial images. To predict emotions in sentences, KoBERT model was trained in transfer learning manner using the conversational speech dataset for emotion classification opened to the public by AIHub. The DNN that predicts emotions from the facial images demonstrated 65% accuracy, which is comparable to human emotional classification ability. The DNN that predicts emotions from the sentences achieved 90% accuracy. The performance of the developed AI was evaluated through experiments with changing facial expressions in which an ordinary person was participated.

The role of RNA epigenetic modification-related genes in the immune response of cattle to mastitis induced by Staphylococcus aureus

  • Yue Xing;Yongjie Tang;Quanzhen Chen;Siqian Chen;Wenlong Li;Siyuan Mi;Ying Yu
    • Animal Bioscience
    • /
    • v.37 no.7
    • /
    • pp.1141-1155
    • /
    • 2024
  • Objective: RNA epigenetic modifications play an important role in regulating immune response of mammals. Bovine mastitis induced by Staphylococcus aureus (S. aureus) is a threat to the health of dairy cattle. There are numerous RNA modifications, and how these modification-associated enzymes systematically coordinate their immunomodulatory effects during bovine mastitis is not well reported. Therefore, the role of common RNA modification-related genes (RMRGs) in bovine S. aureus mastitis was investigated in this study. Methods: In total, 80 RMRGs were selected for this study. Four public RNA-seq data sets about bovine S. aureus mastitis were collected and one additional RNA-seq data set was generated by this study. Firstly, quantitative trait locus (QTL) database, transcriptome-wide association studies (TWAS) database and differential expression analyses were employed to characterize the potential functions of selected enzyme genes in bovine S. aureus mastitis. Correlation analysis and weighted gene co-expression network analysis (WGCNA) were used to further investigate the relationships of RMRGs from different types at the mRNA expression level. Interference experiments targeting the m6 A demethylase FTO and utilizing public MeRIP-seq dataset from bovine Mac-T cells were used to investigate the potential interaction mechanisms among various RNA modifications. Results: Bovine QTL and TWAS database in cattle revealed associations between RMRGs and immune-related complex traits. S. aureus challenged and control groups were effectively distinguished by principal component analysis based on the expression of selected RMRGs. WGCNA and correlation analysis identified modules grouping different RMRGs, with highly correlated mRNA expression. The m6 A modification gene FTO showed significant effects on the expression of m6 A and other RMRGs (such as NSUN2, CPSF2, and METTLE), indicating complex co-expression relationships among different RNA modifications in the regulation of bovine S. aureus mastitis. Conclusion: RNA epigenetic modification genes play important immunoregulatory roles in bovine S. aureus mastitis, and there are extensive interactions of mRNA expression among different RMRGs. It is necessary to investigate the interactions between RNA modification genes regulating complex traits in the future.

An Intelligent Intrusion Detection Model Based on Support Vector Machines and the Classification Threshold Optimization for Considering the Asymmetric Error Cost (비대칭 오류비용을 고려한 분류기준값 최적화와 SVM에 기반한 지능형 침입탐지모형)

  • Lee, Hyeon-Uk;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.157-173
    • /
    • 2011
  • As the Internet use explodes recently, the malicious attacks and hacking for a system connected to network occur frequently. This means the fatal damage can be caused by these intrusions in the government agency, public office, and company operating various systems. For such reasons, there are growing interests and demand about the intrusion detection systems (IDS)-the security systems for detecting, identifying and responding to unauthorized or abnormal activities appropriately. The intrusion detection models that have been applied in conventional IDS are generally designed by modeling the experts' implicit knowledge on the network intrusions or the hackers' abnormal behaviors. These kinds of intrusion detection models perform well under the normal situations. However, they show poor performance when they meet a new or unknown pattern of the network attacks. For this reason, several recent studies try to adopt various artificial intelligence techniques, which can proactively respond to the unknown threats. Especially, artificial neural networks (ANNs) have popularly been applied in the prior studies because of its superior prediction accuracy. However, ANNs have some intrinsic limitations such as the risk of overfitting, the requirement of the large sample size, and the lack of understanding the prediction process (i.e. black box theory). As a result, the most recent studies on IDS have started to adopt support vector machine (SVM), the classification technique that is more stable and powerful compared to ANNs. SVM is known as a relatively high predictive power and generalization capability. Under this background, this study proposes a novel intelligent intrusion detection model that uses SVM as the classification model in order to improve the predictive ability of IDS. Also, our model is designed to consider the asymmetric error cost by optimizing the classification threshold. Generally, there are two common forms of errors in intrusion detection. The first error type is the False-Positive Error (FPE). In the case of FPE, the wrong judgment on it may result in the unnecessary fixation. The second error type is the False-Negative Error (FNE) that mainly misjudges the malware of the program as normal. Compared to FPE, FNE is more fatal. Thus, when considering total cost of misclassification in IDS, it is more reasonable to assign heavier weights on FNE rather than FPE. Therefore, we designed our proposed intrusion detection model to optimize the classification threshold in order to minimize the total misclassification cost. In this case, conventional SVM cannot be applied because it is designed to generate discrete output (i.e. a class). To resolve this problem, we used the revised SVM technique proposed by Platt(2000), which is able to generate the probability estimate. To validate the practical applicability of our model, we applied it to the real-world dataset for network intrusion detection. The experimental dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log data in total, and selected 1,000 samples from them by using random sampling method. In addition, the SVM model was compared with the logistic regression (LOGIT), decision trees (DT), and ANN to confirm the superiority of the proposed model. LOGIT and DT was experimented using PASW Statistics v18.0, and ANN was experimented using Neuroshell 4.0. For SVM, LIBSVM v2.90-a freeware for training SVM classifier-was used. Empirical results showed that our proposed model based on SVM outperformed all the other comparative models in detecting network intrusions from the accuracy perspective. They also showed that our model reduced the total misclassification cost compared to the ANN-based intrusion detection model. As a result, it is expected that the intrusion detection model proposed in this paper would not only enhance the performance of IDS, but also lead to better management of FNE.

Social Tagging-based Recommendation Platform for Patented Technology Transfer (특허의 기술이전 활성화를 위한 소셜 태깅기반 지적재산권 추천플랫폼)

  • Park, Yoon-Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.3
    • /
    • pp.53-77
    • /
    • 2015
  • Korea has witnessed an increasing number of domestic patent applications, but a majority of them are not utilized to their maximum potential but end up becoming obsolete. According to the 2012 National Congress' Inspection of Administration, about 73% of patents possessed by universities and public-funded research institutions failed to lead to creating social values, but remain latent. One of the main problem of this issue is that patent creators such as individual researcher, university, or research institution lack abilities to commercialize their patents into viable businesses with those enterprises that are in need of them. Also, for enterprises side, it is hard to find the appropriate patents by searching keywords on all such occasions. This system proposes a patent recommendation system that can identify and recommend intellectual rights appropriate to users' interested fields among a rapidly accumulating number of patent assets in a more easy and efficient manner. The proposed system extracts core contents and technology sectors from the existing pool of patents, and combines it with secondary social knowledge, which derives from tags information created by users, in order to find the best patents recommended for users. That is to say, in an early stage where there is no accumulated tag information, the recommendation is done by utilizing content characteristics, which are identified through an analysis of key words contained in such parameters as 'Title of Invention' and 'Claim' among the various patent attributes. In order to do this, the suggested system extracts only nouns from patents and assigns a weight to each noun according to the importance of it in all patents by performing TF-IDF analysis. After that, it finds patents which have similar weights with preferred patents by a user. In this paper, this similarity is called a "Domain Similarity". Next, the suggested system extract technology sector's characteristics from patent document by analyzing the international technology classification code (International Patent Classification, IPC). Every patents have more than one IPC, and each user can attach more than one tag to the patents they like. Thus, each user has a set of IPC codes included in tagged patents. The suggested system manages this IPC set to analyze technology preference of each user and find the well-fitted patents for them. In order to do this, the suggeted system calcuates a 'Technology_Similarity' between a set of IPC codes and IPC codes contained in all other patents. After that, when the tag information of multiple users are accumulated, the system expands the recommendations in consideration of other users' social tag information relating to the patent that is tagged by a concerned user. The similarity between tag information of perferred 'patents by user and other patents are called a 'Social Simialrity' in this paper. Lastly, a 'Total Similarity' are calculated by adding these three differenent similarites and patents having the highest 'Total Similarity' are recommended to each user. The suggested system are applied to a total of 1,638 korean patents obtained from the Korea Industrial Property Rights Information Service (KIPRIS) run by the Korea Intellectual Property Office. However, since this original dataset does not include tag information, we create virtual tag information and utilized this to construct the semi-virtual dataset. The proposed recommendation algorithm was implemented with JAVA, a computer programming language, and a prototype graphic user interface was also designed for this study. As the proposed system did not have dependent variables and uses virtual data, it is impossible to verify the recommendation system with a statistical method. Therefore, the study uses a scenario test method to verify the operational feasibility and recommendation effectiveness of the system. The results of this study are expected to improve the possibility of matching promising patents with the best suitable businesses. It is assumed that users' experiential knowledge can be accumulated, managed, and utilized in the As-Is patent system, which currently only manages standardized patent information.

Ready-to-eat Cereal Consumption Enhances Milk and Calcium Intake in Korean Population from 2001 Korean National Health and Nutrition Survey (한국인의 시리얼 섭취실태와 우유 및 칼슘섭취와의 관련성 연구 - 2001년도 국민건강영양조사 자료를 이용하여 -)

  • Chung, Chin-Eun
    • Journal of Nutrition and Health
    • /
    • v.39 no.8
    • /
    • pp.786-794
    • /
    • 2006
  • The purpose of this study was to establish an association between the consumption of ready-to-eat cereal (RTEC), milk, and calcium within the context of the most current population dietary practice in Korea. Inadequate calcium intake among Korean children and adults is one of the important public health concern. Milk is one of the best calcium sources because or its bioavailability, and RTEC is one or the foods commonly consumed with milk. The most recent Korean National Health and Nutrition Survey, 2001 dataset was used as the source of data for this research. Subjects excluding pregnant women, were categorized according to gender and age ($1{\sim}5,\;6{\sim}11,\;12{\sim}19,\;20{\sim}49,\;50+$ years) and then by consumption of RTEC and milk. SAS and SUDAAN were used for statistical analyses. Sample weighted means, standard errors, and population percentages were calculated, and multiple regression model with adjustment for covariates were used to determine the predictability of total daily calcium intake from inclusion of RTEC and milk compared to the meal without RTEC and milk. RTEC was consumed by 2.4% or Korean people. Average calcium intake was 17 times greater when RTEC was consumed with milk than when RTEC was consumed without milk. Respondents who consumed RTEC with milk had significantly higher mean daily calcium and other nutrient intakes than respondents who consumed neither. in the multiple regression analysis, milk consumption with or without RTEC predicted total daily calcium intake after adjusting for age, income, and alcohol consumption (p<0.0001). The percentage of respondents below the estimated average requirement (EAR) level for calcium was lower for RTEC consumers than for RTEC non-consumers in all age-gender groups, especially significant differences were in children aged $1{\sim}5$, boys and girls aged $12{\sim}19$, men aged $20{\sim}49$, and women older than 50 years of age. RTEC consumption was not associated with intake in excess of the tolerable upper intake level (UL) for calcium. In conclusion, RTEC consumption was positively associated with both milk and calcium intakes in all age and gender groups in Korean population.