• Title/Summary/Keyword: meta data


Using Mechanical Learning Analysis of Determinants of Housing Sales and Establishment of Forecasting Model (기계학습을 활용한 주택매도 결정요인 분석 및 예측모델 구축)

  • Kim, Eun-mi;Kim, Sang-Bong;Cho, Eun-seo
    • Journal of Cadastre & Land InformatiX
    • /
    • v.50 no.1
    • /
    • pp.181-200
    • /
    • 2020
  • This study used an OLS model to estimate the determinants of housing tenure and then compared the predictive power of SVM, Decision Tree, Random Forest, Gradient Boosting, XGBoost, and LightGBM models. It differs from preceding studies in that it uses the Stacking model, one of the ensemble models, as a base for establishing a more predictive model that identifies the volume of housing transactions in the housing market. The OLS analysis showed that sales profits, housing prices, the number of household members, and the type of residential housing (detached housing, apartments) affected the period of housing ownership, and a comparison of predictability using RMSE showed that the machine learning models had higher predictability. Afterwards, the data were rebuilt with the influencing variables and each machine learning model was applied again; Random Forest showed the best predictive power. In addition, the most predictive models (Random Forest, Decision Tree, Gradient Boosting, and XGBoost) were applied as individual models, and the Stacking model was constructed using Linear, Ridge, and Lasso models as meta models. The Ridge meta model had the lowest RMSE, 0.5181, and thus yielded the most predictive model.

A Study on Costs of Digital Preservation (디지털 보존의 비용요소에 관한 연구)

  • Chung, Hye-Kyung
    • Journal of the Korean Society for information Management
    • /
    • v.22 no.1 s.55
    • /
    • pp.47-64
    • /
    • 2005
  • To guarantee long-term access to digital material, digital preservation needs to be systematized, and the cost elements of digital preservation should be investigated in detail to secure continued budget support. To meet this need, this paper categorizes digital preservation costs into direct and indirect costs by deriving the common elements used in prior research on this issue. For the case analysis, two institutions currently undertaking large-scale digitization, a domestic university library and the National Library of Korea, were selected to analyze the current status of digital preservation and estimate the preservation cost. The case analysis shows that a systematic preservation function should be performed to guarantee long-term access to digital material, even though basic digital preservation is currently conducted. It was projected that the digital preservation cost for the two libraries, accounting for 11.8% and 8.6% of the digitization cost, respectively, should be invested every year. However, the estimated figures are very conservative, because the cost of setting up the preservation function, such as installing a digital repository and producing meta data, was excluded from the estimation. This shows that digital preservation is a composite activity linked directly and indirectly to various activities, from production of to access to digital objects, and an essential cost that should be considered from the beginning of a digitization project.

Fatigue Analysis based on Kriging for Flaperon Joint of Tilt Rotor Type Aircraft (틸트 로터형 항공기의 플랩퍼론 연결부에 대한 크리깅 기반 피로해석)

  • Park, Young-Chul;Jang, Byoung-Uk;Im, Jong-Bin;Lee, Jung-Jin;Lee, Soo-Yong;Park, Jung-Sun
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.36 no.6
    • /
    • pp.541-549
    • /
    • 2008
  • Fatigue analysis is performed to avoid structural failure in aerospace structures under repeated loads. In this paper, the fatigue life is estimated for the design of a tilt rotor UAV. First, the fatigue load spectrum for the tilt rotor UAV is generated. Fatigue analysis is done for the flaperon joint, which may contain an FCL (fracture critical location). A tilt rotor UAV operates in two modes: helicopter mode, for operations such as taking off and landing, and fixed wing mode, for cruising. To build the overall fatigue load spectrum, FELIX is used for helicopter mode and TWIST is used for fixed wing mode. On the other hand, the Kriging meta model is used to obtain an S-N regression curve over the whole range of material life when the S-N test data are analyzed. A second-order S-N curve is then fitted by the least squares method, and the coefficient of determination is used to check its accuracy. Finally, the fatigue life of the flaperon joint is compared with that obtained by MSC.Fatigue.
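
The least squares and coefficient of determination steps named in this abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the Kriging step is not reproduced, the choice of log10(cycles) as the independent variable is an assumption, and the sample S-N points and the function names `fit_quadratic` and `r_squared` are hypothetical.

```python
def fit_quadratic(xs, ys):
    """Least-squares fit of y = c0 + c1*x + c2*x**2 via the normal equations."""
    # Power sums give the 3x3 normal-equation matrix and right-hand side.
    s = [sum(x ** k for x in xs) for k in range(5)]
    rows = [[s[i + j] for j in range(3)] for i in range(3)]
    rhs = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(3)]
    m = [row + [b] for row, b in zip(rows, rhs)]
    # Gaussian elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            factor = m[r][col] / m[col][col]
            for k in range(col, 4):
                m[r][k] -= factor * m[col][k]
    # Back substitution.
    coeffs = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        acc = m[i][3] - sum(m[i][j] * coeffs[j] for j in range(i + 1, 3))
        coeffs[i] = acc / m[i][i]
    return coeffs


def r_squared(xs, ys, coeffs):
    """Coefficient of determination of the fitted quadratic."""
    mean_y = sum(ys) / len(ys)
    pred = [coeffs[0] + coeffs[1] * x + coeffs[2] * x * x for x in xs]
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, pred))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Hypothetical S-N points: log10(cycles to failure) vs. stress amplitude (MPa).
log_cycles = [3.0, 4.0, 5.0, 6.0, 7.0]
stress_mpa = [520.0, 430.0, 352.0, 286.0, 243.0]
sn_coeffs = fit_quadratic(log_cycles, stress_mpa)
sn_r2 = r_squared(log_cycles, stress_mpa, sn_coeffs)
```

An R² value close to 1 indicates that the second-order curve reproduces the sample points closely. It does not by itself validate the curve outside the tested range.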

Research on Text Classification of Research Reports using Korea National Science and Technology Standards Classification Codes (국가 과학기술 표준분류 체계 기반 연구보고서 문서의 자동 분류 연구)

  • Choi, Jong-Yun;Hahn, Hyuk;Jung, Yuchul
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.1
    • /
    • pp.169-177
    • /
    • 2020
  • In South Korea, the results of R&D in science and technology are submitted to the National Science and Technology Information Service (NTIS) in reports that have Korea national science and technology standard classification codes (K-NSCC). However, considering there are more than 2000 sub-categories, it is non-trivial to choose correct classification codes without a clear understanding of the K-NSCC. In addition, there are few cases of automatic document classification research based on the K-NSCC, and there are no training data in the public domain. To the best of our knowledge, this study is the first attempt to build a highly performing K-NSCC classification system based on NTIS report meta-information from the last five years (2013-2017). To this end, about 210 mid-level categories were selected, and we conducted preprocessing considering the characteristics of research report metadata. More specifically, we propose a convolutional neural network (CNN) technique using only task names and keywords, which are the most influential fields. The proposed model is compared with several machine learning methods that show good performance in text classification (e.g., the linear support vector classifier, CNN, gated recurrent unit, etc.), and it shows a performance advantage of 1% to 7% based on a top-three F1 score.

Geochemistry and Stable Isotopes of Carbonated Waters in South Korea (남한 탄산수의 지구화학적 특성과 안정동위원소 조성)

  • 윤정아;김규한
    • Journal of the Korean Society of Groundwater Environment
    • /
    • v.7 no.3
    • /
    • pp.116-124
    • /
    • 2000
  • Geochemical and isotopic analyses were carried out to investigate the hydrochemical characteristics and the source of carbon species in the carbonated waters in South Korea. Most Korean carbonated waters from different geologic settings are characterized by a Ca-HCO$_3$ type with a relatively low pH range from 5.3 to 6.3 (avg. 6.0). The concentrations of cations and anions in the carbonated waters are in the order of Ca$^{2+}$ > Na$^{+}$ > Mg$^{2+}$ > Si$^{4+}$ > Fe$^{2+}$ > K$^{+}$ and HCO$_3^{-}$ > SO$_4^{2-}$ > Cl$^{-}$, respectively. The HCO$_3^{-}$ ion is more enriched in the carbonated water from the sedimentary rock and granitic rock of Mesozoic age in the Gyungsang basin (GII) and the Precambrian metamorphic rock and Jurassic granitic rocks of the Gyunggi massif in the Gangwon province (GI) than in that from the meta-sedimentary rock and granite in the Ogcheon zone (GIII). Based on the oxygen and hydrogen isotopic data, the carbonated waters are derived from meteoric water, showing apparent latitude and altitude effects. The $\delta^{13}$C values of carbon species in the carbonated water are between -6.23 and 0.0‰, suggesting an inorganic source of carbon originating from the carbonate minerals and carbonate rock in the aquifer.


A genome-wide association study of the association between single nucleotide polymorphisms and brachial-ankle pulse wave velocity in healthy Koreans

  • Xu, EnShi;Shin, Jinho;Lim, Ji Eun;Kim, Mi Kyung;Choi, Bo Youl;Shin, Min-Ho;Shin, Dong Hoon;Lee, Young-Hoon;Chun, Byung-Yeol;Hong, Kyung-Won;Hwang, Joo-Yeon
    • Journal of Genetic Medicine
    • /
    • v.14 no.1
    • /
    • pp.8-17
    • /
    • 2017
  • Purpose: Pulse wave velocity (PWV) is an indicator of arterial stiffness, and is considered a marker of vascular damage. However, a genome-wide association study analyzing single nucleotide polymorphisms (SNPs) associated with brachial-ankle PWV (baPWV) has not been conducted in healthy populations. We performed this study to identify SNPs associated with baPWV in healthy populations in Korea. Materials and Methods: Genomic SNP data for 2,407 individuals from three sites were analyzed as part of the Korean Genomic Epidemiologic Study. Without replication samples, we performed multivariable analysis as a post hoc analysis to verify the findings of the site-adjusted analysis. Healthy subjects aged between 40 and 70 years without self-reported history or diagnosis of hypertension, diabetes, hyperlipidemia, heart disease, cerebrovascular disease and cancer were included. We excluded subjects with a creatinine level >1.4 mg/dL (men) or >1.2 mg/dL (women). Results: In the site-adjusted association analysis, significant associations ($P < 5 \times 10^{-8}$) with baPWV were detected for only 5 SNPs with low minor allele frequency. In multivariable analysis adjusted for age, sex, height, body mass index, mean arterial pressure, site, smoking, alcohol, and exercise, 11 SNPs were found to be associated ($P < 5 \times 10^{-8}$) with baPWV. The 5 SNPs ($P < 5 \times 10^{-8}$) linked to three genes (OPCML, PRR35 and RAB40C) were common between the site-adjusted analysis and the multivariable analysis. However, meta-analysis of the results from the three sites for the 11 SNPs showed no significant associations. Conclusion: Using the recent standard for genome-wide association studies, we did not find any evidence of significant association signals with baPWV.

A meta-study of Informal Science Learning and Generic Learning Outcomes: Focusing on published papers in the last 10 years (비형식과학교육과 포괄적학습성과의 메타연구: 최근 10년간의 발표논문을 중심으로)

  • Cho, Ig-Hyeng;You, Yen-Yoo;Na, Kwan-Sik
    • Journal of Digital Convergence
    • /
    • v.19 no.9
    • /
    • pp.33-42
    • /
    • 2021
  • Despite the importance of science education in an informal environment, the reality is that there is a lack of trend analysis research on 'Informal Science Learning (ISL)' and its effects. Therefore, the purpose of this paper is to find out the educational effects of ISL and how to use it, and to provide guidelines for future ISL research directions. This study classifies specific ISL-related papers published from 2010 to 2019 and compares them with each element of GLO used to measure the effectiveness of informal education. The fit of the analyzed data was checked for each part through SPSS and Chi-Square. In conclusion, it was found that researchers are using 'ISL' to pursue 'Knowledge and Understanding' and 'Attitudes and Values' among the five performance indicators of 'GLO'. On the other hand, 'Skills' and 'Enjoyment, Inspiration and Creativity' appear to have the least expectations, so supplementation is required in these areas in the future. In addition, this study intends to suggest a direction for informal science education-related program development and future research to various education workers.

Personalized Travel Path Recommendation Scheme on Social Media (소셜 미디어 상에서 개인화된 여행 경로 추천 기법)

  • Aniruddha, Paul;Lim, Jongtae;Bok, Kyoungsoo;Yoo, Jaesoo
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.2
    • /
    • pp.284-295
    • /
    • 2019
  • Recently, personalized travel path recommendation based on both travelogues and community-contributed photos, together with the heterogeneous meta-data (tags, geographical locations, and date taken) associated with the photos, has been studied. Travellers using social media leave their location history in the form of paths. These paths can be bridged to acquire the information required for future recommendations to travellers who are new to a location. In this paper, we propose a personalized travel path recommendation scheme based on social life logs. By taking advantage of two kinds of social media, travelogues and community-contributed photos, the proposed scheme can not only be personalized to a user's travel interests but can also recommend a travel path rather than individual Points of Interest (POIs). The proposed personalized travel route recommendation method consists of two steps: a POI pruning step and a travel path creation step. In the POI pruning step, candidate paths are created from the derived POIs. In the travel path creation step, the proposed scheme creates paths considering the user's interest, cost, time, and the season of the topic for more meaningful recommendations.

Retrieval Biases Analysis on Estimation of GNSS Precipitable Water Vapor by Tropospheric Zenith Hydrostatic Models (GNSS 가강수량 추정시 건조 지연 모델에 의한 복원 정밀도 해석)

  • Nam, JinYong;Song, DongSeob
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.4
    • /
    • pp.233-242
    • /
    • 2019
  • The ZHD (Zenith Hydrostatic Delay) model is an important parameter in estimating GNSS (Global Navigation Satellite System) PWV (Precipitable Water Vapor), along with the weighted mean temperature. The ZWD (Zenith Wet Delay) tends to accumulate the ZHD error, so biases in the ZHD affect the precision of GNSS PWV. In this paper, we compared the accuracy of GNSS PWV with radiosonde PWV using three ZHD models: Saastamoinen, Hopfield, and Black. We also adopted the KWMT (Korean Weighted Mean Temperature) model and the mean temperature observed by radiosonde in the retrieval processing of GNSS PWV. To this end, one year of GNSS observation data was processed to produce PWVs from a total of 5 GNSS permanent stations in Korea, and the GNSS PWVs were compared with radiosonde PWVs to evaluate the biases. The PWV biases using the mean temperature estimated by the KWMT model are smaller than those using the radiosonde mean temperature. We also confirmed that the Saastamoinen ZHD, which is the most widely used in GNSS meteorology, is not valid in South Korea, because the possibility of biases due to the latitude or height of the GNSS station cannot be excluded.
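
How a ZHD bias carries into PWV can be sketched as follows. This is a minimal illustration, not the authors' processing chain. It assumes the commonly cited Saastamoinen form of the ZHD, subtracts it from a zenith total delay (ZTD) to get the ZWD, and scales the ZWD by a factor that depends on the weighted mean temperature. The refractivity constants are commonly quoted literature values and are not taken from the paper. The function names are hypothetical.

```python
import math


def saastamoinen_zhd(pressure_hpa, lat_deg, height_km):
    """Zenith hydrostatic delay in metres (commonly cited Saastamoinen form)."""
    # Gravity correction depends on station latitude and height.
    gravity_term = (1.0
                    - 0.00266 * math.cos(2.0 * math.radians(lat_deg))
                    - 0.00028 * height_km)
    return 0.0022768 * pressure_hpa / gravity_term


def pwv_factor(tm_kelvin):
    """Dimensionless factor that maps ZWD to PWV for mean temperature Tm."""
    rho_w = 1000.0    # density of liquid water, kg/m^3
    r_v = 461.5       # specific gas constant of water vapour, J/(kg K)
    k2_prime = 0.221  # assumed constant, K/Pa (22.1 K/hPa)
    k3 = 3739.0       # assumed constant, K^2/Pa (3.739e5 K^2/hPa)
    return 1.0e6 / (rho_w * r_v * (k3 / tm_kelvin + k2_prime))


def gnss_pwv(ztd_m, pressure_hpa, lat_deg, height_km, tm_kelvin):
    """PWV in metres from a zenith total delay and surface meteorology."""
    # Any error in the modelled ZHD passes directly into the ZWD.
    zwd_m = ztd_m - saastamoinen_zhd(pressure_hpa, lat_deg, height_km)
    return pwv_factor(tm_kelvin) * zwd_m


# Hypothetical inputs: ZTD 2.45 m, 1013.25 hPa, 45 deg latitude, sea level.
example_pwv_m = gnss_pwv(2.45, 1013.25, 45.0, 0.0, 280.0)
```

Because the ZWD is formed by subtraction, a surface pressure error of 1 hPa shifts the modelled ZHD, and therefore the ZWD, by roughly 2.3 mm in this form.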

Analysis of Published Research in the Journal of Muscle and Joint Health from 2008 to 2020 (근관절건강학회지 게재 논문 분석: 2008년부터 2020년까지)

  • Park, Mi-Sung;Lee, Kyung-Sook;Shin, Gyeyoung;Woo, Soo-Hee;Lim, Kyung-Choon;Choi, Heejung;Jin, Soo-Ji;Park, Yeon-Hwan
    • Journal of muscle and joint health
    • /
    • v.29 no.1
    • /
    • pp.69-80
    • /
    • 2022
  • Purpose: To identify research trends in the Journal of Muscle and Joint Health. Methods: In total, 315 studies published between 2008 and 2020 in the Journal of Muscle and Joint Health were reviewed using analysis criteria developed by the authors. Results: Most participants were adults or older adults, and they mostly had arthritis. The types of research design were descriptive research (46.4%), quasi-experimental design (21.9%), randomized controlled trial (1.9%), and qualitative research (4.1%). The occupation of most authors was university professor (61.0%). Data were collected mostly in hospitals (41.6%) or communities (24.4%) using a questionnaire (52.4%). Written consent was obtained in 75.6% of studies, and 47.9% of studies were approved by the Institutional Review Board (IRB). The instruments used most often measured physical concepts such as pain, flexibility, sense of balance, and fatigue. The most common interventions in experimental studies were physical interventions, mainly exercise. Key words were categorized into four nursing meta-paradigms: human, health, environment and nursing. The most frequently reported key words were in the health domain. The most frequently used key words were physical intervention, older patient, osteoarthritis, pain and depression. Conclusion: The results suggest that more research studies targeting various age groups related to muscle and joint health are required. Additionally, there is a need to increase the number of qualitative studies, randomized experimental studies, and systematic review studies. It is necessary to pay attention to compliance with research ethics publication regulations.