• Title/Summary/Keyword: Algorithm Model

Search Result 12,934, Processing Time 0.038 seconds

Online news-based stock price forecasting considering homogeneity in the industrial sector (산업군 내 동질성을 고려한 온라인 뉴스 기반 주가예측)

  • Seong, Nohyoon;Nam, Kihwan
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.1-19
    • /
    • 2018
  • Since stock movements forecasting is an important issue both academically and practically, studies related to stock price prediction have been actively conducted. The stock price forecasting research is classified into structured data and unstructured data, and it is divided into technical analysis, fundamental analysis and media effect analysis in detail. In the big data era, research on stock price prediction combining big data is actively underway. Based on a large number of data, stock prediction research mainly focuses on machine learning techniques. Especially, research methods that combine the effects of media are attracting attention recently, among which researches that analyze online news and utilize online news to forecast stock prices are becoming main. Previous studies predicting stock prices through online news are mostly sentiment analysis of news, making different corpus for each company, and making a dictionary that predicts stock prices by recording responses according to the past stock price. Therefore, existing studies have examined the impact of online news on individual companies. For example, stock movements of Samsung Electronics are predicted with only online news of Samsung Electronics. In addition, a method of considering influences among highly relevant companies has also been studied recently. For example, stock movements of Samsung Electronics are predicted with news of Samsung Electronics and a highly related company like LG Electronics.These previous studies examine the effects of news of industrial sector with homogeneity on the individual company. In the previous studies, homogeneous industries are classified according to the Global Industrial Classification Standard. In other words, the existing studies were analyzed under the assumption that industries divided into Global Industrial Classification Standard have homogeneity. However, existing studies have limitations in that they do not take into account influential companies with high relevance or reflect the existence of heterogeneity within the same Global Industrial Classification Standard sectors. As a result of our examining the various sectors, it can be seen that there are sectors that show the industrial sectors are not a homogeneous group. To overcome these limitations of existing studies that do not reflect heterogeneity, our study suggests a methodology that reflects the heterogeneous effects of the industrial sector that affect the stock price by applying k-means clustering. Multiple Kernel Learning is mainly used to integrate data with various characteristics. Multiple Kernel Learning has several kernels, each of which receives and predicts different data. To incorporate effects of target firm and its relevant firms simultaneously, we used Multiple Kernel Learning. Each kernel was assigned to predict stock prices with variables of financial news of the industrial group divided by the target firm, K-means cluster analysis. In order to prove that the suggested methodology is appropriate, experiments were conducted through three years of online news and stock prices. The results of this study are as follows. (1) We confirmed that the information of the industrial sectors related to target company also contains meaningful information to predict stock movements of target company and confirmed that machine learning algorithm has better predictive power when considering the news of the relevant companies and target company's news together. (2) It is important to predict stock movements with varying number of clusters according to the level of homogeneity in the industrial sector. In other words, when stock prices are homogeneous in industrial sectors, it is important to use relational effect at the level of industry group without analyzing clusters or to use it in small number of clusters. When the stock price is heterogeneous in industry group, it is important to cluster them into groups. This study has a contribution that we testified firms classified as Global Industrial Classification Standard have heterogeneity and suggested it is necessary to define the relevance through machine learning and statistical analysis methodology rather than simply defining it in the Global Industrial Classification Standard. It has also contribution that we proved the efficiency of the prediction model reflecting heterogeneity.

Development of Gated Myocardial SPECT Analysis Software and Evaluation of Left Ventricular Contraction Function (게이트 심근 SPECT 분석 소프트웨어의 개발과 좌심실 수축 기능 평가)

  • Lee, Byeong-Il;Lee, Dong-Soo;Lee, Jae-Sung;Chung, June-Key;Lee, Myung-Chul;Choi, Heung-Kook
    • The Korean Journal of Nuclear Medicine
    • /
    • v.37 no.2
    • /
    • pp.73-82
    • /
    • 2003
  • Objectives: A new software (Cardiac SPECT Analyzer: CSA) was developed for quantification of volumes and election fraction on gated myocardial SPECT. Volumes and ejection fraction by CSA were validated by comparing with those quantified by Quantitative Gated SPECT (QGS) software. Materials and Methods: Gated myocardial SPECT was peformed in 40 patients with ejection fraction from 15% to 85%. In 26 patients, gated myocardial SPECT was acquired again with the patients in situ. A cylinder model was used to eliminate noise semi-automatically and profile data was extracted using Gaussian fitting after smoothing. The boundary points of endo- and epicardium were found using an iterative learning algorithm. Enddiastolic (EDV) and endsystolic volumes (ESV) and election fraction (EF) were calculated. These values were compared with those calculated by QGS and the same gated SPECT data was repeatedly quantified by CSA and variation of the values on sequential measurements of the same patients on the repeated acquisition. Results: From the 40 patient data, EF, EDV and ESV by CSA were correlated with those by QGS with the correlation coefficients of 0.97, 0.92, 0.96. Two standard deviation (SD) of EF on Bland Altman plot was 10.1%. Repeated measurements of EF, EDV, and ESV by CSA were correlated with each other with the coefficients of 0.96, 0.99, and 0.99 for EF, EDV and ESV respectively. On repeated acquisition, reproducibility was also excellent with correlation coefficients of 0.89, 0.97, 0.98, and coefficient of variation of 8.2%, 5.4mL, 8.5mL and 2SD of 10.6%, 21.2mL, and 16.4mL on Bland Altman plot for EF, EDV and ESV. Conclusion: We developed the software of CSA for quantification of volumes and ejection fraction on gated myocardial SPECT. Volumes and ejection fraction quantified using this software was found valid for its correctness and precision.

Simultaneous estimation of fatty acids contents from soybean seeds using fourier transform infrared spectroscopy and gas chromatography by multivariate analysis (적외선 분광스펙트럼 및 기체크로마토그라피 분석 데이터의 다변량 통계분석을 이용한 대두 종자 지방산 함량예측)

  • Ahn, Myung Suk;Ji, Eun Yee;Song, Seung Yeob;Ahn, Joon Woo;Jeong, Won Joong;Min, Sung Ran;Kim, Suk Weon
    • Journal of Plant Biotechnology
    • /
    • v.42 no.1
    • /
    • pp.60-70
    • /
    • 2015
  • The aim of this study was to investigate whether fourier transform infrared (FT-IR) spectroscopy can be applied to simultaneous determination of fatty acids contents in different soybean cultivars. Total 153 lines of soybean (Glycine max Merrill) were examined by FT-IR spectroscopy. Quantification of fatty acids from the soybean lines was confirmed by quantitative gas chromatography (GC) analysis. The quantitative spectral variation among different soybean lines was observed in the amide bond region ($1,700{\sim}1,500cm^{-1}$), phosphodiester groups ($1,500{\sim}1,300cm^{-1}$) and sugar region ($1,200{\sim}1,000cm^{-1}$) of FT-IR spectra. The quantitative prediction modeling of 5 individual fatty acids contents (palmitic acid, stearic acid, oleic acid, linoleic acid, linolenic acid) from soybean lines were established using partial least square regression algorithm from FT-IR spectra. In cross validation, there were high correlations ($R^2{\geq}0.97$) between predicted content of 5 individual fatty acids by PLS regression modeling from FT-IR spectra and measured content by GC. In external validation, palmitic acid ($R^2=0.8002$), oleic acid ($R^2=0.8909$) and linoleic acid ($R^2=0.815$) were predicted with good accuracy, while prediction for stearic acid ($R^2=0.4598$), linolenic acid ($R^2=0.6868$) had relatively lower accuracy. These results clearly show that FT-IR spectra combined with multivariate analysis can be used to accurately predict fatty acids contents in soybean lines. Therefore, we suggest that the PLS prediction system for fatty acid contents using FT-IR analysis could be applied as a rapid and high throughput screening tool for the breeding for modified Fatty acid composition in soybean and contribute to accelerating the conventional breeding.

Evaluation of Endothelium-dependent Myocardial Perfusion Reserve in Healthy Smokers; Cold Pressor Test using $H_2^{15}O\;PET$ (흡연자에서 관상동맥 내피세포 의존성 심근 혈류 예비능: $H_2^{15}O\;PET$ 찬물자극 검사에 의한 평가)

  • Hwang, Kyung-Hoon;Lee, Dong-Soo;Lee, Byeong-Il;Lee, Jae-Sung;Lee, Ho-Young;Chung, June-Key;Lee, Myung-Chul
    • The Korean Journal of Nuclear Medicine
    • /
    • v.38 no.1
    • /
    • pp.21-29
    • /
    • 2004
  • Purpose: Much evidence suggests long-term cigarette smoking alters coronary vascular endothelial response. On this study, we applied nonnegative matrix factorization (NMF), an unsupervised learning algorithm, to CO-less $H_2^{15}O-PET$ to investigate coronary endothelial dysfunction caused by smoking noninvasively. Materials and methods: This study enrolled eighteen young male volunteers consisting of 9 smokers $(23.8{\pm}1.1\;yr;\;6.5{\pm}2.5$ pack-years) and 9 nonsmokers $(23.8{\pm}2.9 yr)$. They do not have any cardiovascular risk factor or disease history. Myocardial $H_2^{15}O-PET$ was performed at rest, during cold ($5^{\circ}C$) pressor stimulation and during adenosine infusion. Left ventricular blood pool and myocardium were segmented on dynamic PET data by NMF method. Myocardial blood flow (MBF) was calculated from input and tissue functions by a single compartmental model with correction of partial volume and spillover effects. Results: There were no significant difference in resting MBF between the two groups (Smokers: 1.43 0.41 ml/g/min and non-smokers: $1.37{\pm}0.41$ ml/g/min p=NS). during cold pressor stimulation, MBF in smokers was significantly lower than 4hat in non-smokers ($1.25{\pm}0.34$ ml/g/min vs $1.59{\pm}0.29$ ml/gmin; p=0.019). The difference in the ratio of cold pressor MBF to resting MBF between the two groups was also significant (p=0.024; $90{\pm}24%$ in smokers and $122{\pm}28%$ in non-smokers.). During adenosine infusion, however, hyperemic MBF did not differ significantly between smokers and non-smokers ($5.81{\pm}1.99$ ml/g/min vs $5.11{\pm}1.31$ ml/g/min ; p=NS). Conclusion: in smokers, MBF during cold pressor stimulation was significantly lower compared wi4h nonsmokers, reflecting smoking-Induced endothelial dysfunction. However, there was no significant difference in MBF during adenosine-induced hyperemia between the two groups.

The Evaluation of Usefulness of Wide Beam Reconstruction Method on Segmental Perfusion and Regional Wall Motion in Myocardial Perfusion SPECT (심근관류 SPECT의 분절별 관류 및 국소벽 운동에서 Wide Beam Reconstruction기법의 유용성 평가)

  • Seong, Yong-Joon;Kim, Tae-Yeob;Moon, Il-Sang;Cho, Seong-Wook;Woo, Jae-Ryong
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.15 no.1
    • /
    • pp.51-57
    • /
    • 2011
  • Purpose: The aim of this study is to identify clinical usefulness of Wide Beam Reconstruction (WBR) which is called Xpress.cardiac$^{TM}$ to confirm the agreement between segmental perfusion and regional wall motion in myocardium compared to conventional OSEM method. Materials and Methods: Subjects were separated two groups. First group was composed of 20 normal control group. Second group was composed of 10 patients (abnormal group) who had coronary artery disease. Subjects underwent myocardial perfusion SPECT ($^{201}Tl$ rest and $^{99m}Tc$-MIBI stress). Image acquisition and reconstruction were that rest stage was each step per 30, 15 seconds and stress stage was each step per 25, 13 seconds, OSEM and WBR methods were applied. Segmental perfusion and regional wall motion were applied 20-segment model of QPS, QGS algorithm in AutoQuant. Status of perfusion was composed of 5 point scoring system (0=normal, 1=mild, 2=moderate, 3=severe hypokinesia, 4=dyskinesia). Status of regional wall motion was also composed of 5 point scoring (0=normal, 1=mild, 2=moderate, 3=severe hypokinesia, 4=dyskinesia). We evaluated the agreement between conventional OSEM and WBR through automatic quantification value. Results: The agreement of rest segmental perfusion between conventional OSEM and WBR in normal patients was 99% (396/400, k=0.662, p<0.0001) and one of rest regional wall motion was 83.8% (335/400, k=0.283), the agreement of stress segmental perfusion was 95.8%(383/400, k=0.656), one of stress regional wall motion was 87.3% (349/400, k=0.390). The match rate of rest segmental perfusion in abnormal patients was 83% (166/200, k=0.605, p<0.0001) and one of rest regional wall motion was 55.5% (111/200, k=0.385), the agreement of stress segmental perfusion was 79.5% (159/200, k=0.682), one of stress regional wall motion was 63.5% (127/200, k=0.486). Conclusion: Compared to conventional OSEM, WBR method had a good agreement of segmental perfusion in myocardium in normal and abnormal groups. However regional wall motion showed meaningful low agreement. Although WBR offers high resolution and contrast ratio, it is not useful method for gated myocardial perfusion SPECT.

  • PDF

Availability Assessment of Single Frequency Multi-GNSS Real Time Positioning with the RTCM-State Space Representation Parameters (RTCM-SSR 보정요소 기반 1주파 Multi-GNSS 실시간 측위의 효용성 평가)

  • Lee, Yong-Chang;Oh, Seong-Jong
    • Journal of Cadastre & Land InformatiX
    • /
    • v.50 no.1
    • /
    • pp.107-123
    • /
    • 2020
  • With stabilization of the recent multi-GNSS infrastructure, and as multi-GNSS has been proven to be effective in improving the accuracy of the positioning performance in various industrial sectors. In this study, in view that SF(Single frequency) GNSS receivers are widely used due to the low costs, evaluate effectiveness of SF Real Time Point Positioning(SF-RT-PP) based on four multi-GNSS surveying methods with RTCM-SSR correction streams in static and kinematic modes, and also derive response challenges. Results of applying SSR correction streams, CNES presented good results compared to other SSR streams in 2D coordinate. Looking at the results of the SF-RT-PP surveying using SF signals from multi-GNSS, were able to identify the common cause of large deviations in the altitude components, as well as confirm the importance of signal bias correction according to combinations of different types of satellite signals and ionospheric delay compensation algorithm using undifferenced and uncombined observations. In addition, confirmed that the improvement of the infrastructure of Multi-GNSS allows SF-RT-SPP surveying with only one of the four GNSS satellites. In particular, in the case of code-based SF-RT-SPP measurements using SF signals from GPS satellites only, the difference in the application effect between broadcast ephemeris and SSR correction for satellite orbits/clocks was small, but in the case of ionospheric delay compensation, the use of SBAS correction information provided more than twice the accuracy compared to result of the Klobuchar model. With GPS and GLONASS, both the BDS and GALILEO constellations will be fully deployed in the end of 2020, and the greater benefits from the multi-GNSS integration can be expected. Specially, If RT-ionospheric correction services reflecting regional characteristics and SSR correction information reflecting atmospheric characteristics are carried out in real-time, expected that the utilization of SF-RT-PPP survey technology by multi-GNSS and various demands will be created in various industrial sectors.

Comparative study of volumetric change in water-stored and dry-stored complete denture base (공기중과 수중에서 보관한 총의치 의치상의 체적변화에 대한 비교연구)

  • Kim, Jinseon;Lee, Younghoo;Hong, Seoung-Jin;Paek, Janghyun;Noh, Kwantae;Pae, Ahran;Kim, Hyeong-Seob;Kwon, Kung-Rock
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.59 no.1
    • /
    • pp.18-26
    • /
    • 2021
  • Purpose: Generally, patients are noticed to store denture in water when removed from the mouth. However, few studies have reported the advantage of volumetric change in underwater storage over dry storage. To be a reference in defining the proper denture storage method, this study aims to evaluate the volumetric change and dimensional deformation in case of underwater and dry storage. Materials and methods: Definitive casts were scanned by a model scanner, and denture bases were designed with computer-aided design (CAD) software. Twelve denture bases (upper 6, lower 6) were printed with 3D printer. Printed denture bases were invested and flasked with heat-curing method. 6 upper and 6 lower dentures were divided into group A and B, and each group contains 3 upper and 3 lower dentures. Group A was stored dry at room temperature, group B was stored underwater. Group B was scanned at every 24 hours for 28 days and scanned data was saved as stereolithography (SLA) file. These SLA files were analyzed to measure the difference in volumetric change of a month and Kruskal-Wallis test were used for statistical analysis. Best-fit algorithm was used to overlap and 3-dimensional color-coded map was used to observe the changing pattern of impression surface. Results: No significant difference was found in volumetric changes regardless of the storage methods. In dry-stored denture base, significant changes were found in the palate of upper jaw and posterior lingual border of lower jaw in direction away from the underlying tissue, maxillary tuberosity of upper jaw and retromolar pad area of lower jaw in direction towards the underlying tissue. Conclusion: Storing the denture underwater shows less volumetric change of impression surface than storing in the dry air.

A Study on Training Dataset Configuration for Deep Learning Based Image Matching of Multi-sensor VHR Satellite Images (다중센서 고해상도 위성영상의 딥러닝 기반 영상매칭을 위한 학습자료 구성에 관한 연구)

  • Kang, Wonbin;Jung, Minyoung;Kim, Yongil
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.1505-1514
    • /
    • 2022
  • Image matching is a crucial preprocessing step for effective utilization of multi-temporal and multi-sensor very high resolution (VHR) satellite images. Deep learning (DL) method which is attracting widespread interest has proven to be an efficient approach to measure the similarity between image pairs in quick and accurate manner by extracting complex and detailed features from satellite images. However, Image matching of VHR satellite images remains challenging due to limitations of DL models in which the results are depending on the quantity and quality of training dataset, as well as the difficulty of creating training dataset with VHR satellite images. Therefore, this study examines the feasibility of DL-based method in matching pair extraction which is the most time-consuming process during image registration. This paper also aims to analyze factors that affect the accuracy based on the configuration of training dataset, when developing training dataset from existing multi-sensor VHR image database with bias for DL-based image matching. For this purpose, the generated training dataset were composed of correct matching pairs and incorrect matching pairs by assigning true and false labels to image pairs extracted using a grid-based Scale Invariant Feature Transform (SIFT) algorithm for a total of 12 multi-temporal and multi-sensor VHR images. The Siamese convolutional neural network (SCNN), proposed for matching pair extraction on constructed training dataset, proceeds with model learning and measures similarities by passing two images in parallel to the two identical convolutional neural network structures. The results from this study confirm that data acquired from VHR satellite image database can be used as DL training dataset and indicate the potential to improve efficiency of the matching process by appropriate configuration of multi-sensor images. DL-based image matching techniques using multi-sensor VHR satellite images are expected to replace existing manual-based feature extraction methods based on its stable performance, thus further develop into an integrated DL-based image registration framework.

Rainfall image DB construction for rainfall intensity estimation from CCTV videos: focusing on experimental data in a climatic environment chamber (CCTV 영상 기반 강우강도 산정을 위한 실환경 실험 자료 중심 적정 강우 이미지 DB 구축 방법론 개발)

  • Byun, Jongyun;Jun, Changhyun;Kim, Hyeon-Joon;Lee, Jae Joon;Park, Hunil;Lee, Jinwook
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.6
    • /
    • pp.403-417
    • /
    • 2023
  • In this research, a methodology was developed for constructing an appropriate rainfall image database for estimating rainfall intensity based on CCTV video. The database was constructed in the Large-Scale Climate Environment Chamber of the Korea Conformity Laboratories, which can control variables with high irregularity and variability in real environments. 1,728 scenarios were designed under five different experimental conditions. 36 scenarios and a total of 97,200 frames were selected. Rain streaks were extracted using the k-nearest neighbor algorithm by calculating the difference between each image and the background. To prevent overfitting, data with pixel values greater than set threshold, compared to the average pixel value for each image, were selected. The area with maximum pixel variability was determined by shifting with every 10 pixels and set as a representative area (180×180) for the original image. After re-transforming to 120×120 size as an input data for convolutional neural networks model, image augmentation was progressed under unified shooting conditions. 92% of the data showed within the 10% absolute range of PBIAS. It is clear that the final results in this study have the potential to enhance the accuracy and efficacy of existing real-world CCTV systems with transfer learning.

Analysis of Landslide Occurrence Characteristics Based on the Root Cohesion of Vegetation and Flow Direction of Surface Runoff: A Case Study of Landslides in Jecheon-si, Chungcheongbuk-do, South Korea (식생의 뿌리 점착력과 지표유출의 흐름 조건을 고려한 산사태의 발생 특성 분석: 충청북도 제천지역의 사례를 중심으로)

  • Jae-Uk Lee;Yong-Chan Cho;Sukwoo Kim;Minseok Kim;Hyun-Joo Oh
    • Journal of Korean Society of Forest Science
    • /
    • v.112 no.4
    • /
    • pp.426-441
    • /
    • 2023
  • This study investigated the predictive accuracy of a model of landslide displacement in Jecheon-si, where a great number of landslides were triggered by heavy rain on both natural (non-clear-cut) and clear-cut slopes during August 2020. This was accomplished by applying three flow direction methods (single flow direction, SFD; multiple flow direction, MFD; infinite flow direction, IFD) and the degree of root cohesion to an infinite slope stability equation. The application assumed that the soil saturation and any changes in root cohesion occurred following the timber harvest (clear-cutting). In the study area, 830 landslide locations were identified via landslide inventory mapping from satellite images and 25 cm resolution aerial photographs. The results of the landslide modeling comparison showed the accuracy of the models that considered changes in the root cohesion following clear-cutting to be improved by 1.3% to 2.6% when compared with those not considered in the area under the receiver operating characteristics (AUROC) analysis. Furthermore, the accuracy of the models that used the MFD algorithm improved by up to 1.3% when compared with the models that used the other algorithms in the AUROC analysis. These results suggest that the discriminatory application of the root cohesion, which considers changes in the vegetation condition, and the selection of the flow direction method may influence the accuracy of landslide predictive modeling. In the future, the results of this study should be verified by examining the root cohesion and its dynamic changes according to the tree species using the field hydrological monitoring technique.