• Title/Summary/Keyword: datasets

Search Result 2,046, Processing Time 0.029 seconds

Bootstrap Evaluation of Stem Density and Biomass Expansion Factors in Pinus rigida Stands in Korea (부트스트랩 시뮬레이션을 이용한 리기다소나무림의 줄기밀도와 바이오매스 확장계수 평가)

  • Seo, Yeon Ok;Lee, Young Jin;Pyo, Jung Kee;Kim, Rae Hyun;Son, Yeong Son;Lee, Kyeong Hak
    • Journal of Korean Society of Forest Science
    • /
    • v.100 no.4
    • /
    • pp.535-539
    • /
    • 2011
  • This study was conducted to examine the bootstrap evaluation of the stem density and biomass expansion factor for Pinus rigida plantations in Korea. The stem density ($g/cm^3$) in less than 20 tree years were 0.460 while more than 21 tree years were 0.456 respectively. Biomass expansion factor of less than 20 years and more than 21 years were 2.013, 1.171, respectively. The results of 100 and 500 bootstrap iterations, stem density ($g/cm^3$) in less than 20 years were 0.456~0.462 while more than 21 years were 0.457~0.456 respectively. Biomass expansion factor of less than 20 years and more than 21 years were 1.990~2.039, 1.173~1.170, respectively. The mean differences between observed biomass factor and average parameter estimates showed within 5 percent differences. The split datasets of younger stands and old stands were compared to the results of bootstrap simulations. The stem density in less than 20 years of mean difference were 0.441~1.049% while more than 21years were 0.123~0.206% respectively. Biomass expansion factor in less than 20 years and more than 21 years were -1.102~1.340%, -0.024~0.215% respectively. Younger stand had relatively higher errors compared to the old stand. The results of stem density and biomass expansion factor using the bootstrap simulation method indicated approximately 1.1% and 1.4%, respectively.

The Prediction of Cryptocurrency on Using Text Mining and Deep Learning Techniques : Comparison of Korean and USA Market (텍스트 마이닝과 딥러닝을 활용한 암호화폐 가격 예측 : 한국과 미국시장 비교)

  • Won, Jonggwan;Hong, Taeho
    • Knowledge Management Research
    • /
    • v.22 no.2
    • /
    • pp.1-17
    • /
    • 2021
  • In this study, we predicted the bitcoin prices of Bithum and Coinbase, a leading exchange in Korea and USA, using ARIMA and Recurrent Neural Networks(RNNs). And we used news articles from each country to suggest a separated RNN model. The suggested model identifies the datasets based on the changing trend of prices in the training data, and then applies time series prediction technique(RNNs) to create multiple models. Then we used daily news data to create a term-based dictionary for each trend change point. We explored trend change points in the test data using the daily news keyword data of testset and term-based dictionary, and apply a matching model to produce prediction results. With this approach we obtained higher accuracy than the model which predicted price by applying just time series prediction technique. This study presents that the limitations of the time series prediction techniques could be overcome by exploring trend change points using news data and various time series prediction techniques with text mining techniques could be applied to improve the performance of the model in the further research.

Face Identification Using a Near-Infrared Camera in a Nonrestrictive In-Vehicle Environment (적외선 카메라를 이용한 비제약적 환경에서의 얼굴 인증)

  • Ki, Min Song;Choi, Yeong Woo
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.3
    • /
    • pp.99-108
    • /
    • 2021
  • There are unrestricted conditions on the driver's face inside the vehicle, such as changes in lighting, partial occlusion and various changes in the driver's condition. In this paper, we propose a face identification system in an unrestricted vehicle environment. The proposed method uses a near-infrared (NIR) camera to minimize the changes in facial images that occur according to the illumination changes inside and outside the vehicle. In order to process a face exposed to extreme light, the normal face image is changed to a simulated overexposed image using mean and variance for training. Thus, facial classifiers are simultaneously generated under both normal and extreme illumination conditions. Our method identifies a face by detecting facial landmarks and aggregating the confidence score of each landmark for the final decision. In particular, the performance improvement is the highest in the class where the driver wears glasses or sunglasses, owing to the robustness to partial occlusions by recognizing each landmark. We can recognize the driver by using the scores of remaining visible landmarks. We also propose a novel robust rejection and a new evaluation method, which considers the relations between registered and unregistered drivers. The experimental results on our dataset, PolyU and ORL datasets demonstrate the effectiveness of the proposed method.

Verification of Cardiac Electrophysiological Features as a Predictive Indicator of Drug-Induced Torsades de pointes (약물의 염전성 부정맥 유발 예측 지표로서 심장의 전기생리학적 특징 값들의 검증)

  • Yoo, Yedam;Jeong, Da Un;Marcellinus, Aroli;Lim, Ki Moo
    • Journal of Biomedical Engineering Research
    • /
    • v.43 no.1
    • /
    • pp.19-26
    • /
    • 2022
  • The Comprehensive in vitro Proarrhythmic Assay(CiPA) project was launched for solving the hERG assay problem of being classified as high-risk groups even though they are low-risk drugs due to their high sensitivity. CiPA presented a protocol to predict drug toxicity using physiological data calculated based on the in-silico model. in this study, features calculated through the in-silico model are analyzed for correlation of changing action potential in the near future, and features are verified through predictive performance according to drug datasets. Using the O'Hara Rudy model modified by Dutta et al., Pearson correlation analysis was performed between 13 features(dVm/dtmax, APpeak, APresting, APD90, APD50, APDtri, Capeak, Caresting, CaD90, CaD50, CaDtri, qNet, qInward) calculated at 100 pacing, and between dVm/dtmax_repol calculated at 1,000 pacing, and linear regression analysis was performed on each of the 12 training drugs, 16 verification drugs, and 28 drugs. Indicators showing high coefficient of determination(R2) in the training drug dataset were qNet 0.93, AP resting 0.83, APDtri 0.78, Ca resting 0.76, dVm/dtmax 0.63, and APD90 0.61. The indicators showing high determinants in the validated drug dataset were APDtri 0.94, APD90 0.92, APD50 0.85, CaD50 0.84, qNet 0.76, and CaD90 0.64. Indicators with high coefficients of determination for all 28 drugs are qNet 0.78, APD90 0.74, and qInward 0.59. The indicators vary in predictive performance depending on the drug dataset, and qNet showed the same high performance of 0.7 or more on the training drug dataset, the verified drug dataset, and the entire drug dataset.

Comparative analysis of Machine-Learning Based Models for Metal Surface Defect Detection (머신러닝 기반 금속외관 결함 검출 비교 분석)

  • Lee, Se-Hun;Kang, Seong-Hwan;Shin, Yo-Seob;Choi, Oh-Kyu;Kim, Sijong;Kang, Jae-Mo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.6
    • /
    • pp.834-841
    • /
    • 2022
  • Recently, applying artificial intelligence technologies in various fields of production has drawn an upsurge of research interest due to the increase for smart factory and artificial intelligence technologies. A great deal of effort is being made to introduce artificial intelligence algorithms into the defect detection task. Particularly, detection of defects on the surface of metal has a higher level of research interest compared to other materials (wood, plastics, fibers, etc.). In this paper, we compare and analyze the speed and performance of defect classification by combining machine learning techniques (Support Vector Machine, Softmax Regression, Decision Tree) with dimensionality reduction algorithms (Principal Component Analysis, AutoEncoders) and two convolutional neural networks (proposed method, ResNet). To validate and compare the performance and speed of the algorithms, we have adopted two datasets ((i) public dataset, (ii) actual dataset), and on the basis of the results, the most efficient algorithm is determined.

Style-Based Transformer for Time Series Forecasting (시계열 예측을 위한 스타일 기반 트랜스포머)

  • Kim, Dong-Keon;Kim, Kwangsu
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.12
    • /
    • pp.579-586
    • /
    • 2021
  • Time series forecasting refers to predicting future time information based on past time information. Accurately predicting future information is crucial because it is used for establishing strategies or making policy decisions in various fields. Recently, a transformer model has been mainly studied for a time series prediction model. However, the existing transformer model has a limitation in that it has an auto-regressive structure in which the output result is input again when the prediction sequence is output. This limitation causes a problem in that accuracy is lowered when predicting a distant time point. This paper proposes a sequential decoding model focusing on the style transformation technique to handle these problems and make more precise time series forecasting. The proposed model has a structure in which the contents of past data are extracted from the transformer-encoder and reflected in the style-based decoder to generate the predictive sequence. Unlike the decoder structure of the conventional auto-regressive transformer, this structure has the advantage of being able to more accurately predict information from a distant view because the prediction sequence is output all at once. As a result of conducting a prediction experiment with various time series datasets with different data characteristics, it was shown that the model presented in this paper has better prediction accuracy than other existing time series prediction models.

CoAID+ : COVID-19 News Cascade Dataset for Social Context Based Fake News Detection (CoAID+ : 소셜 컨텍스트 기반 가짜뉴스 탐지를 위한 COVID-19 뉴스 파급 데이터)

  • Han, Soeun;Kang, Yoonsuk;Ko, Yunyong;Ahn, Jeewon;Kim, Yushim;Oh, Seongsoo;Park, Heejin;Kim, Sang-Wook
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.4
    • /
    • pp.149-156
    • /
    • 2022
  • In the current COVID-19 pandemic, fake news and misinformation related to COVID-19 have been causing serious confusion in our society. To accurately detect such fake news, social context-based methods have been widely studied in the literature. They detect fake news based on the social context that indicates how a news article is propagated over social media (e.g., Twitter). Most existing COVID-19 related datasets gathered for fake news detection, however, contain only the news content information, but not its social context information. In this case, the social context-based detection methods cannot be applied, which could be a big obstacle in the fake news detection research. To address this issue, in this work, we collect from Twitter the social context information based on CoAID, which is a COVID-19 news content dataset built for fake news detection, thereby building CoAID+ that includes both the news content information and its social context information. The CoAID+ dataset can be utilized in a variety of methods for social context-based fake news detection, thus would help revitalize the fake news detection research area. Finally, through a comprehensive analysis of the CoAID+ dataset in various perspectives, we present some interesting features capable of differentiating real and fake news.

Measuring the Goodness of Fit of Link Reduction Algorithms for Mapping Intellectual Structures in Bibliometric Analysis (계량서지적 분석에서 지적구조 매핑을 위한 링크 삭감 알고리즘의 적합도 측정)

  • Lee, Jae Yun
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.2
    • /
    • pp.233-254
    • /
    • 2022
  • Link reduction algorithms such as pathfinder network are the widely used methods to overcome problems with the visualization of weighted networks for knowledge domain analysis. This study proposed NetRSQ, an indicator to measure the goodness of fit of a link reduction algorithm for the network visualization. NetRSQ is developed to calculate the fitness of a network based on the rank correlation between the path length and the degree of association between entities. The validity of NetRSQ was investigated with data from previous research which qualitatively evaluated several network generation algorithms. As the primary test result, the higher degree of NetRSQ appeared in the network with better intellectual structures in the quality evaluation of networks built by various methods. The performance of 4 link reduction algorithms was tested in 40 datasets from various domains and compared with NetRSQ. The test shows that there is no specific link reduction algorithm that performs better over others in all cases. Therefore, the NetRSQ can be a useful tool as a basis of reliability to select the most fitting algorithm for the network visualization of intellectual structures.

Development of the Automatic Method for Detecting the National River Networks Using the Sentinel-2 Satellite Imagery -A Case Study for Han River, Seoul- (Sentinel-2 위성영상을 활용하여 국가하천망 제작을 위한 자동화 기술 개발 -서울시 한강을 사례로-)

  • KIM, Seon-Woo;KWON, Yong-Ha;CHUNG, Youn-In;CHOUNG, Yun-Jae
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.25 no.2
    • /
    • pp.88-99
    • /
    • 2022
  • The river network is one of the essential topographical characteristics in river management. The river network which as previously constructed by the ground surveying method has recently begun to be efficiently constructed using the remote sensing datasets. Since it is difficult to remove these obstacles such as bridges in the urban rivers, it is rare to construct the urban river networks with the various obstacles. In this study, the Sentinel-2 satellite imagery was used to develop the automatic method for detecting the urban river networks without the obstacles and with the preserved boundaries as follows. First, the normalized difference water index image was generated using the multispectral bands of the given Sentinel-2 satellite imagery, and the binary image that could classify the water body and other regions was generated. Next, the morphological operations were employed for detecting the complete river networks with the obstacles removed and the boundaries preserved. As a result of applying the proposed methodology to Han River in Seoul, the complete river networks with the obstacles removed and the boundaries preserved were well constructed.

Change Detection Using Deep Learning Based Semantic Segmentation for Nuclear Activity Detection and Monitoring (핵 활동 탐지 및 감시를 위한 딥러닝 기반 의미론적 분할을 활용한 변화 탐지)

  • Song, Ahram;Lee, Changhui;Lee, Jinmin;Han, Youkyung
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.991-1005
    • /
    • 2022
  • Satellite imaging is an effective supplementary data source for detecting and verifying nuclear activity. It is also highly beneficial in regions with limited access and information, such as nuclear installations. Time series analysis, in particular, can identify the process of preparing for the conduction of a nuclear experiment, such as relocating equipment or changing facilities. Differences in the semantic segmentation findings of time series photos were employed in this work to detect changes in meaningful items connected to nuclear activity. Building, road, and small object datasets made of KOMPSAT 3/3A photos given by AIHub were used to train deep learning models such as U-Net, PSPNet, and Attention U-Net. To pick relevant models for targets, many model parameters were adjusted. The final change detection was carried out by including object information into the first change detection, which was obtained as the difference in semantic segmentation findings. The experiment findings demonstrated that the suggested approach could effectively identify altered pixels. Although the suggested approach is dependent on the accuracy of semantic segmentation findings, it is envisaged that as the dataset for the region of interest grows in the future, so will the relevant scope of the proposed method.