• Title/Summary/Keyword: resampling

Sentiment Analysis of Product Reviews to Identify Deceptive Rating Information in Social Media: A SentiDeceptive Approach

  • Marwat, M. Irfan;Khan, Javed Ali;Alshehri, Dr. Mohammad Dahman;Ali, Muhammad Asghar;Hizbullah;Ali, Haider;Assam, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.3 / pp.830-860 / 2022
  • [Introduction] Many companies are moving their businesses online as customers increasingly prefer to shop online. [Problem] Users share a vast amount of information about products, which makes it difficult for end-users to reach a decision. [Motivation] A mechanism is therefore needed to automatically analyze the opinions, thoughts, and feelings that end-users express about products on social media platforms, which can help customers make or change their purchasing decisions. [Proposed Solution] We propose an automated SentiDeceptive approach, which classifies end-user reviews into negative, positive, and neutral sentiments and identifies deceptive crowd-user rating information on social media platforms to support user decision-making. [Methodology] We first collected 11781 end-user comments from the Amazon store and the Flipkart web application, covering distinct products such as watches, mobile phones, shoes, clothes, and perfumes. Next, we developed a coding guideline as the basis for the comment annotation process. We then applied content analysis and the existing VADER library to annotate the end-user comments with the identified codes, producing a labelled data set that serves as input to the machine learning classifiers. Finally, we applied sentiment analysis to identify end-user opinions and overcome deceptive rating information: we preprocessed the input data to remove irrelevant content (stop words, special characters, etc.), employed two standard resampling approaches (oversampling and under-sampling) to balance the data set, extracted different features (TF-IDF and BOW) from the textual data, and then trained and tested the machine learning algorithms using standard cross-validation approaches (KFold and Shuffle Split). [Results/Outcomes] To support the study, we also developed an automated tool that analyzes each piece of customer feedback and displays the collective customer sentiment about a specific product as a graph, which helps customers make decisions. In a nutshell, the proposed approach produces good results when identifying customer sentiment from online user feedback, i.e., it obtained an average of 94.01% precision, 93.69% recall, and 93.81% F-measure for classifying positive sentiments.
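The balancing step named in this abstract (oversampling and under-sampling) can be illustrated with a minimal sketch. This is not the authors' code: the function `resample_balanced`, the toy reviews, and the choice of purely random duplication and removal are assumptions made for illustration.

```python
import random
from collections import Counter, defaultdict

def resample_balanced(samples, labels, mode="over", seed=0):
    """Balance classes by random oversampling or undersampling (hypothetical helper)."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for sample, label in zip(samples, labels):
        by_class[label].append(sample)
    sizes = [len(items) for items in by_class.values()]
    # Oversampling grows every class to the largest one;
    # undersampling shrinks every class to the smallest one.
    target = max(sizes) if mode == "over" else min(sizes)
    out_samples, out_labels = [], []
    for label, items in by_class.items():
        if len(items) >= target:
            # Draw without replacement (keeps all items when already at target).
            chosen = rng.sample(items, target)
        else:
            # Keep every original item, then duplicate randomly chosen ones.
            chosen = items + rng.choices(items, k=target - len(items))
        out_samples.extend(chosen)
        out_labels.extend([label] * target)
    return out_samples, out_labels

# Toy, made-up data: four positive reviews, one negative, one neutral.
reviews = ["great watch", "love it", "works well", "nice shoes", "bad smell", "it is ok"]
sentiments = ["positive", "positive", "positive", "positive", "negative", "neutral"]

_, over_labels = resample_balanced(reviews, sentiments, mode="over")
_, under_labels = resample_balanced(reviews, sentiments, mode="under")
print(Counter(over_labels))
print(Counter(under_labels))
```

When resampling is combined with cross-validation, it is usually applied to the training folds only, so that duplicated items do not leak into the test fold.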

Automated Image Matching for Satellite Images with Different GSDs through Improved Feature Matching and Robust Estimation (특징점 매칭 개선 및 강인추정을 통한 이종해상도 위성영상 자동영상정합)

  • Ban, Seunghwan;Kim, Taejung
    • Korean Journal of Remote Sensing / v.38 no.6_1 / pp.1257-1271 / 2022
  • Many Earth observation optical satellites have recently been developed as demand for them has increased. Rapid preprocessing of satellite images has therefore become one of the most important problems for their active utilization. Satellite image matching is a technique in which two images are transformed and represented in one specific coordinate system. It is used to align different bands or to correct relative position errors between two satellite images. In this paper, we propose an automatic image matching method for satellite images with different ground sampling distances (GSDs). The method is based on improved feature matching and robust estimation of the transformation between satellite images. It consists of five processes: calculation of the overlapping area, improved feature detection, feature matching, robust estimation of the transformation, and image resampling. For feature detection, we extract the overlapping areas and resample them to equalize their GSDs. For feature matching, we used Oriented FAST and Rotated BRIEF (ORB) to improve matching performance. We performed image registration experiments with KOMPSAT-3A and RapidEye images. The performance of the proposed method was verified both qualitatively and quantitatively. The reprojection errors of image matching ranged from 1.277 to 1.608 pixels with respect to the GSD of the RapidEye images. Finally, we confirmed the feasibility of matching satellite images with heterogeneous GSDs using the proposed method.

Evaluation of NDVI Retrieved from Sentinel-2 and Landsat-8 Satellites Using Drone Imagery Under Rice Disease (드론 영상을 이용한 Sentinel-2, Landsat-8 위성 NDVI 평가: 벼 병해 발생 지역을 대상으로)

  • Ryu, Jae-Hyun;Ahn, Ho-yong;Na, Sang-Il;Lee, Byungmo;Lee, Kyung-do
    • Korean Journal of Remote Sensing / v.38 no.6_1 / pp.1231-1244 / 2022
  • Field crops are increasingly exposed to stress because of abnormal weather conditions. In South Korea, large-scale disease outbreaks have occurred in representative paddy rice cultivation areas. Field investigation of crop damage is limited by the large scale of the affected areas. Satellite-based remote sensing techniques are useful for monitoring crops at the city and county level, but the sensitivity of satellite-measured vegetation indices under abnormal crop growth should be evaluated. The goal of this study is to evaluate the satellite-based normalized difference vegetation index (NDVI) retrieved at different spatial scales using drone imagery. Sentinel-2 and Landsat-8 satellites were used, with spatial resolutions of 10 m and 30 m, respectively. Drone-based NDVI, resampled to the scale of the satellite data, had correlations of 0.867-0.940 with Sentinel-2 NDVI and 0.813-0.934 with Landsat-8 NDVI. When the effects of bias were minimized, the normalized root mean square error of Sentinel-2 NDVI relative to the drone NDVI was 0.2 to 2.8% lower than that of Landsat-8 NDVI. In addition, Sentinel-2 NDVI had constant error values regardless of disease damage, whereas Landsat-8 NDVI had different error values depending on the degree of disease. Considering the large error at the boundaries of agricultural fields, high spatial resolution data are more effective for monitoring crops.
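The comparison described here, resampling fine drone NDVI to the satellite pixel size and then correlating the two, can be sketched as follows. The block-averaging scheme, the function names, and all grid values are assumptions for illustration; the abstract does not state which resampling method the authors used.

```python
import math

def ndvi(nir, red):
    """Normalized difference vegetation index from NIR and red reflectance."""
    return (nir - red) / (nir + red)

def block_mean(grid, factor):
    """Aggregate a fine grid to a coarser one by averaging factor x factor blocks."""
    rows, cols = len(grid), len(grid[0])
    coarse = []
    for r in range(0, rows - rows % factor, factor):
        row = []
        for c in range(0, cols - cols % factor, factor):
            block = [grid[r + i][c + j] for i in range(factor) for j in range(factor)]
            row.append(sum(block) / len(block))
        coarse.append(row)
    return coarse

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Made-up values: a 4x4 drone NDVI grid and a 2x2 satellite NDVI grid
# covering the same area (each satellite pixel spans 2x2 drone pixels).
drone = [
    [0.8, 0.8, 0.4, 0.4],
    [0.8, 0.8, 0.4, 0.4],
    [0.6, 0.6, 0.2, 0.2],
    [0.6, 0.6, 0.2, 0.2],
]
satellite = [[0.75, 0.45], [0.55, 0.25]]

coarse = block_mean(drone, 2)
flat_drone = [v for row in coarse for v in row]
flat_sat = [v for row in satellite for v in row]
r = pearson(flat_drone, flat_sat)
print(coarse, round(r, 3))
```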

Analysis for Precipitation Trend and Elasticity of Precipitation-Streamflow According to Climate Changes (기후변화에 따른 강우 경향성 및 유출과의 탄성도 분석)

  • Shon, Tae Seok;Shin, Hyun Suk
    • KSCE Journal of Civil and Environmental Engineering Research / v.30 no.5B / pp.497-507 / 2010
  • Climate change greatly affects natural ecosystems and human social and economic systems by acting on the components of the climate system, such as air, ocean, life, glaciers, and land, and estimating its current impact is the most important step in adapting to it. This study took the Nakdong river watershed as the target area, investigated the impact of climate change by analyzing precipitation trends, and analyzed the elasticity of precipitation-streamflow to understand the impact of climate change on hydrological elements. For the precipitation trend analysis, precipitation data from the National Weather Service were collected at major points in the Nakdong river watershed and resampled at annual, seasonal, and monthly units. For the precipitation-streamflow elasticity analysis, area-average precipitation and long-term streamflow data provided by WAMIS were collected, and annual and seasonal time series were analyzed. In addition, the elasticity results of this study were compared with those of elasticity analyses from studies abroad to verify the validity of this study. The results can be used in planning to increase the flood control capacity of flood control structures in response to increased streamflow around the Nakdong river watershed due to climate change, and in planning adaptation of the water environment to climate change.
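The resampling of precipitation records to annual, seasonal, and monthly units can be sketched as a simple aggregation. The record layout, the function `aggregate_precipitation`, the sample values, and the meteorological season convention (December counted with the following year's winter) are all assumptions; the abstract does not specify them.

```python
from collections import defaultdict
from datetime import date

# Assumed meteorological seasons for the Northern Hemisphere.
SEASONS = {
    12: "winter", 1: "winter", 2: "winter",
    3: "spring", 4: "spring", 5: "spring",
    6: "summer", 7: "summer", 8: "summer",
    9: "autumn", 10: "autumn", 11: "autumn",
}

def aggregate_precipitation(records):
    """Sum daily precipitation (mm) into annual, seasonal, and monthly totals."""
    annual = defaultdict(float)
    seasonal = defaultdict(float)
    monthly = defaultdict(float)
    for day, mm in records:
        annual[day.year] += mm
        monthly[(day.year, day.month)] += mm
        # Assumption: December belongs to the winter of the following year.
        season_year = day.year + 1 if day.month == 12 else day.year
        seasonal[(season_year, SEASONS[day.month])] += mm
    return dict(annual), dict(seasonal), dict(monthly)

# Made-up daily records: (date, precipitation in mm).
records = [
    (date(2009, 12, 30), 5.0),
    (date(2010, 1, 2), 3.0),
    (date(2010, 7, 15), 120.0),
    (date(2010, 7, 16), 80.0),
]

annual, seasonal, monthly = aggregate_precipitation(records)
print(annual)
print(seasonal)
print(monthly)
```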

T1 Map-Based Radiomics for Prediction of Left Ventricular Reverse Remodeling in Patients With Nonischemic Dilated Cardiomyopathy

  • Suyon Chang;Kyunghwa Han;Yonghan Kwon;Lina Kim;Seunghyun Hwang;Hwiyoung Kim;Byoung Wook Choi
    • Korean Journal of Radiology / v.24 no.5 / pp.395-405 / 2023
  • Objective: This study aimed to develop and validate models using radiomics features on a native T1 map from cardiac magnetic resonance (CMR) to predict left ventricular reverse remodeling (LVRR) in patients with nonischemic dilated cardiomyopathy (NIDCM). Materials and Methods: Data from 274 patients with NIDCM who underwent CMR imaging with T1 mapping at Severance Hospital between April 2012 and December 2018 were retrospectively reviewed. Radiomic features were extracted from the native T1 maps. LVRR was determined using echocardiography performed ≥ 180 days after the CMR. The radiomics score was generated using the least absolute shrinkage and selection operator logistic regression models. Clinical, clinical + late gadolinium enhancement (LGE), clinical + radiomics, and clinical + LGE + radiomics models were built using a logistic regression method to predict LVRR. For internal validation of the result, bootstrap validation with 1000 resampling iterations was performed, and the optimism-corrected area under the receiver operating characteristic curve (AUC) with 95% confidence interval (CI) was computed. Model performance was compared using AUC with the DeLong test and bootstrap. Results: Among 274 patients, 123 (44.9%) were classified as LVRR-positive and 151 (55.1%) as LVRR-negative. The optimism-corrected AUC of the radiomics model in internal validation with bootstrapping was 0.753 (95% CI, 0.698-0.813). The clinical + radiomics model revealed a higher optimism-corrected AUC than that of the clinical + LGE model (0.794 vs. 0.716; difference, 0.078 [99% CI, 0.003-0.151]). The clinical + LGE + radiomics model significantly improved the prediction of LVRR compared with the clinical + LGE model (optimism-corrected AUC of 0.811 vs. 0.716; difference, 0.095 [99% CI, 0.022-0.139]). Conclusion: The radiomic characteristics extracted from a non-enhanced T1 map may improve the prediction of LVRR and offer added value over traditional LGE in patients with NIDCM. Additional external validation research is required.
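The bootstrap internal validation mentioned in this abstract can be sketched as an optimism correction: refit the model on each bootstrap resample, measure how much better it scores on its own resample than on the original data, and subtract the average gap from the apparent AUC. Everything below is an illustrative assumption, including the synthetic data and the toy centroid-difference model, which stands in for the study's LASSO and logistic regression models.

```python
import random

def auc(scores, labels):
    """AUC as the share of positive-negative pairs ranked correctly (ties count half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def fit_centroid_model(X, y):
    """Toy stand-in model: weights = positive-class centroid minus negative-class centroid."""
    pos = [x for x, label in zip(X, y) if label == 1]
    neg = [x for x, label in zip(X, y) if label == 0]
    return [
        sum(x[d] for x in pos) / len(pos) - sum(x[d] for x in neg) / len(neg)
        for d in range(len(X[0]))
    ]

def predict(weights, X):
    """Linear score for each row of X."""
    return [sum(w * v for w, v in zip(weights, x)) for x in X]

def optimism_corrected_auc(X, y, n_boot=1000, seed=0):
    """Return (apparent AUC, optimism-corrected AUC) from bootstrap resampling."""
    rng = random.Random(seed)
    n = len(y)
    apparent = auc(predict(fit_centroid_model(X, y), X), y)
    optimism = []
    while len(optimism) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]  # sample rows with replacement
        Xb, yb = [X[i] for i in idx], [y[i] for i in idx]
        if len(set(yb)) < 2:
            continue  # resample is missing a class; draw again
        weights = fit_centroid_model(Xb, yb)
        boot_auc = auc(predict(weights, Xb), yb)  # performance on the resample
        orig_auc = auc(predict(weights, X), y)    # same model on the original data
        optimism.append(boot_auc - orig_auc)
    return apparent, apparent - sum(optimism) / n_boot

# Synthetic data: 60 cases, two features, the first shifted for positives.
data_rng = random.Random(1)
y = [i % 2 for i in range(60)]
X = [[data_rng.gauss(0.8 * label, 1.0), data_rng.gauss(0.0, 1.0)] for label in y]

apparent, corrected = optimism_corrected_auc(X, y)
print(round(apparent, 3), round(corrected, 3))
```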

Quality of Radiomics Research on Brain Metastasis: A Roadmap to Promote Clinical Translation

  • Chae Jung Park;Yae Won Park;Sung Soo Ahn;Dain Kim;Eui Hyun Kim;Seok-Gu Kang;Jong Hee Chang;Se Hoon Kim;Seung-Koo Lee
    • Korean Journal of Radiology / v.23 no.1 / pp.77-88 / 2022
  • Objective: Our study aimed to evaluate the quality of radiomics studies on brain metastases based on the radiomics quality score (RQS), Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) checklist, and the Image Biomarker Standardization Initiative (IBSI) guidelines. Materials and Methods: PubMed MEDLINE and EMBASE were searched for articles on radiomics for evaluating brain metastases, published until February 2021. Of the 572 articles, 29 relevant original research articles were included and evaluated according to the RQS, TRIPOD checklist, and IBSI guidelines. Results: External validation was performed in only three studies (10.3%). The median RQS was 3.0 (range, -6 to 12), with a low basic adherence rate of 50.0%. The adherence rate was low for comparison to the "gold standard" (10.3%), stating the potential clinical utility (10.3%), performing the cut-off analysis (3.4%), reporting calibration statistics (6.9%), and providing open science and data (3.4%). None of the studies involved test-retest or phantom studies, prospective studies, or cost-effectiveness analyses. The overall rate of adherence to the TRIPOD checklist was 60.3% and low for reporting title (3.4%), blind assessment of outcome (0%), description of the handling of missing data (0%), and presentation of the full prediction model (0%). The majority of studies lacked pre-processing steps, with bias-field correction, isovoxel resampling, skull stripping, and gray-level discretization performed in only six (20.7%), nine (31.0%), four (13.8%), and four (13.8%) studies, respectively. Conclusion: The overall scientific and reporting quality of radiomics studies on brain metastases published during the study period was insufficient. Radiomics studies should adhere to the RQS, TRIPOD, and IBSI guidelines to facilitate the translation of radiomics into the clinical field.

Prediction of the Gold-silver Deposits from Geochemical Maps - Applications to the Bayesian Geostatistics and Decision Tree Techniques (지화학자료를 이용한 금·은 광산의 배태 예상지역 추정-베이시안 지구통계학과 의사나무 결정기법의 활용)

  • Hwang, Sang-Gi;Lee, Pyeong-Koo
    • Economic and Environmental Geology / v.38 no.6 s.175 / pp.663-673 / 2005
  • This study investigates the relationship between geochemical maps and the locations of gold-silver deposits. Geochemical maps of 21 elements published by KIGAM, the locations of gold-silver deposits, and the 1:1,000,000 scale geological map of Korea are used for this investigation. The pixel size of the basic geochemical maps is 250 m, and these data are resampled at 1 km spacing for the statistical analyses. The relationship between mine locations and the geochemical data is investigated using Bayesian statistics and decision tree algorithms. For the Bayesian statistics, each geochemical map is reclassified by percentile divisions that split the data into 5, 25, 50, 75, 95, and 100% groups. The number of mine locations in each division is counted and the probabilities are calculated. The posterior probability of each pixel is calculated using the probabilities from the 21 geochemical maps and the geological map. A prediction map of mine locations is made by plotting the posterior probability. The input parameters for the decision tree construction are the 21 geochemical elements and lithology, and the output parameters are 5 types of mines (Ag/Au, Cu, Fe, Pb/Zn, W) and the absence of a mine. The mine-absence locations are selected by resampling the overall area at 1 km spacing and eliminating any resampled points within 750 m of a mine location. A prediction map of each mine area is produced by applying the decision tree to every pixel. The prediction by the Bayesian method is slightly better than that of the decision tree, but both prediction maps show a reasonable match with the input mine locations. We interpret this match as indicating that the rules produced by both methods are reasonable and that the geochemical data are therefore strongly related to the mine locations. This implies that the geochemical rules could be used as background values of mine locations and therefore for the evaluation of mine contamination. Bayesian statistics indicated that the probability of an Au/Ag deposit increases as CaO, Cu, MgO, MnO, Pb, and Li increase and as Zr decreases.
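The selection of mine-absence locations (a 1 km grid with points near mines removed) can be sketched as below. The function `absence_points`, the planar metre coordinates, the study-area extent, and the mine positions are hypothetical; treating a point at exactly 750 m as eliminated is also an assumption.

```python
import math

def absence_points(x_min, y_min, x_max, y_max, mines, spacing=1000.0, buffer=750.0):
    """Grid the area at `spacing` metres and drop points within `buffer` metres of any mine."""
    nx = int((x_max - x_min) // spacing) + 1
    ny = int((y_max - y_min) // spacing) + 1
    points = []
    for i in range(nx):
        for j in range(ny):
            x, y = x_min + i * spacing, y_min + j * spacing
            # Keep the point only if every mine is farther away than the buffer.
            if all(math.hypot(x - mx, y - my) > buffer for mx, my in mines):
                points.append((x, y))
    return points

# Made-up 3 km x 3 km area with two mine locations (coordinates in metres).
mines = [(1000.0, 1000.0), (2400.0, 2000.0)]
points = absence_points(0.0, 0.0, 3000.0, 3000.0, mines)
print(len(points))
```

In this toy case the 4 x 4 grid has 16 points; the point on the first mine and the two points within 750 m of the second mine are dropped.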

A Study on Automated Lineament Extraction with Respect to Spatial Resolution of Digital Elevation Model (수치표고모형 공간해상도에 따른 선구조 자동 추출 연구)

  • Park, Seo-Woo;Kim, Geon-Il;Shin, Jin-Ho;Hong, Sang-Hoon
    • Korean Journal of Remote Sensing / v.34 no.3 / pp.439-450 / 2018
  • A lineament is a linear or curved terrain element that separates adjacent geological structures from each other. It has been widely used in the analysis of geology, mineral exploration, natural disasters, earthquakes, etc. In the past, lineaments were extracted using cartographic maps or field surveys. Thanks to the development of remote sensing techniques, however, lineaments can now be extracted more efficiently over very wide areas. Remotely sensed observations from aircraft or satellites, or digital elevation models (DEMs), have been used for visual recognition in manual lineament extraction. Automatic computer-based approaches have been proposed to extract lineaments more objectively. In this study, we evaluate the characteristics of automatically extracted lineaments with respect to differences in the spatial resolution of the DEM. We used two types of DEM: the Shuttle Radar Topography Mission (SRTM) DEM with a spatial resolution of about 90 m (3 arc sec), and the latest world DEM of TerraSAR-X add-on for Global DEM with 12 m spatial resolution. In addition, a global DEM was resampled to produce a DEM with a spatial resolution of 30 m (1 arc sec). Shaded relief maps were constructed for various sun elevation and solar azimuth angles. To extract lineaments automatically, we used the LINE module in PCI Geomatica software. We found that the predominant direction of the extracted lineaments is about N15-25°E (NNE), regardless of the spatial resolution of the DEM. However, finer and more detailed lineaments were extracted from the higher-resolution DEM. The results show that lineament density is proportional to the spatial resolution of the DEM. Thus, a DEM with appropriate spatial resolution should be selected according to the purpose of the study.

A Reflectance Normalization Via BRDF Model for the Korean Vegetation using MODIS 250m Data (한반도 식생에 대한 MODIS 250m 자료의 BRDF 효과에 대한 반사도 정규화)

  • Yeom, Jong-Min;Han, Kyung-Soo;Kim, Young-Seup
    • Korean Journal of Remote Sensing / v.21 no.6 / pp.445-456 / 2005
  • Land surface parameters should be determined with sufficient accuracy because they play an important role in climate change near the ground. Because surface reflectance is strongly anisotropic, off-nadir viewing makes observations strongly dependent on the Sun-target-sensor geometry. These surface angular effects contribute random noise. The principal objective of this study is to provide a database of accurate surface reflectance, with the angular effects removed, from MODIS 250 m reflective channel data over Korea. The MODIS (Moderate Resolution Imaging Spectroradiometer) sensor provides visible and near-infrared channel reflectance at 250 m resolution on a daily basis. The successive processing steps were first performed on a per-pixel basis to remove cloudy pixels. Geometric distortion was corrected by nearest neighbor resampling using a 2nd-order polynomial obtained from the geolocation information of the MODIS data set. To correct the surface anisotropy effects, this paper applied a semiempirical kernel-driven Bidirectional Reflectance Distribution Function (BRDF) model. The algorithm inverts the kernel-driven model against the angular components (viewing zenith angle, solar zenith angle, viewing azimuth angle, and solar azimuth angle) of the reflectance observed by the satellite. First, we compose sets of observations over a 31-day period to fit the BRDF model. Next, nadir-view reflectance normalization is carried out by modifying the angular components separated by the BRDF model for each spectral band and each pixel. Modeled reflectance values show good agreement with measured reflectance values, with an overall RMSE (Root Mean Square Error) of about 0.01 (maximum = 0.03). Finally, we provide a normalized surface reflectance database consisting of 36 images for 2001 over Korea.