• Title/Summary/Keyword: Data Input Approach

Search Result 787, Processing Time 0.026 seconds

Sentiment Analysis of Product Reviews to Identify Deceptive Rating Information in Social Media: A SentiDeceptive Approach

  • Marwat, M. Irfan;Khan, Javed Ali;Alshehri, Dr. Mohammad Dahman;Ali, Muhammad Asghar;Hizbullah;Ali, Haider;Assam, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.3
    • /
    • pp.830-860
    • /
    • 2022
  • [Introduction] Nowadays, many companies are shifting their businesses online due to the growing trend among customers to buy and shop online, as people prefer online purchasing products. [Problem] Users share a vast amount of information about products, making it difficult and challenging for the end-users to make certain decisions. [Motivation] Therefore, we need a mechanism to automatically analyze end-user opinions, thoughts, or feelings in the social media platform about the products that might be useful for the customers to make or change their decisions about buying or purchasing specific products. [Proposed Solution] For this purpose, we proposed an automated SentiDecpective approach, which classifies end-user reviews into negative, positive, and neutral sentiments and identifies deceptive crowd-users rating information in the social media platform to help the user in decision-making. [Methodology] For this purpose, we first collected 11781 end-users comments from the Amazon store and Flipkart web application covering distant products, such as watches, mobile, shoes, clothes, and perfumes. Next, we develop a coding guideline used as a base for the comments annotation process. We then applied the content analysis approach and existing VADER library to annotate the end-user comments in the data set with the identified codes, which results in a labelled data set used as an input to the machine learning classifiers. Finally, we applied the sentiment analysis approach to identify the end-users opinions and overcome the deceptive rating information in the social media platforms by first preprocessing the input data to remove the irrelevant (stop words, special characters, etc.) data from the dataset, employing two standard resampling approaches to balance the data set, i-e, oversampling, and under-sampling, extract different features (TF-IDF and BOW) from the textual data in the data set and then train & test the machine learning algorithms by applying a standard cross-validation approach (KFold and Shuffle Split). [Results/Outcomes] Furthermore, to support our research study, we developed an automated tool that automatically analyzes each customer feedback and displays the collective sentiments of customers about a specific product with the help of a graph, which helps customers to make certain decisions. In a nutshell, our proposed sentiments approach produces good results when identifying the customer sentiments from the online user feedbacks, i-e, obtained an average 94.01% precision, 93.69% recall, and 93.81% F-measure value for classifying positive sentiments.

Quality Control of Two Dimensions Using Digital Image Processing and Neural Networks (디지털 영상처리와 신경망을 이용한 2차원 평면 물체 품질 제어)

  • Kim, Jin-Hwan;Seo, Bo-Hyeok;Park, Seong-Wook
    • Proceedings of the KIEE Conference
    • /
    • 2004.07d
    • /
    • pp.2580-2582
    • /
    • 2004
  • In this paper, a Neural Network(NN) based approach for classification of two dimensions images. The proposed algorithm is able to apply in the actual industry. The described diagnostic algorithm is presented to defect surface failures on tiles. A way to get data for a digital image process is several kinds of it. The tiles are scanned and the digital images are preprocessed and classified using neural networks. It is important to reduce the amount of input data with problem specific preprocessing. The auto-associative neural network is used for feature generation and selection while the probabilistic neural network is used for classification. The proposed algorithm is evaluated experimentally using one hundred of the real tile images. Sample image data to preprocess have histogram. The histogram is used as input value of probabilistic neural network. Auto-associative neural network compress input data and compressed data is classified using probabilistic neural network. Classified sample images are determined by human state. So it is intervened human subjectivity. But digital image processing and neural network are better than human classification ability. Therefore it is very useful of quality control improvement.

  • PDF

Using Artificial Neural Network in the reverse design of a composite sandwich structure

  • Mortda M. Sahib;Gyorgy Kovacs
    • Structural Engineering and Mechanics
    • /
    • v.85 no.5
    • /
    • pp.635-644
    • /
    • 2023
  • The design of honeycomb sandwich structures is often challenging because these structures can be tailored from a variety of possible cores and face sheets configurations, therefore, the design of sandwich structures is characterized as a time-consuming and complex task. A data-driven computational approach that integrates the analytical method and Artificial Neural Network (ANN) is developed by the authors to rapidly predict the design of sandwich structures for a targeted maximum structural deflection. The elaborated ANN reverse design approach is applied to obtain the thickness of the sandwich core, the thickness of the laminated face sheets, and safety factors for composite sandwich structure. The required data for building ANN model were obtained using the governing equations of sandwich components in conjunction with the Monte Carlo Method. Then, the functional relationship between the input and output features was created using the neural network Backpropagation (BP) algorithm. The input variables were the dimensions of the sandwich structure, the applied load, the core density, and the maximum deflection, which was the reverse input given by the designer. The outstanding performance of reverse ANN model revealed through a low value of mean square error (MSE) together with the coefficient of determination (R2) close to the unity. Furthermore, the output of the model was in good agreement with the analytical solution with a maximum error 4.7%. The combination of reverse concept and ANN may provide a potentially novel approach in designing of sandwich structures. The main added value of this study is the elaboration of a reverse ANN model, which provides a low computational technique as well as savestime in the design or redesign of sandwich structures compared to analytical and finite element approaches.

Anomaly Detection and Diagnostics (ADD) Based on Support Vector Data Description (SVDD) for Energy Consumption in Commercial Building (SVDD를 활용한 상업용 건물에너지 소비패턴의 이상현상 감지)

  • Chae, Young-Tae
    • Journal of Korean Institute of Architectural Sustainable Environment and Building Systems
    • /
    • v.12 no.6
    • /
    • pp.579-590
    • /
    • 2018
  • Anomaly detection on building energy consumption has been regarded as an effective tool to reduce energy saving on building operation and maintenance. However, it requires energy model and FDD expert for quantitative model approach or large amount of training data for qualitative/history data approach. Both method needs additional time and labors. This study propose a machine learning and data science approach to define faulty conditions on hourly building energy consumption with reducing data amount and input requirement. It suggests an application of Support Vector Data Description (SVDD) method on training normal condition of hourly building energy consumption incorporated with hourly outdoor air temperature and time integer in a week, 168 data points and identifying hourly abnormal condition in the next day. The result shows the developed model has a better performance when the ${\nu}$ (probability of error in the training set) is 0.05 and ${\gamma}$ (radius of hyper plane) 0.2. The model accuracy to identify anomaly operation ranges from 70% (10% increase anomaly) to 95% (20% decrease anomaly) for daily total (24 hours) and from 80% (10% decrease anomaly) to 10%(15% increase anomaly) for occupied hours, respectively.

Free-form Surface Generation from Measuring Points using Laser Scanner

  • Park, Jae-Won;Hur, Sugn-Min;Lee, Seok-Hee
    • International Journal of Precision Engineering and Manufacturing
    • /
    • v.3 no.4
    • /
    • pp.15-23
    • /
    • 2002
  • With the development of a laser scanner of high precision and increased speed, reverse engineering becomes a key approach to reduce the time for the development of new products. But the modeling process is not so automated enough until now. Modeling in real workshops is usually performed by the experienced operators and it requires a skillful technique to get the resultant surface of high quality and precision. In this paper, a systematic solution is proposed to automate the free-form surface generation from the measured point data. Compatibility is imposed to the measured point data during input curve generation. And the compatibility of cross-sectional curve is also considered for the loft surface generation. The data in each step is produced in IGES file format to make an easy interface to other CAD/CAM software without any further data manipulation.

Prediction of Crest Settlement of Center Cored Rockfill Dam using an Artificial Neural Network Model (인공신경망기법을 이용한 중심차수벽형 석괴댐의 정부침하량 예측)

  • Kim, Yong-Seong;Kim, Bum-Joo;Oh, Sang-Eun
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.54 no.4
    • /
    • pp.73-81
    • /
    • 2012
  • In this study, the settlement data of 32 center cored rockfill dams (total 39 monitored data) were collected and analyzed to develop the method to predict the crest settlement of a CCRD after impounding by using the internal settlement data occurred during construction. An artificial neural network (ANN) modeling was used in developing the method, which was considered to be a more reliable approach since in the ANN model dam height, core width, and core type were all considered as input variables in deriving the crest settlement, whereas in conventional methods, such as Clements's method, only dam height is used as a variable. The ANN analysis results showed a good agreement with the measured data, compared to those by the conventional methods using regression analysis. In addition, a simple procedure to use the ANN model for engineers in practice was provided by proposing the equations used for given input values.

a improved neighborhood selection of simulated annealing technique for test data generation (테스트 데이터 생성을 위한 개선된 이웃 선택 방법을 이용한 담금질 기법 기술)

  • Choi, Hyun Jae;Lee, Seon Yeol;Chae, Heung Seok
    • Journal of Software Engineering Society
    • /
    • v.24 no.2
    • /
    • pp.35-45
    • /
    • 2011
  • Simulated annealing has been studied a long times. And it is one of the effective techniques for test data generation. But basic SA methods showed bad performance because of neighborhood selection strategies in the case of large input domain. To overcome this limitation, we propose new neighborhood selection approach, Branch Distance. We performs case studies based on the proposed approach to evaluate it's performance and to compare it whit basic SA and Random test generation. The results of the case studies appear that proposed approach show better performance than the other approach.

  • PDF

Risk Analysis of Thaw Penetration Due to Global Climate Change in Cold Regions

  • Bae, Yoon-Shin
    • Journal of the Korean Society of Hazard Mitigation
    • /
    • v.9 no.2
    • /
    • pp.45-51
    • /
    • 2009
  • A probabilistic approach may be adopted to predict freeze and thaw depths to account for the variability of (1) material properties, and (2) contemporary and future surface energy input parameters(e.g. air temperatures, cloud cover, snow cover) predicted with global climate models. To illustrate the probabilistic approach, an example of the predicted of thaw depths in cold regions is considered. More specifically, the Stefan equation is used together with the Monte Carlo simulation technique to make a probabilistic prediction of thaw penetration. The simulation results indicate that the variability in material properties, surface energy input parameters and temperature data can lead to significant uncertainty in predicting thaw penetration.

Time-Discretization of Delayed Multi-Input Nonlinear System Using A new algorithm

  • Qiang, Zhang;Zhang, Zheng;Kim, Sung-Jung;Chong, Kil-To
    • Proceedings of the KIEE Conference
    • /
    • 2007.04a
    • /
    • pp.89-91
    • /
    • 2007
  • In this paper, a new approach for a sampled-data representation of nonlinear system that has time-delayed multi-input is proposed. That is largely devoid of illconditioning and is suitable for any nonlinear problem. The new scheme is applied to nonlinear systems with two or three inputs; and then the delayed multi-input general equation is derived. The method is based on thematrix exponential theory. Itdoes not require excessive computational resources and lends itself to a short and robust piece of software that can be easily inserted into large simulation packages. A performance of the proposed method is evaluated using a nonlinear system with time-delay: maneuvering an automobile.

  • PDF

Adaptive image contrast enhancement algorithm based on block approach (블럭방법에 근거한 영상의 적응적 대비증폭 알고리즘)

  • Kim, Yeong-Hwa
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.3
    • /
    • pp.371-380
    • /
    • 2011
  • The noise caused by a variety of reasons worsens the quality of input image when we use the images reproducing device. The basic difficulty to solve this problem is that the noise and the signal are difficult to be distinguished. Contrast enhancement such as unsharp masking is one of the most important procedures to improve the quality of input images. The conventional unsharp masking enhances the images by adding their amplified high frequency components. The noise component of the input images, however, also tends to be amplified due to the nature of the unsharp masking. This paper considers the block approach for detecting niose and image feature of the input image so that the unsharp masking could be adaptively applied accordingly. Simulation results show that it is made possible to enhance contrast of the image without boosting up the noisy components by applying the proposed algorithm.