• Title/Summary/Keyword: Data Quality Testing

Search Result 509, Processing Time 0.033 seconds

Stock News Dataset Quality Assessment by Evaluating the Data Distribution and the Sentiment Prediction

  • Alasmari, Eman;Hamdy, Mohamed;Alyoubi, Khaled H.;Alotaibi, Fahd Saleh
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.2
    • /
    • pp.1-8
    • /
    • 2022
  • This work provides a reliable and classified stocks dataset merged with Saudi stock news. This dataset allows researchers to analyze and better understand the realities, impacts, and relationships between stock news and stock fluctuations. The data were collected from the Saudi stock market via the Corporate News (CN) and Historical Data Stocks (HDS) datasets. As their names suggest, CN contains news, and HDS provides information concerning how stock values change over time. Both datasets cover the period from 2011 to 2019, have 30,098 rows, and have 16 variables-four of which they share and 12 of which differ. Therefore, the combined dataset presented here includes 30,098 published news pieces and information about stock fluctuations across nine years. Stock news polarity has been interpreted in various ways by native Arabic speakers associated with the stock domain. Therefore, this polarity was categorized manually based on Arabic semantics. As the Saudi stock market massively contributes to the international economy, this dataset is essential for stock investors and analyzers. The dataset has been prepared for educational and scientific purposes, motivated by the scarcity of data describing the impact of Saudi stock news on stock activities. It will, therefore, be useful across many sectors, including stock market analytics, data mining, statistics, machine learning, and deep learning. The data evaluation is applied by testing the data distribution of the categories and the sentiment prediction-the data distribution over classes and sentiment prediction accuracy. The results show that the data distribution of the polarity over sectors is considered a balanced distribution. The NB model is developed to evaluate the data quality based on sentiment classification, proving the data reliability by achieving 68% accuracy. So, the data evaluation results ensure dataset reliability, readiness, and high quality for any usage.

Evaluation of Genome Based Estimated Breeding Values for Meat Quality in a Berkshire Population Using High Density Single Nucleotide Polymorphism Chips

  • Baby, S.;Hyeong, K.E.;Lee, Y.M.;Jung, J.H.;Oh, D.Y.;Nam, K.C.;Kim, T.H.;Lee, H.K.;Kim, Jong-Joo
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.27 no.11
    • /
    • pp.1540-1547
    • /
    • 2014
  • The accuracy of genomic estimated breeding values (GEBV) was evaluated for sixteen meat quality traits in a Berkshire population (n = 1,191) that was collected from Dasan breeding farm, Namwon, Korea. The animals were genotyped with the Illumina porcine 62 K single nucleotide polymorphism (SNP) bead chips, in which a set of 36,605 SNPs were available after quality control tests. Two methods were applied to evaluate GEBV accuracies, i.e. genome based linear unbiased prediction method (GBLUP) and Bayes B, using ASREML 3.0 and Gensel 4.0 software, respectively. The traits composed different sets of training (both genotypes and phenotypes) and testing (genotypes only) data. Under the GBLUP model, the GEBV accuracies for the training data ranged from $0.42{\pm}0.08$ for collagen to $0.75{\pm}0.02$ for water holding capacity with an average of $0.65{\pm}0.04$ across all the traits. Under the Bayes B model, the GEBV accuracy ranged from $0.10{\pm}0.14$ for National Pork Producers Council (NPCC) marbling score to $0.76{\pm}0.04$ for drip loss, with an average of $0.49{\pm}0.10$. For the testing samples, the GEBV accuracy had an average of $0.46{\pm}0.10$ under the GBLUP model, ranging from $0.20{\pm}0.18$ for protein to $0.65{\pm}0.06$ for drip loss. Under the Bayes B model, the GEBV accuracy ranged from $0.04{\pm}0.09$ for NPCC marbling score to $0.72{\pm}0.05$ for drip loss with an average of $0.38{\pm}0.13$. The GEBV accuracy increased with the size of the training data and heritability. In general, the GEBV accuracies under the Bayes B model were lower than under the GBLUP model, especially when the training sample size was small. Our results suggest that a much greater training sample size is needed to get better GEBV accuracies for the testing samples.

MSCTest: An Automated Testing Tool for Embedded Software (MSCTest: 내장 소프트웨어 테스트를 위한 자동화 도구)

  • Lee, Nam-Hee;Seo, Sun-Ae;Kim, Tae-Hyo;Cha, Sung-Deok;Lee, Jae-Won;Park, Ki-Woong
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.6 no.2
    • /
    • pp.187-195
    • /
    • 2000
  • Embedded software generates its outputs using current states of the system as well as external inputs. When a module in embedded software is tested, we need an automated testing tool, which generates possible sequences to reach the module as well as input data of the module, to reduce the testing time and to improve the quality of software. In this paper, we use decision table to specify the functionality of the module and data-annotated MSC (Message Sequence Charts) to describe scenarios, and implement a tool, which we call MSCTest, to automate the testing process. MSCTest consists of MSC graphic editor, test sequence and data generator, and test driver generator. MSCTest is effectively applied to test EsWin which is a kind of window library used in embedded systems.

  • PDF

Computational assessment of blockage and wind simulator proximity effects for a new full-scale testing facility

  • Bitsuamlak, Girma T.;Dagnew, Agerneh;Chowdhury, Arindam Gan
    • Wind and Structures
    • /
    • v.13 no.1
    • /
    • pp.21-36
    • /
    • 2010
  • A new full scale testing apparatus generically named the Wall of Wind (WoW) has been built by the researchers at the International Hurricane Research Center (IHRC) at Florida International University (FIU). WoW is capable of testing single story building models subjected up to category 3 hurricane wind speeds. Depending on the relative model and WoW wind field sizes, testing may entail blockage issues. In addition, the proximity of the test building to the wind simulator may also affect the aerodynamic data. This study focuses on the Computational Fluid Dynamics (CFD) assessment of the effects on the quality of the aerodynamic data of (i) blockage due to model buildings of various sizes and (ii) wind simulator proximity for various distances between the wind simulator and the test building. The test buildings were assumed to have simple parallelepiped shapes. The computer simulations were performed under both finite WoW wind-field conditions and in an extended Atmospheric Boundary Layer (ABL) wind flow. Mean pressure coefficients for the roof and the windward and leeward walls served as measures of the blockage and wind simulator proximity effects. The study uses the commercial software FLUENT with Reynolds Averaged Navier Stokes equations and a Renormalization Group (RNG) k-${\varepsilon}$ turbulence model. The results indicated that for larger size test specimens (i.e. for cases where the height of test specimen is larger than one third of the wind field height) blockage correction may become necessary. The test specimen should also be placed at a distance greater than twice the height of the test specimen from the fans to reduce proximity effect.

Modelling and simulation of a closed-loop electrodynamic shaker and test structure model for spacecraft vibration testing

  • Waimer, Steffen;Manzato, Simone;Peeters, Bart;Wagner, Mark;Guillaume, Patrick
    • Advances in aircraft and spacecraft science
    • /
    • v.5 no.2
    • /
    • pp.205-223
    • /
    • 2018
  • During launch a spacecraft is subjected to a variety of dynamical loads transmitted through the launcher to spacecraft interface or air-born transmission excitations in the acoustic pressure field inside the fairing. As a result, spacecraft are tested on ground to ensure and demonstrate the global integrity of the structure against these loads, to screen the flight hardware for quality of workmanship and to validate mathematical models. This paper addresses the numerical modelling and simulation of the low frequency sine and random vibration tests performed on electrodynamic shaker facilities to comprise the mechanical-borne transmission loads through the launcher to spacecraft interface. Consequently, the paper reviews techniques and methodologies to derive a reliable and representative coupled virtual vibration testing simulation environment based on experimental data. These technologies are explored with the main objectives to ensure a stable, reliable and accurate control while testing. As a result, the use of the derived simulation models in combination with the added value of improved control and signal processing algorithms can lead to a safer and smoother vibration test control of the entire environmental test campaign.

Development of Accelerated Life Test Method for Constant Electrical Potential Electrolysis Gas Sensor (정전위 전해식 가스센서의 가속수명시험법 개발)

  • Yang, Il Young;Kang, Jun Gu;Yu, Sang Woo;Oh, Geun Tae;Na, Yoon Gyoon
    • Journal of Applied Reliability
    • /
    • v.16 no.3
    • /
    • pp.180-191
    • /
    • 2016
  • Purpose: The purpose of this study was to develop the accelerated life test method for Constant Electrical Potential Electrolysis gas sensor (CEPE gas sensor). Methods: The parts and modules of CEPE gas sensor were analyzed by using Reliability Block Diagram (RBD). Failure Mode and Effect Analysis (FMEA) and Quality Function Deployment (QFD) methods were performed for each part to determine the most affecting stress factor in its life cycle. The long term testing was conducted at three different dry heat levels and the acceleration factor was developed by using Arrhenius relationship. Conclusion: The acceleration factor for CEPE gas sensor was developed by using FMEA, QFD, and statistical analysis for its failure data. Also qualification tests were designed to meet the target life.

Application of Digital Signal Analysis Technique to Enhance the Quality of Tracer Gas Measurements in IAQ Model Tests

  • Lee, Hee-Kwan;Awbi, Hazim B.
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.23 no.E2
    • /
    • pp.66-73
    • /
    • 2007
  • The introduction of tracer gas techniques to ventilation studies in indoor environments provides valuable information that used to be unattainable from conventional testing environments. Data acquisition systems (DASs) containing analogue-to-digital (A/D) converters are usually used to function the key role that records signals to storage in digital format. In the testing process, there exist a number of components in the measuring equipment which may produce system-based inference to the monitored results. These unwanted fluctuations may cause significant error in data analysis, especially when non-linear algorithms are involved. In this study, a pre-processor is developed and applied to separate the unwanted fluctuations (noise or interference) in raw measurements and to reduce the uncertainty in the measurement. Moving average, notch filter, FIR (Finite Impulse Response) filters, and IIR (Infinite Impulse Response) filters are designed and applied to collect the desired information from the raw measurements. Tracer gas concentrations are monitored during leakage and ventilation tests in the model test room. The signal analysis functions are introduced to carry out the digital signal processing (DSP) work. Overall the FIR filters process the $CO_2$ measurement properly for ventilation rate and mean age of air calculations. It is found that, the Kaiser filter was the most applicable digital filter for pre-processing the tracer gas measurements. Although the IIR filters help to reduce the random noise in the data, they cause considerable changes to the filtered data, which is not desirable.

Analysis of Failutr Count Data Based on NHPP Models (NHPP모형에 기초한 고장 수 자료의 분석)

  • Kim, Seong-Hui;Jeong, Hyang-Suk;Kim, Yeong-Sun;Park, Jung-Yang
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.2
    • /
    • pp.395-400
    • /
    • 1997
  • An important quality characteristic of a software reliability.Software reliablilty growh models prvied the tools to evluate and moniter the reliabolty growth behavior of the sofwate during the testing phase Therefore failure data collected during the testing phase should be continmuosly analyzed on the basis of some selected software reliability growth models.For the cases where nonhomogeneous Poisson proxess models are the candiate models,we suggest Poisson regression model, which expresses the relationship between the expeted and actual failures counts in disjonint time intervals,for analyzing the failure count data.The weighted lest squares method is then used to-estimate the paramethers in the parameters in the model:The resulting estimators are equivalent to the maximum likelihood estimators. The method is illustrated by analyzing the failutr count data gathered from a large- scale switchong system.

  • PDF

A Study on Engineering Characteristics of Load Reducing Material EPS (도로성토하중경감재 EPS의 공학적 특성에 관한 연구)

  • Jang, Myeong-Sun;Cheon, Byeong-Sik;Im, Hae-Sik
    • Geotechnical Engineering
    • /
    • v.12 no.2
    • /
    • pp.59-70
    • /
    • 1996
  • The EPS has the unit weight of only 20~30kg/m3 and is used as one of the methods of reducing road embankment loads. Parts of it's applications are for backfill materials of structures like abutment, retaining wall, etc., to reduce horizontal earth pressure and for banking materials to secure the safety of settlement and bearing capacity by minimizing the stress Increment. However, the Korean Standards (KS) has not yet proposed any testing method for use of EPS as a engineering banking material. Only its testing and quality ordinance as a heat insulation material has been standardized. Therefore, in Korea, EPS is used as banking material without any systematic testing data as a civil engineering material. In this point of view, this paper deals with the engineering characteristics of EPS through many laboratory tests on strength, strain, absorption, and creep. from the results achived through tests, this paper proposes the enactment of a suitable quality testing ordinance and the criteria of unconfined design strength of EPS for use as engineering material.

  • PDF

An Analysis of the Relationship between Elevator Emergency(Entrapment) and Situation Measurement of Sag/Interruptions at the Switchboard

  • Kim, Gi-Hyun;Lim, Young-Bae;Bai, Seok-Myung;Kim, Jae-Chul;Lee, Hee-Tae
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.21 no.4
    • /
    • pp.36-44
    • /
    • 2007
  • This study researched the mutual relationship between elevator emergencies(sudden stops, entrapment) and power quality such as voltage interruptions or sags. To analyze the power quality at the elevator switchboard of an apartment, the on-line power quality(sag, swell, and interruption, etc.) were measured and error messages(under voltage error, etc.) in the elevator control room were researched. Also the performance and susceptibility for stops were evaluated and stated through the testing of three pieces of simulated test equipment with an EN12016(2004) standard setting and the magnitude and duration data of the measured sag or interruption. This paper will use these data in the analysis of the mutual relationship between power quality and elevator emergencies.