• Title/Summary/Keyword: vector data

Search Result 3,324, Processing Time 0.031 seconds

Ensemble Design of Machine Learning Technigues: Experimental Verification by Prediction of Drifter Trajectory (앙상블을 이용한 기계학습 기법의 설계: 뜰개 이동경로 예측을 통한 실험적 검증)

  • Lee, Chan-Jae;Kim, Yong-Hyuk
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.3
    • /
    • pp.57-67
    • /
    • 2018
  • The ensemble is a unified approach used for getting better performance by using multiple algorithms in machine learning. In this paper, we introduce boosting and bagging, which have been widely used in ensemble techniques, and design a method using support vector regression, radial basis function network, Gaussian process, and multilayer perceptron. In addition, our experiment was performed by adding a recurrent neural network and MOHID numerical model. The drifter data used for our experimental verification consist of 683 observations in seven regions. The performance of our ensemble technique is verified by comparison with four algorithms each. As verification, mean absolute error was adapted. The presented methods are based on ensemble models using bagging, boosting, and machine learning. The error rate was calculated by assigning the equal weight value and different weight value to each unit model in ensemble. The ensemble model using machine learning showed 61.7% improvement compared to the average of four machine learning technique.

Modeling of a Dynamic Membrane Filtration Process Using ANN and SVM to Predict the Permeate Flux (ANN 및 SVM을 사용하여 투과 유량을 예측하는 동적 막 여과 공정 모델링)

  • Soufyane Ladeg;Mohamed Moussaoui;Maamar Laidi;Nadji Moulai-Mostefa
    • Membrane Journal
    • /
    • v.33 no.1
    • /
    • pp.34-45
    • /
    • 2023
  • Two computational intelligence techniques namely artificial neural networks (ANN) and support vector machine (SVM) are employed to model the permeate flux based on seven input variables including time, transmembrane pressure, rotating velocity, the pore diameter of the membrane, dynamic viscosity, concentration and density of the feed fluid. The best-fit model was selected through the trial-error method and the two statistical parameters including the coefficient of determination (R2) and the average absolute relative deviation (AARD) between the experimental and predicted data. The obtained results reveal that the optimized ANN model can predict the permeate flux with R2 = 0.999 and AARD% = 2.245 versus the SVM model with R2 = 0.996 and AARD% = 4.09. Thus, the ANN model is found to predict the permeate flux with high accuracy in comparison to the SVM approach.

Automated Prioritization of Construction Project Requirements using Machine Learning and Fuzzy Logic System

  • Hassan, Fahad ul;Le, Tuyen;Le, Chau;Shrestha, K. Joseph
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.304-311
    • /
    • 2022
  • Construction inspection is a crucial stage that ensures that all contractual requirements of a construction project are verified. The construction inspection capabilities among state highway agencies have been greatly affected due to budget reduction. As a result, efficient inspection practices such as risk-based inspection are required to optimize the use of limited resources without compromising inspection quality. Automated prioritization of textual requirements according to their criticality would be extremely helpful since contractual requirements are typically presented in an unstructured natural language in voluminous text documents. The current study introduces a novel model for predicting the risk level of requirements using machine learning (ML) algorithms. The ML algorithms tested in this study included naïve Bayes, support vector machines, logistic regression, and random forest. The training data includes sequences of requirement texts which were labeled with risk levels (such as very low, low, medium, high, very high) using the fuzzy logic systems. The fuzzy model treats the three risk factors (severity, probability, detectability) as fuzzy input variables, and implements the fuzzy inference rules to determine the labels of requirements. The performance of the model was examined on labeled dataset created by fuzzy inference rules and three different membership functions. The developed requirement risk prediction model yielded a precision, recall, and f-score of 78.18%, 77.75%, and 75.82%, respectively. The proposed model is expected to provide construction inspectors with a means for the automated prioritization of voluminous requirements by their importance, thus help to maximize the effectiveness of inspection activities under resource constraints.

  • PDF

The Analysis of Export-led Growth in the U.S. Economy: An Application for Agricultural Exports by 50 States (미국 경제의 수출견인성장에 대한 분석: 50개 주(州)의 농산물 수출을 중심으로)

  • Kang, Hyunsoo
    • International Area Studies Review
    • /
    • v.15 no.1
    • /
    • pp.107-133
    • /
    • 2011
  • This paper aims to analyze the causal relationships between agricultural exports and economic growth in the U.S. economy by 50 states. Using the annual data from 1973 to 2007, the theoretical methodologies based on the export-led growth (ELG) model under the static model, the impulse response function (IRF) and forecast error variation decomposition (FEVD) under the vector autoregressive (VAR) model, and the Granger causality test. The results show the causal relationship between agricultural exports and economic growth at the states' level. Especially, the ELG hypothesis is strongly supported in the case of 16 states (HI, ID, KS, MD, MI, MN, NJ, NC, ND, OK, OR, RI, SD, TX, WA, and WI) and is also weakly supported in the case of 31 states. Therefore, the agricultural exports are important factor of developing in the U.S. economy, and furthermore some states (located in coastal area and breadbasket) indicate the strong evidence for agricultural exports-led growth.

Impact of shiitake mushroom (Lentinula edodes) spawn imports on fresh shiitake mushroom import volumes -Focus on the Korea-China FTA- (표고버섯 접종배지 수입이 신선 표고버섯 수입량 변화에 미친 영향 -한중 FTA를 중심으로-)

  • Byung-Heon Jung;Dong-Hyun Kim
    • Journal of Mushroom
    • /
    • v.21 no.4
    • /
    • pp.200-208
    • /
    • 2023
  • This study was conducted to investigate the reasons for the decreased importation of fresh Shiitake mushrooms into Korea after implementation of the Korea-China Free Trade Agreement (FTA). Monthly time-series data from January 2009 to December 2022 were analyzed using regression analysis and vector autoregression (VAR) models to determine the relationship between the amounts of fresh and spawn Shiitake mushrooms imported. The analysis revealed that a major reason for the decreased importation of fresh Shiitake mushrooms was an increase in mushroom spawn imports after Korea-China FTA implementation. The same results were obtained from the VAR model analysis. However, in terms of the dynamic changes in amount of fresh shiitake mushrooms imported, it was confirmed that the impact of the change in mushroom spawn imports could increase the amount of Shiitake mushrooms imported.

A Unicode based Deep Handwritten Character Recognition model for Telugu to English Language Translation

  • BV Subba Rao;J. Nageswara Rao;Bandi Vamsi;Venkata Nagaraju Thatha;Katta Subba Rao
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.2
    • /
    • pp.101-112
    • /
    • 2024
  • Telugu language is considered as fourth most used language in India especially in the regions of Andhra Pradesh, Telangana, Karnataka etc. In international recognized countries also, Telugu is widely growing spoken language. This language comprises of different dependent and independent vowels, consonants and digits. In this aspect, the enhancement of Telugu Handwritten Character Recognition (HCR) has not been propagated. HCR is a neural network technique of converting a documented image to edited text one which can be used for many other applications. This reduces time and effort without starting over from the beginning every time. In this work, a Unicode based Handwritten Character Recognition(U-HCR) is developed for translating the handwritten Telugu characters into English language. With the use of Centre of Gravity (CG) in our model we can easily divide a compound character into individual character with the help of Unicode values. For training this model, we have used both online and offline Telugu character datasets. To extract the features in the scanned image we used convolutional neural network along with Machine Learning classifiers like Random Forest and Support Vector Machine. Stochastic Gradient Descent (SGD), Root Mean Square Propagation (RMS-P) and Adaptative Moment Estimation (ADAM)optimizers are used in this work to enhance the performance of U-HCR and to reduce the loss function value. This loss value reduction can be possible with optimizers by using CNN. In both online and offline datasets, proposed model showed promising results by maintaining the accuracies with 90.28% for SGD, 96.97% for RMS-P and 93.57% for ADAM respectively.

Optimize KNN Algorithm for Cerebrospinal Fluid Cell Diseases

  • Soobia Saeed;Afnizanfaizal Abdullah;NZ Jhanjhi
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.2
    • /
    • pp.43-52
    • /
    • 2024
  • Medical imaginings assume a important part in the analysis of tumors and cerebrospinal fluid (CSF) leak. Magnetic resonance imaging (MRI) is an image segmentation technology, which shows an angular sectional perspective of the body which provides convenience to medical specialists to examine the patients. The images generated by MRI are detailed, which enable medical specialists to identify affected areas to help them diagnose disease. MRI imaging is usually a basic part of diagnostic and treatment. In this research, we propose new techniques using the 4D-MRI image segmentation process to detect the brain tumor in the skull. We identify the issues related to the quality of cerebrum disease images or CSF leakage (discover fluid inside the brain). The aim of this research is to construct a framework that can identify cancer-damaged areas to be isolated from non-tumor. We use 4D image light field segmentation, which is followed by MATLAB modeling techniques, and measure the size of brain-damaged cells deep inside CSF. Data is usually collected from the support vector machine (SVM) tool using MATLAB's included K-Nearest Neighbor (KNN) algorithm. We propose a 4D light field tool (LFT) modulation method that can be used for the light editing field application. Depending on the input of the user, an objective evaluation of each ray is evaluated using the KNN to maintain the 4D frequency (redundancy). These light fields' approaches can help increase the efficiency of device segmentation and light field composite pipeline editing, as they minimize boundary artefacts.

An Ensemble Classification of Mental Health in Malaysia related to the Covid-19 Pandemic using Social Media Sentiment Analysis

  • Nur 'Aisyah Binti Zakaria Adli;Muneer Ahmad;Norjihan Abdul Ghani;Sri Devi Ravana;Azah Anir Norman
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.2
    • /
    • pp.370-396
    • /
    • 2024
  • COVID-19 was declared a pandemic by the World Health Organization (WHO) on 30 January 2020. The lifestyle of people all over the world has changed since. In most cases, the pandemic has appeared to create severe mental disorders, anxieties, and depression among people. Mostly, the researchers have been conducting surveys to identify the impacts of the pandemic on the mental health of people. Despite the better quality, tailored, and more specific data that can be generated by surveys,social media offers great insights into revealing the impact of the pandemic on mental health. Since people feel connected on social media, thus, this study aims to get the people's sentiments about the pandemic related to mental issues. Word Cloud was used to visualize and identify the most frequent keywords related to COVID-19 and mental health disorders. This study employs Majority Voting Ensemble (MVE) classification and individual classifiers such as Naïve Bayes (NB), Support Vector Machine (SVM), and Logistic Regression (LR) to classify the sentiment through tweets. The tweets were classified into either positive, neutral, or negative using the Valence Aware Dictionary or sEntiment Reasoner (VADER). Confusion matrix and classification reports bestow the precision, recall, and F1-score in identifying the best algorithm for classifying the sentiments.

Estimating the tensile strength of geopolymer concrete using various machine learning algorithms

  • Danial Fakhri;Hamid Reza Nejati;Arsalan Mahmoodzadeh;Hamid Soltanian;Ehsan Taheri
    • Computers and Concrete
    • /
    • v.33 no.2
    • /
    • pp.175-193
    • /
    • 2024
  • Researchers have embarked on an active investigation into the feasibility of adopting alternative materials as a solution to the mounting environmental and economic challenges associated with traditional concrete-based construction materials, such as reinforced concrete. The examination of concrete's mechanical properties using laboratory methods is a complex, time-consuming, and costly endeavor. Consequently, the need for models that can overcome these drawbacks is urgent. Fortunately, the ever-increasing availability of data has paved the way for the utilization of machine learning methods, which can provide powerful, efficient, and cost-effective models. This study aims to explore the potential of twelve machine learning algorithms in predicting the tensile strength of geopolymer concrete (GPC) under various curing conditions. To fulfill this objective, 221 datasets, comprising tensile strength test results of GPC with diverse mix ratios and curing conditions, were employed. Additionally, a number of unseen datasets were used to assess the overall performance of the machine learning models. Through a comprehensive analysis of statistical indices and a comparison of the models' behavior with laboratory tests, it was determined that nearly all the models exhibited satisfactory potential in estimating the tensile strength of GPC. Nevertheless, the artificial neural networks and support vector regression models demonstrated the highest robustness. Both the laboratory tests and machine learning outcomes revealed that GPC composed of 30% fly ash and 70% ground granulated blast slag, mixed with 14 mol of NaOH, and cured in an oven at 300°F for 28 days exhibited superior tensile strength.

Generation of Induced Pluripotent Stem Cells from Lymphoblastoid Cell Lines by Electroporation of Episomal Vectors

  • Myunghyun Kim;Junmyeong Park;Sujin Kim;Dong Wook Han;Borami Shin;Hans Robert Scholer;Johnny Kim;Kee-Pyo Kim
    • International Journal of Stem Cells
    • /
    • v.16 no.1
    • /
    • pp.36-43
    • /
    • 2023
  • Background and Objectives: Lymphoblastoid cell lines (LCLs) deposited from disease-affected individuals could be a valuable donor cell source for generating disease-specific induced pluripotent stem cells (iPSCs). However, generation of iPSCs from the LCLs is still challenging, as yet no effective gene delivery strategy has been developed. Methods and Results: Here, we reveal an effective gene delivery method specifically for LCLs. We found that LCLs appear to be refractory toward retroviral and lentiviral transduction. Consequently, lentiviral and retroviral transduction of OCT4, SOX2, KFL4 and c-MYC into LCLs does not elicit iPSC colony formation. Interestingly, however we found that transfection of oriP/EBNA-1-based episomal vectors by electroporation is an efficient gene delivery system into LCLs, enabling iPSC generation from LCLs. These iPSCs expressed pluripotency makers (OCT4, NANOG, SSEA4, SALL4) and could form embryoid bodies. Conclusions: Our data show that electroporation is an effective gene delivery method with which LCLs can be efficiently reprogrammed into iPSCs.